For years I've looked, on and off, for web archiving software that can capture most sites, including "complex" ones that use lots of AJAX and require logins, like Reddit. Which ones have worked best for you?
Ideally I want one that can be started programmatically or from the command line, that opens a Chromium instance (or any browser), and that captures everything shown on the page. I could also open the instance myself, log into sites, and install add-ons like uBlock Origin. (By the way, archiveweb.page must be started manually.)
https://www.httrack.com/ may not be exactly what you want, but it fits much of what you describe.
I haven’t tried it, but there does seem to be a way to capture “logged in” websites:
https://stackoverflow.com/questions/20362821/httrack-possible-using-cookies#58354077
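A rough, untested sketch of how that could be scripted, assuming (as the linked answer suggests) that HTTrack picks up a Netscape-format `cookies.txt` placed in the project folder. The helper names, the example cookie, and the folder layout are all made up for illustration; only `httrack <url> -O <dir>` is the standard invocation.

```python
from pathlib import Path


def netscape_cookie_line(domain, name, value, path="/", secure=False, expires=0):
    """Format one cookie as a tab-separated Netscape cookies.txt line."""
    # A leading dot on the domain conventionally means "also valid for subdomains".
    include_subdomains = "TRUE" if domain.startswith(".") else "FALSE"
    fields = [
        domain,
        include_subdomains,
        path,
        "TRUE" if secure else "FALSE",
        str(expires),  # Unix timestamp; 0 is commonly read as a session cookie
        name,
        value,
    ]
    return "\t".join(fields)


def build_httrack_command(url, project_dir):
    """Return the argument list for mirroring `url` into `project_dir`."""
    return ["httrack", url, "-O", str(project_dir)]


def prepare_project(url, project_dir, cookies):
    """Write cookies.txt into the project folder and return the command to run.

    Assumption: HTTrack reads cookies.txt from the project folder.
    `cookies` is a list of dicts matching netscape_cookie_line's parameters,
    e.g. values copied from a browser session where you are logged in.
    """
    project_dir = Path(project_dir)
    project_dir.mkdir(parents=True, exist_ok=True)
    lines = ["# Netscape HTTP Cookie File"]
    lines += [netscape_cookie_line(**cookie) for cookie in cookies]
    (project_dir / "cookies.txt").write_text("\n".join(lines) + "\n")
    return build_httrack_command(url, project_dir)


if __name__ == "__main__":
    # Hypothetical cookie; replace with real values exported from your browser.
    cmd = prepare_project(
        "https://example.com/",
        "mirror",
        [{"domain": ".example.com", "name": "session", "value": "REPLACE_ME"}],
    )
    print(" ".join(cmd))
    # To actually start the mirror: subprocess.run(cmd, check=True)
```

This only prepares the folder and prints the command, so you can check `cookies.txt` before running anything. Whether it gets past a given site's login depends on the site; I'd expect it to struggle with heavily scripted pages.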