
The main archive formats for web content are WARC, ZIM, Memento, and static HTML (e.g. from a tool like wget or SingleFile).

If you want one self-contained page per URL, I recommend SingleFile.

Lots more info here if you want to compare different software options: https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-...


