Wow, all the US Department of State files have just gone missing from archive.or...

benyami · on May 18, 2015

Check out the Internet Archive FAQ on how to remove a document from their archives. https://archive.org/about/exclude.php

It looks like they used robots.txt to do that.

neil_s · on May 18, 2015

Huh, so the wild-card user-agent will block not just searchbots, but also archivebots. Wonder how OP managed to get screenshots of archive.org having archives available for those documents.

kjell · on May 18, 2015

They're there, at least the two I looked at.

https://web.archive.org/web/20130413152316/http://www.state....

Each line is missing `/documents` in the snippet of the `robots.txt`

maxmcd · on May 18, 2015

I have been able to view multiple pdfs and view the page screenshotted by the author.