Archive.is blog

Blog of http://archive.is/ project
  • ask me anything
  • rss
  • archive
  • Saving web page seem broken, I tried to save 5 different pages and it return a "blank page" as saved page. Moreover when I look to see ALL archived pages for this (www). (sciencedaily). (com) all recent page most of them are blank page :(
    Anonymous

    There is a problem with saving pages from sites behind of Incapsula (a DDOS-protection CDN similar to Cloudflare), such as sciencedaily.com, offshoreleaks.icij.org, monsanto.com, …

    They do not ask for CAPTCHA, they just return a blank page for half of requests and even retrying via proxy does not help.

    I will investigate it further.

    • 1 day ago
  • Pages at NASA(gov) are not being archived by your service. The pages turn up blank, yet archive (org) is able to preserve them completely. I prefer the archive(is) service, and would like to know if y'all can look into how your service can archive NASA(gov) pages. Thank you very much! Examples of pages being blank: archive(is)/TMOnl archive(is)/yj70L
    Anonymous

    I will see, thank you for the report.

    • 4 days ago
  • Google Play Store pages have an expandable description, but right now archiveis doesn't capture the full description, like archiveis/3C6cB would it be possible to archive the full description?
    Anonymous

    ok, fixed.

    • 4 days ago
  • Facebook has blocked your account again, please change it ?, because your archiving system is amazing ?, if not consider open sourcing the systems source code ? :) would help alot XD
    Anonymous

    ok

    • 1 month ago
    • 1 notes
  • Why are URLs beginning with web-archive-org/save/ or web-archive-org/record/ invalid?
    Anonymous

    When such urls are requested, web.archive.org starts saving a page, and archive.is starts saving how web.archive.org is saving; 

    it most cases such race results in pages saved very badly: http://archive.is/xYpxk

    • 1 month ago
  • I have received several bug reports about archive.is saving empty or 404 pages from Google Cache although there expected to be some content.
    It seems that there is more than one Google Cache, and what you get depends not only on the URL but also on which one of the Google datacenters serves you request.

    Examples of pages saved via different proxies:
    http://archive.is/https://webcache.googleusercontent.com/search?q=cache:_PVt8WPb4DEJ:*
    http://archive.is/https://webcache.googleusercontent.com/search?q=cache:CO15sF9zSrQJ:*

    I think, the archive should perform few requests simultaneously and then save all successful versions.

    • 2 months ago
  • All of 8chan's archives are down. Is there a way to bring this back up?
    Anonymous

    There are too much snapshots from 8ch.net and media.8ch.net with child porn.

    I see that blocking the whole 8ch.net is not a good solution, but I
    cannot review all the snapshots manually.

    Any ideas how to separate pages with CP from the rest of 8ch content?

    • 2 months ago
    • 22 notes
  • Hi there, I asked about YouTube earlier today. After seeing your reply, I resaved the same URL but the comments are still not expanding. Am I doing something wrong? Here is the archive link... (can't post the link). Thank you :)
    Anonymous

    Please email me the link, I will have a look.

    • 2 months ago
  • Hi there. Lately, archived comments posted under YouTube videos are not being expanded to show all comments. I see from your replies to other posts here that they should so I guess YouTube have changed something. Could you please look into this? Thank you for this excellent and helpful tool.
    Anonymous

    Aren’t  they?

    I just checked the last snapshot saved from youtube (http://archive.is/hJu9a) and see the comments expanded.

    • 2 months ago
  • Hey, archiving tumblr posts, can you hit "show more notes" at the bottom of the post repeatedly
    Anonymous

    Sure. Compare http://archive.is/0HIzc (before fix) and http://archive.is/PX1P3 (after fix).

    • 2 months ago
Next page
  • Page 1 / 26