Archive.is blog

Blog of http://archive.is/ project
  • Added “download .zip” function

    • 1 day ago
  • You have typo in FAQ: “It there any limit on the page size ?” should be “Is there (…)”. kthx.
    Anonymous

    Thank you

    • 1 day ago
  • What is the name of the archive.is robot?
    Anonymous

    It used to be “archive.is bot”.

    It now identifies itself as a regular browser, because some popular sites (Instagram, to name one) try to detect whether a page was requested by a bot or a human, and in the former case may serve an ugly version of the page that is optimized for bots, not for people.

    • 1 week ago
  • Is it possible to get the source code of this tool?
    Anonymous

    No, it is more a set of hacks than a project.

    But you can find similar open source projects, for example https://github.com/gildas-lormeau/SingleFile/

    • 1 week ago
  • What software is used to make the actual snapshot? (I wrote a personal version of this which uses wget -p -etc, which is less than ideal on js-heavy pages)
    Anonymous

    http://phantomjs.org/ with some patches.

    • 1 week ago
  • Can the archived pages be downloaded for local use on our computers? Will you be releasing the software that you use for archival?
    Anonymous

    1. In the browser’s menu: File -> Save As -> Complete page.

    Anyway, adding something like “download as .zip” can make sense, for example for mobile users who do not have full-featured browsers. I will add it.

    2. I think not. It is very tricky to run: it depends on an exact version of Chrome, whose binary must also be patched to relax its security restrictions (to allow saving the content of frames, etc.).

    • 3 weeks ago
  • I tried to archive some pages yesterday and today (2013-03-16), but I always got: "Error: Network error." It seems to be the same with different target sites. I had also tried with addresses targeting my own server here and had the same effect, while I and others could reach my server that way. Looking at my logs, I did not even find an attempt to connect to my server from this site (archive is). What is wrong?
    Anonymous

    Thank you for reporting, the problem was on my side.

    Should be fixed now.

     

    • 1 month ago
  • Hey! It's a great resource. Is there any possibility that my bookmark will be removed? It's not porn or anything, but I'm not sure about copyrights :)
    Anonymous

    To increase reliability and be more confident, you can submit your link to all the archiving sites, not only to http://archive.is/ (there are also http://peeep.us/, http://webcitation.org/, http://hiyo.jp/, and http://megalodon.jp/).

    • 1 month ago
  • Can you recommend the best method/script so I may batch archive about 7000 urls?
    Anonymous

    Something like:

    curl --data url=http://url-to-submit.com/ http://archive.is/submit/

    Please note that it may take up to 1 hour to process 7000 URLs (from the time you submit them until they are visible on the site).
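    For a batch of that size, the single curl call can be wrapped in a loop. The following is only a rough sketch: it assumes the URLs sit in a text file, one per line, and that the submit endpoint and the `url` form field still behave as in the command quoted in this answer. The names `submit_urls`, `SUBMIT_ENDPOINT`, `CURL_CMD` and `DELAY_SECONDS` are made up for illustration and are not part of the service.

```shell
#!/bin/sh
# Illustrative settings (assumptions, not documented by archive.is).
SUBMIT_ENDPOINT="${SUBMIT_ENDPOINT:-http://archive.is/submit/}"
CURL_CMD="${CURL_CMD:-curl}"           # can be overridden, e.g. for a dry run
DELAY_SECONDS="${DELAY_SECONDS:-1}"    # pause between requests to be polite

submit_urls() {
    # $1: path to a text file with one URL per line
    while IFS= read -r url || [ -n "$url" ]; do
        # Skip blank lines and lines starting with '#'
        case "$url" in
            ''|'#'*) continue ;;
        esac
        # --data-urlencode escapes characters such as '&' and '?'
        # that would otherwise break the form field.
        "$CURL_CMD" --silent --show-error --output /dev/null \
            --data-urlencode "url=$url" "$SUBMIT_ENDPOINT" < /dev/null \
            || echo "failed: $url" >&2
        sleep "$DELAY_SECONDS"
    done < "$1"
}
```

    Calling `submit_urls urls.txt` would then post each line in turn. Setting `CURL_CMD=echo` first prints the curl arguments without sending anything, which is a cheap way to check the list before submitting all 7000 entries.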

    • 1 month ago
  • Does archive have plans for an API? Just curious. =)
    newhopegriffin

    What kind of API do you need? 

    • 1 month ago
© 2012–2013 Archive.is blog