(cache) Archive.is blog

Added “download .zip” function

1 day ago

You have typo in FAQ: “It there any limit on the page size ?” should be “Is there (…)”. kthx.

Anonymous

Thank you

1 day ago

what is the name of the archive is robot?

Anonymous

It used to be “archive.is bot”.

Now it impersonates as a regular browser, because some popular sites (instagram to name one) try to detect if the page was requested by bot or human and in the former case can show the ugly version of page which is optimized for bots, not for people.

1 week ago

Is it possible to get the source code of this tool?

Anonymous

No, it is more a set of hacks than a project.

But you can find similar open source projects, for example https://github.com/gildas-lormeau/SingleFile/

1 week ago

What software is used to make the actual snapshot? (I wrote a personal version of this which uses wget -p -etc, which is less than ideal on js-heavy pages)

Anonymous

http://phantomjs.org/ with some patches.

1 week ago

Can the archived pages be downloaded for local use on our computers? Will you be releasing the software that you use for archival?

Anonymous

1. In browser’s menu: File -> Save As -> Compele page.

Anyway, adding something like “download as .zip” can make sense, for example for mobile users which do not have full featured browsers. I will add it.

2. I think, no. It is very tricky to run, it depends on an exact version of Chrome, which binary also must be patched in order to reduce security (to allow saving content of frames, etc).

3 weeks ago

I tried to archive some pages yesterday and today (2013-03-16), but I always got: "Error: Network error." It seems to be the same with different target sites. I had also tried with addresses targeting my own server here and had the same effect, while I and others could reach my server that way. Looking at my logs, I did not even find an attempt to connect to my server from this site (archive is). What is wrong?

Anonymous

Thank you for reporting, the problem was on my side.

Should be fixed now.

1 month ago

Hey! It's a great resourse. Is there any possibility, that my bookmark will be removed? It's not porn or anything, but i'm not sure about copyrights :)

Anonymous

To increase reliability and be more confident, you can put your link to all the archiving sites, not only to http://archive.is/ (there are also http://peeep.us/ http://webcitation.org/ http://hiyo.jp/ http://megalodon.jp/ ).

1 month ago

Can you recommend the best method/script so I may batch archive about 7000 urls?

Anonymous

something like

curl —data url=http://url-to-submit.com/ http://archive.is/submit/

Please note, that it may take up to 1 hour to process 7000 urls (after you submit them and before they will be visible on the site).

1 month ago

Does archive have plans for an API? Just curious. =)

newhopegriffin

What kind of API do you need?

1 month ago