Archive.is blog

Can you give an option to download webpages in .7z? It is much more efficient than zip.

Anonymous

Each archive contains only one file that compresses well (the HTML). The rest are PNG and JPEG images, which are already compressed, so the archiver stores them untouched. The choice of archiver would therefore not significantly affect the resulting size of the archives.

Also, many new unpackers (7z, rar, …) are able to unpack zip-files, but not the other way around.
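
A rough way to see the point about already-compressed files is the sketch below. It is only an illustration under stated assumptions: repetitive markup stands in for the HTML file, random bytes stand in for PNG/JPEG data (which behaves much like random data to a compressor), DEFLATE via `zlib` stands in for zip, and `lzma` stands in for 7z. The helper name `compressed_sizes` is made up for this example and has nothing to do with how Archive.is builds its archives.

```python
import lzma
import os
import zlib


def compressed_sizes(data: bytes) -> dict:
    """Return the raw size of `data` and its size after DEFLATE and LZMA."""
    return {
        "raw": len(data),
        # DEFLATE is the method zip archives normally use.
        "deflate": len(zlib.compress(data, 9)),
        # LZMA is the method 7z archives normally use.
        "lzma": len(lzma.compress(data)),
    }


# Assumption: repetitive markup stands in for the snapshot's HTML file.
html_like = b"<div class='post'><p>Hello, archive!</p></div>\n" * 2000

# Assumption: random bytes stand in for already-compressed image data.
image_like = os.urandom(len(html_like))

for name, data in (("html-like", html_like), ("image-like", image_like)):
    print(name, compressed_sizes(data))
```

With either method the markup shrinks to a small fraction of its size, while the image-like data does not shrink at all, so switching from zip to 7z would only help with the one HTML file.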

4 days ago

I noticed that you support a download link for pages that have been archived. Would it be possible to support downloading a page in WARC format?

Anonymous

It is possible but I am afraid it would not add the value you expect from WARC.

Archive.is snapshots are not the result of a crawl; they are snapshots of the internal browser state.

So there is almost no metadata, and even the original URLs of images are not stored. Moreover, some of the images were never downloaded at all but were produced by rendering complex WebKit-specific CSS rules, so that the snapshot is simpler and depends less on the user's browser.

2 weeks ago

Can newspaper articles from behind a paywall be archived?

Anonymous

Only those that either have “happy hours” of free access, or offer registration-free access to all articles but limit the number of articles you can read per day or per month.

Those that always show “enter your credit card” instead of the article: definitely not.

2 weeks ago

Hi, the following hashbang URL is not working: archive.is/RcaO0. Won't the webpage capture automatically go to the section concerned? Thanks.

Anonymous

This is a bug. Thank you for reporting it!

3 weeks ago

For how long will this website and the archives be available? How many people maintain this project? Thank you.

Anonymous

Forever. Actually, I think that in 3 to 10 years all the content of the archive (it is only ~20 TB) could fit in the memory of a mobile phone, so anyone will be able to keep a synchronized copy of the full archive. So my “forever” is not a joke.

Two people, currently.

Webcite has a comb feature where we can archive the links on a specific page. This comes in handy for some of my research papers. Is there an equivalent way to do this with archive.is?

meeedeee

Not yet.

You are the first person to ask for this :)

1 month ago

Can I create a user account? I want to know which URLs I have saved.

Anonymous

No.

You can create a collection of your archived pages on http://delicious.com/ or http://pinterest.com/

1 month ago

I accidentally denied permission for my microphone and camera. I need the permission to be enabled so that I can use webcams.

Anonymous

You need to go to the original website to use the page's multimedia.

1 month ago

Your website has a list of personally identifiable information (PII), including credit card info. Who do I send legal correspondence to?

Anonymous

You should contact the issuing bank to make sure they have blocked the cards. The bank can be found by the prefix of the card number (http://en.wikipedia.org/wiki/List_of_Issuer_Identification_Numbers).
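
For readers unsure what "the prefix of the card number" means, here is a minimal sketch. It assumes the traditional six-digit issuer identification number at the start of the card number; the function name `issuer_prefix` and the sample number are made up for illustration, and the actual bank lookup is left to the list linked above.

```python
def issuer_prefix(card_number: str, length: int = 6) -> str:
    """Return the leading digits of a card number, ignoring spaces and dashes.

    Assumption: the issuer prefix is the first `length` digits (six by default).
    """
    digits = "".join(ch for ch in card_number if ch.isdigit())
    if len(digits) < length:
        raise ValueError("card number is too short to contain an issuer prefix")
    return digits[:length]


# A dummy number for illustration only, not a real card.
print(issuer_prefix("4111 1111 1111 1111"))  # prints 411111
```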

1 month ago

Where is this information saved? How can I retrieve a page that I have saved from my system?

Anonymous

You can download a .zip file (there is a link in the header).

1 month ago