Archive.is blog

Anonymous asked:

Hi. I'm trying to use ArchiveIs in an Alfred workflow. It submits webpages via the archive now Python library. The problem is that during testing, I'm getting a lot of 503 errors from the server. Is there a rate limit? Should I identify as a specific user-agent? What am I missing?

Yes, it is most likely rate limiting: the 503 response is a page with a captcha.
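For a script like the one in the question, a common way to cope with 503 responses is to slow down and retry. Here is a minimal sketch, not an official client: `fetch` is a hypothetical callable that performs the submission and returns the HTTP status code, and the delay values are arbitrary. If the 503 page is a captcha, waiting may not be enough until the captcha is solved in a browser.

```python
import time
from typing import Callable


def submit_with_backoff(
    fetch: Callable[[], int],
    max_attempts: int = 5,
    base_delay: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call fetch() until it stops returning 503 or attempts run out.

    fetch is a hypothetical callable returning an HTTP status code.
    The wait doubles after every 503: base_delay, 2 * base_delay, ...
    Returns the last status code seen.
    """
    status = 503
    for attempt in range(max_attempts):
        status = fetch()
        if status != 503:
            return status
        # Do not sleep after the final attempt.
        if attempt < max_attempts - 1:
            sleep(base_delay * (2 ** attempt))
    return status
```

The `sleep` parameter exists only so the waiting can be replaced in tests; in normal use the default `time.sleep` is fine.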

Anonymous asked:

Hi, I am not familiar with programming, I read question 75363302879 (does-archive-is-have-api) but I don't really understand it. I want to ask if this idea I had is feasible: I want to write a program that saves bookmarks, and when adding a url, it will automatically run each url through archiveis as well. Is this allowed, do you limit use of your service? Or do I need to "self-host" whatever technologies that archiveis uses. Thanks.

Anonymous asked:

What's the domain name? The name everyone calls it is archive.is; when I visit archive.is the logo says archive.today, and when I click that it takes me to archive.vn. What's going on?

It is intentional.

No single domain is reliable, and I have no means to guarantee control over any one of them.

* archive.today - threatened with confiscation http://blog.archive.today/post/116913927371/the-domain-registrar-gransy-s-r-o-aka, also a troll attack caused service interruption https://blog.archive.today/post/138982909006/domain-problems-again
* archive.is - threatened with confiscation https://twitter.com/archiveis/status/1081276424781287427, asked not to use “archive.IS” for branding (that’s why you see “archive.TODAY” in the top-left corner, although many people remember it as “archive.IS” and still refer to it that way)
* archive.fo - threatened with confiscation https://twitter.com/archiveis/status/1188222460598116353
* archive.li - attacked by trolls impersonating police, which caused a few days of service interruption https://twitter.com/archiveis/status/956025540028268547
* archive.ec - attacked by trolls, which caused a service interruption, and finally lost https://twitter.com/archiveis/status/1093608363647291393
* archive.vn - ok so far
* archive.ph - ok so far
* archive.md - ok so far
* a nice domain unrelated to archive - one day whois started showing someone else’s information and the registrar did not respond; the domain was lost

Anonymous asked:

Thank you for everything, as always! Just wondering if there's any chance of eventually bypassing whatever mechanism instagram implemented to disallow archiving? Hope you are well!

They do not disallow.

The problem is that when many requests come from a single source IP address, Instagram redirects to a login page (and then limits the access rate per account instead of per IP).

The archiver just retries using another IP. That usually helps, but apparently it does not have enough different IPs to cope with the spikes when a batch of Instagram pages is submitted.
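The retry described above can be sketched roughly like this. It is an illustration only, not the archiver's actual code: `fetch_via` is a hypothetical callable that fetches a URL through a given outgoing address and returns the final URL after redirects, and the login check is an assumed heuristic.

```python
from typing import Callable, Iterable, Optional


def looks_like_login(final_url: str) -> bool:
    # Assumed heuristic: a redirect to the login page has this path fragment.
    return "/accounts/login" in final_url


def fetch_with_ip_rotation(
    url: str,
    source_ips: Iterable[str],
    fetch_via: Callable[[str, str], str],
) -> Optional[str]:
    """Try each outgoing IP in turn until one is not sent to the login page.

    fetch_via(url, ip) is a hypothetical callable returning the final URL
    after redirects. Returns the IP that worked, or None when the whole
    pool was redirected to login (the "not enough IPs" case).
    """
    for ip in source_ips:
        final_url = fetch_via(url, ip)
        if not looks_like_login(final_url):
            return ip
    return None
```

A `None` result corresponds to the spike situation: every address in the pool has already hit the per-IP limit.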

Anonymous asked:

Hello, on some websites there are cookie opt-in pop-up windows, which hide the content underneath. Can I skip this pop-up to archive the website? Example: the websites of jimdofree or jimdo (links cannot be entered here). Regards, Daniel

Fixed

Anonymous asked:

How did you change back to the old Twitter layout? I'd thought they disabled it at the start of June, so I was pleasantly surprised to see my archives from earlier today show up with Twitter's old interface.

They show the old interface to Googlebot, so a fake User-Agent does the trick.
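To illustrate the idea, here is a minimal sketch of sending a request with a Googlebot-style User-Agent using Python's standard library. The header string is the commonly published Googlebot one and is an assumption here; it is not necessarily the exact value the archiver sends, and the site may change its behaviour at any time.

```python
import urllib.request

# Assumed value: the commonly published Googlebot User-Agent string.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def build_request(url: str, user_agent: str = GOOGLEBOT_UA) -> urllib.request.Request:
    """Build (but do not send) a request carrying a custom User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})
```

The request can then be sent with `urllib.request.urlopen(build_request(url))`.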

Anonymous asked:

tor v2 hidden services will be deprecated in a year or two. are you planning on upgrading to a v3 hidden service, and if so, when? thank you for your service :)

It already works at

archiveiya74codqgiixo33q62qlrqtkgmcitqx5u2oeqnmn5bpcbiyd.onion

Anonymous asked:

I tried to donate from Japan more than 8 times with various settings and methods, but all attempts were rejected.

Anonymous asked:

Facebook pages aren't archiving properly. I used to be able to archive and it would bypass the login. Now when I archive pages I either get the public page or a 'Page Not Found' which usually indicates not being logged in

It requires many new Facebook accounts.

They are not free (at the very least, the phone numbers cost money).

That makes bypassing the Facebook login a premium feature that someone has to sponsor (I would not, simply because I do not need the feature myself).

Anonymous asked:

There is an option to report archives for "Government Criticism." Please elaborate on exactly what this means.

The whole list is a copy-paste from another website (I forgot which one; probably GitHub); no item has any special implied meaning.

Anonymous asked:

Hi, is it possible to archive a facebook video from a link?

No, videos are not archived.

Try something like youtube-dl (despite the name, it supports almost all video websites).
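If you want to call it from a script, a minimal sketch could look like the following. It assumes the `youtube-dl` executable is installed and on your PATH; the output filename template is just an example choice, and whether a particular Facebook video downloads depends on youtube-dl, not on the archive.

```python
import subprocess
from typing import List


def youtube_dl_command(video_url: str, output_template: str = "%(title)s.%(ext)s") -> List[str]:
    """Build the argument list for a youtube-dl download."""
    return ["youtube-dl", "-o", output_template, video_url]


def download(video_url: str) -> None:
    # Requires the youtube-dl executable on PATH.
    # Raises subprocess.CalledProcessError if the download fails.
    subprocess.run(youtube_dl_command(video_url), check=True)
```

Calling `download(...)` with the video page link saves the file into the current directory.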

Anonymous asked:

For the last two days, the archiving of images in tweets with the "This media may contain sensitive material. Learn more" label no longer works. Now, only the tweet text is archived, rather than the images as well.

Examples?

Anonymous asked:

Can I donate to you with BAT?

Technically, yes.

I have transferred the domains to the account of a friend who has a verified Uphold account, but we have not received any BAT yet (there is a one-month hold).

Compared to LiberaPay and PayPal, it is still an untested channel.

Anonymous asked:

The "download zip" button has been giving a "Not found" error for quite some time.

Yes, it has been broken since December 2019. Since then the archiver has worked differently, and although that improves the quality of the pages, the internal format (how pages are stored) has moved far from the good old HTML+image files, so producing .zip files on demand requires additional coding. I hope to fix that in a few weeks.

Anonymous asked:

You know how when you archive a Google cache page it shows the "Saved From" & "Original" links, and allows you to search for the archive with the original link? Well archiving from Bing and Yandex cache shows the saved from & original links, but doesn't allow you to search by the original link, only the "Saved From" cache url. Can you fix this please?

Could you provide a specific example?

gfair asked:

Why is everyone so hostile and angry in their questions? Was it always like this?

I think it is ok. The style of bug reports in the open-source world is very similar.

Anonymous asked:

Why the hell should I use you or donate if I have to go through dozens of captchas for having used TOR? Are you spying assed Google or Apple or NSA?

You do not have to use it. The fewer people use the site, the faster it works and the lower my expenses are. So you could donate by not using it.

As for Tor: archivecaslytosk.onion is captcha-less; the captcha appears only when the regular websites are accessed via Tor exit nodes (which are apparently on many blacklists, including reCAPTCHA's).

Anonymous asked:

How much does it cost to host this site per day?

Under $100