Archive.is blog

Blog of http://archive.is/ project
  • ask me anything
  • rss
  • archive
  • What percentage of 5-char-codes is used now? Full capacity is (10+26+26)^5 is above 900 millions, it is correct number?

    Anonymous

    A bit less than a half. Afterwards the codes will be 6-char, 7-char, …

    • 4 days ago
  • What is time zone that is displayed for snapshots? Thank you

    Anonymous

    UTC

    • 5 days ago
  • Is there a reason why there is so much traffic from Japan to this site?

    Anonymous

    Maybe because it has localized version (there are only three more or less good ones: Japanese, Korean and Polish), or just popularity in certain communities.

    • 1 week ago
  • When archiving "wayback vefsafn is" the carrusel of dates appears at the beginning (in "web archive org" that doesn't happen). Example: pSP7g. Maybe it can be removed.

    jakeukalane

    yes

    • 1 week ago
  • Why is access blocked in /t77dr? Can you remove the block? Thanks in advance

    Anonymous

    yes

    • 1 week ago
  • Thanks for removing the pop-up window and display full content. Can you make the links within the archived page (especially "THIS REVIEW IS HELPFUL" and "Flag as inappropriate") still hoverable and clickable? See the original pages for details. ysKuG, UmDJf, 5esIa.

    Anonymous

    No, JavaScript is not stored. It’s hard to allow it to run correctly on snapshots.

    • 1 week ago
    • 1 notes
  • Could you fix GameSpot GameFAQ archives? For example: P6r0D and 3mj5i

    Anonymous

    yes

    • 1 week ago
  • Could you incorporate as a charity so that people can make tax deductible donations to you?

    Anonymous

    In which country?

    • 1 week ago
  • Is there any structure in place to assure the continuity of this project? I have seen you answer to a similar question that you "can leave a will". I am genuinely concerned about the loss of data archived here, were you - God forbid - to be hit by a bus or something.

    Anonymous

    I had an email provided by my mobile operator. I considered it very reliable - even if it was hacked, I had a paper contract, I could visit their office to restore.  So I used it in critical places: in contracts, as an email to register on all sorts of utility sites, as an email to restore access to various services. After 10+ years, the mobile operator decided it wasn’t their business and shut down the email service.

    What structure in place could assure the continuity?

    I’m not even asking if there was one in that case (and if there is one in GMail), I just don’t understand what could be such a structure to assure continuity of any service. What should I have demanded to avoid data loss?

    Archive.Today was born precisely out of an understanding of the fragility of services from which it itself is not immune. And this is not a project to transmit information to distant descendants, it is a project to let latecomers be witnesses. Having the magic to assure continuity, I wouldn’t need to invent such weak tools.

    • 1 week ago
    • 4 notes
  • What is your perspective on creating a new page for "trending" and "most popular" snapshots? Or is that something that can lead to disasters?

    Anonymous

    It can be used as a news stream (that’s where I get my news), but there’s a lot of garbage too. Also, trolls can promote disgusting pictures to the top.

    • 1 week ago
    • 1 notes
  • If you run out of money, will Wayback Machine take over the backed up pages? It would suck if they all just disappeared. Big fan of service BTW

    Anonymous

    I can leave a will, but how can you be sure that they will be happy to receive it and that they will dispose of it in such a way that you will like it?

    • 1 week ago
  • Question - How the heck does this archive retrieve saved pages from many years ago so quickly? Is there some sort of CDN being used?

    Anonymous

    No, there is no special optimization, simply because copying large amounts (e.g. to CDN edges) would take weeks or months. But. It just so happens that older archives are now running on more powerful servers, and there are fewer requests for them than for newer ones. So I’m getting both complaints about the site being too slow and wondering how it works so fast at the same time :)

    • 1 week ago
  • Can the pop-up window be removed? Also can the 'Read More' buttons be clickable to expand the text content? ysKuG, UmDJf, 5esIa.

    Anonymous

    yes, fixed locally, the fix will be deployed in few hours

    • 1 week ago
  • If you ever get another Instagram/Facebook login credential, you may want to limit the number of exit IPs used by crawlers to just one, so that it looks less suspicious. That might be the reason why the accounts created in past might have gotten banned so quickly.

    Anonymous

    It is so since long time ago (and still so for Linkedin, DeviantArt, VK, OK, … and other sites which do eventually ban but are not so paranoid as FB last years).

    There is multi-exit VPN using patched Wireguard, which control exit IPs for many websites (it is not only for accounts: many US local media need US IP, etc). This could be an interesting product itself: to avoid seeing “this website is not available in your region“, and yes, to protect accounts from being banned when you are traveling.

    An analysis of recent bans of my accounts has convinced me that blocking occurs after visiting questionable pages in different languages from the same account. Apparently, FB algorithms believe that if a normal person reads fake news, then only in one language. Interest in such pages in different languages (from German to Marathi) can lead to the classification of the user as a data journalist or similar undesirable visitor who “do not follow community guidelines“ as they say

    • 2 weeks ago
  • If you find the time, could you please fix the body of text on 'EOZ73'? The older versions of The Washington Post seem to have this problem. Thank you kindly.

    Anonymous

    yes, it seems that WaPo has to be added to the list of slow websites which require pressing F5 after the load. There is more than one snapshot similar to this. I’ll fix it in a few hours (there is compilation is in progress).

    • 2 weeks ago
    • 1 notes