Archive.is blog

Blog of http://archive.is/ project
  • Hi there! /tl1pG is not being captured properly. Thank you!

    Anonymous

    Fixed

    Sorry for the delay: Tumblr questions have not been forwarded to email since April, and I only just spotted it.

    • 1 year ago
    • 9 notes
  • Is there any way you could make an Android app that I could just share the URL to in the sharing menu so it goes right there?

    cnlson


    This: https://play.google.com/store/apps/details?id=com.navasgroup.share2archive

    • 1 year ago
    • 7 notes
  • bleacherreport articles are showing up blank (screenshot works but the actual webpage archive is blank). Could you take a look at this? Thanks!

    e.g. bleacherreport*com/articles/2583445-nba-opening-night-gets-the-hotline-bling-drake-music-video-treatment

    Anonymous

    I made a fix for future bleacherreport saves, but this one seems unrecoverable (React often clears the whole page on a JavaScript error, and that is what was happening on bleacherreport).

    • 1 year ago
    • 1 notes
  • Did you decide to stop allowing the archival of Mature Labelled content from Tumblr? Thanks for your time!

    Anonymous

    No, I did not. Any examples of broken pages?

    • 2 years ago
    • 5 notes
  • Hi, thanks for providing this fabulous archive service! In https://blog.archive.today/post/688077534761566208, you said that the user’s IP address has not been sent to websites since 2019. Could you provide an option to send the IP address in order to capture localized content? And some websites may only be reachable in certain regions… :-(

    Anonymous

    Websites no longer look at `X-Forwarded-For` to determine the user's region, so I have to use per-website proxies to get localized content and avoid geo-blocks.

    It is not 100% correct though, so feel free to report a bug if you spot one.

    • 2 years ago
  • Can you expand the text at "See More" on QreQE ?

    Anonymous

    yes

    • 2 years ago
    • 2 notes
  • Did something change over the past few days such that the site no longer functions through TOR?

    Anonymous

    It works.

    Let me guess: if you copy-pasted archiveiya74codqgiixo33q62qlrqtkgmcitqx5u2oeqnmn5bpcbiyd.onion from wikipedia, it won’t work because it contains a zero-width space character inside.

    • 2 years ago
    • 3 notes
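The copy-paste problem described above can be checked for mechanically. Below is a minimal sketch, assuming the stray character belongs to Unicode category `Cf` (format characters, which includes U+200B ZERO WIDTH SPACE); the function names and the sample address are hypothetical and not part of the archive's own code.

```python
import unicodedata


def find_invisible(text):
    """Return (index, code point) pairs for invisible format characters."""
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if unicodedata.category(ch) == "Cf"  # e.g. U+200B, U+200C, U+FEFF
    ]


def strip_invisible(text):
    """Drop every character in Unicode category Cf from the text."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Cf")


# Hypothetical pasted address with a zero-width space before the dot.
pasted = "example\u200b.onion"
print(find_invisible(pasted))   # positions of hidden characters
print(strip_invisible(pasted))  # cleaned address
```

Running the pasted address through `strip_invisible` before using it avoids a lookup that fails for no visible reason.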
  • Hi, could you have a look why pages from steamcommunity won't archive? Here's an example, the page "The updated Steam Mobile App is now available" is stuck in submit. Thanks for the great work and site!

    Anonymous

    Could you tell the url of the page? I see no failed pages from steamcommunity

    • 2 years ago
    • 1 notes
  • please add support for showing sensitive content from mastodon screenshots.

    Anonymous

    It is supported.

    However, Mastodon is not detected automatically: there is a list of domains, which may be incomplete. Just mail me wherever you see it is not supported yet.

    • 2 years ago
    • 1 notes
  • Pretty please remove the recent restrictions for archiving Twitter. It's practically useless for archiving Twitter now. I just archived someone's media page and it only captured their first 4 tweets.

    Anonymous

    I know, it needs to be remade almost from scratch because of the changes on Twitter's side. The old code resulted in a loading spinner and no content at all.

    • 2 years ago
    • 1 notes
  • I archived a website but the sidebar from the right side is in the middle of the page. How do I fix that?

    Anonymous

    where?

    • 2 years ago
    • 1 notes
  • Z7gaQ and 09Vbx are stuck on the loading screens, however the thumbnails for these archives show past the loading screens

    Anonymous

    yes, fixed.

    thank you for the report!

    • 2 years ago
    • 1 notes
  • "It needs accounts, otherwise instagram redirects to the login page or shows a fake 404 page. Accounts do not live long." I've entered the full Instagram URL, and all I get is "Not Found (yet?)". What does this mean? The Instagram account I entered is public, not private.

    Anonymous

    This means it has tried 10 times and given up. You didn’t specify an account for the archiver to log in to Instagram; that is just not implemented.

    • 2 years ago
    • 1 notes
  • why isn't instagram archive working?

    Anonymous

    It needs accounts, otherwise instagram redirects to the login page or shows a fake 404 page. Accounts do not live long.

    • 2 years ago
    • 3 notes
  • Spoilers on the War Thunder forum (and google caches of it) don't get expanded. Could this be fixed?

    Anonymous

    sure

    • 2 years ago
    • 1 notes
  • Some pages don't display images correctly (ie, like archive 'wR3Ar'). Could you fix it?

    Anonymous

    Those are not missing images but missing ad blocks.

    • 2 years ago
    • 1 notes
  • I notice on the /wip pages when I archive something from a news site, most of the time is spent loading various trackers, assets that are never displayed like videos, etc. Why not only load what's needed to render content? Not commenting on the state of the web here, just the performance of the archiver.

    Anonymous

    Known trackers are skipped (their lines are gray instead of green).

    • 2 years ago
    • 2 notes
  • Why can't Instagram pages be archived anymore? I try to save one and then its like "Sorry this webpage is not available."

    Anonymous

    Instagram and Facebook are broken most of the time: constantly getting kicked out of the account, being blocked by IP, … Although there aren’t many requests to save pages from there, fewer than 100 a day. I think there must be quite a few live users who view many more pages in a day. How they do it is not clear to me.

    • 2 years ago
    • 1 notes
  • Could you expand all "Drivers details" on 3ZQ7j (mesamatrix)? Thanks in advance.

    Anonymous

    It opens on click.

    • 2 years ago
    • 1 notes
  • Any possibility of a MacOS Safari extension like the Chrome one? Thanks.

    Anonymous

    Please ask the author of the Chrome/Firefox extension; it is not me. I cannot do it myself, as I have no macOS devices.

    • 2 years ago
    • 3 notes
  • Can you replace reCAPTCHA with Cloudflare Turnstile? I constantly have to do captchas, and Cloudflare Turnstile is much faster for me. Thanks

    Anonymous

    No, that captcha is too difficult: I can’t select “strawberry cakes” among others just by the picture.

    • 2 years ago
    • 3 notes
  • Are pages of a domain deleted periodically to make sure there are no more than ~1000? The number of Dezgo pages keeps decreasing. Now there are 1162; previously there were 1300, and before that ~2400. Thanks. Those links can no longer be accessed live.

    Anonymous

    The pages are not deleted, but the number of search results is limited.

    To get more pages, try splitting the query into `domain.com/a*`, `domain.com/b*`, etc.

    • 2 years ago
    • 1 notes
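The query-splitting advice above can be scripted. This is a minimal sketch that only builds the query strings; the `domain.com/a*` wildcard form comes from the answer, while the function name and the choice of characters to iterate over (lowercase letters and digits) are assumptions.

```python
import string
from typing import List


def prefix_queries(domain: str,
                   alphabet: str = string.ascii_lowercase + string.digits) -> List[str]:
    """Build one wildcard search query per leading path character."""
    # Each query covers only the pages whose path starts with that character,
    # so each result set is more likely to fit under the search result limit.
    return [f"{domain}/{ch}*" for ch in alphabet]


for query in prefix_queries("domain.com"):
    print(query)
```

If a single prefix still hits the limit, the same idea can be applied one level deeper (for example `domain.com/aa*`, `domain.com/ab*`).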
  • Reddit has really been bugging out the archiver the past few days.

    Anonymous

    Examples?

    • 2 years ago
  • Economist articles archived from at least today are being cut off, not showing full article & text is being overlaid by the usual list of links/images of related articles. Is this a bug, or?

    Anonymous

    Could you point to the exact page?

    I have looked at the 15 latest saves from the Economist and found no issues.

    • 2 years ago
  • Community tabs on YouTube channels don't archive correctly; the archiver redirects to a specific post on the Community tab instead, like this: /5lUZU

    Anonymous

    Fixed. (The code that clicks on “expand” buttons was hitting one wrong button.)

    • 2 years ago
    • 1 notes
  • Is the search function broken?

    Anonymous

    Yes.

    The index is being rebuilt; it will be back in a few hours.

    • 2 years ago
    • 2 notes
  • does archiving a webpage send the user’s IP address to the host site?

    Anonymous

    It did in the old version (before 2019), to get localized versions of pages.

    Now that is useless, because nowadays most localizations amount to “this page is not for your country”, so the IP is no longer passed to the host site.

    • 2 years ago
  • What is the long version of the URL, please? Wikipedia wants it to be used... how do I get to it? Greg

    Anonymous

    Click on the “share” button to see different forms of linking to a page.

    For example https://archive.vn/Aoans/share

    • 2 years ago
    • 1 notes
  • The Archive seems to be having difficulty archiving YouTube urls. What's going on?

    Anonymous

    YouTube started showing a captcha to the archiver.

    • 2 years ago
    • 1 notes
  • Has Roskomnadzor behavior or amount of removal requests changed since the start of the war?

    Anonymous

    No, there were no removal requests related to this conflict at all. Neither from Roskomnadzor nor from other agencies. Although the stream of requests about ISIS content goes as usual.

    • 2 years ago
    • 3 notes
  • About the SSL_ERROR_NO_CYPHER_OVERLAP in the previous question. This problem isn't on the DNS server, but on the Archive website (google it, many people are complaining). I worked around this error by adding your IP to my HOSTS file.

    Anonymous

    Your last sentence proves that the problem is exactly the DNS server.

    • 2 years ago
  • Could you expand comments on nNUlG? Thanks!

    Anonymous

    No, `substack.com` does not expand comments, they are on another page; if I click on “expand comments”, the content disappears.

    • 2 years ago
  • The connection for this site is not secure. archive_dot_ph uses an unsupported protocol. ERR_SSL_VERSION_OR_CIPHER_MISMATCH. Unsupported protocol: the client and server do not support a common SSL protocol version or cipher suite. It does not depend on the browser. Everything works through a VPN. Location: Ukraine.

    Anonymous

    Try changing your DNS from 1.1.1.1 to something else.

    • 2 years ago
  • Can't you buy cards with crypto, or something like that? I'm sure some people would be willing to help with that.

    Anonymous

    That’s exactly the case of “Some people, when confronted with a problem, think ‘I know, I’ll use regular expressions.’ Now they have two problems.”

    1. Where to get crypto (a certain amount every month, with no skips)? Donations are not enough and not stable. Brave browser stopped paying again (not enough anyway). Traveling to the towns trying to buy crypto for paper cash from shady people does not look like a sustainable plan. Getting a job with a crypto salary will not cover the demand either.

    2. Cryptocard services are not reliable and are prone to exit scams and regulatory risks. This would require supporting a redundant system on top of at least three of them, maintaining the necessary balances everywhere, tracking news and being prepared for losses.

    On the other hand, advertising (and possibly paid features) makes it possible to escape the money conversion hell and to tune cash flows in different currencies and on different sides of the Iron Curtain, to cover the local expenses.

    • 3 years ago
    • 1 notes
  • Why are there ads on this site?

    Anonymous

    I already answered this: https://blog.archive.today/post/677297433505628160/a-bit-different-than-a-usual-question-but-do-you

    Basically, money has become fragmented and hard to convert.

    For example, I have no other way to top up PayPal except with donations (and there aren’t enough of them) or by showing a certain amount of advertising from an agency that pays there. Cards do not work, making new ones involves a trip to a different vaccine zone, etc.

    • 3 years ago
    • 2 notes
  • Do you plan to add any cryptocurrency methods as a donation option? Consider that there are many people on the other side of the world who do not have access to a bank card with international payment capabilities (Visa/Mastercard etc). They can pay with the local banking system and local currency (cash, checkouts or some online payment system), and they can use cryptocurrency; otherwise you would need to add support for so many payment systems (WebMoney, JCB, WeChat Pay/Alipay/UnionPay, prepaid / gift cards).

    Anonymous

    The site does not have any premium features available only to paid users, so there is no need to consider yourself penalized if you can’t pay.

    • 3 years ago
  • One year later, how has the OVHcloud fire impacted the project? Are you able to participate in the Action Collective (Class Action) against the company?

    Anonymous

    No, all the equipment there was rented.

    • 3 years ago
  • I have a suggestion. When a user archives a page, sometimes an error page is archived (ie, like archive "9UE0W"). When that user is shown the archive for the first time, they (and only that user) could be presented with a question, "Has this page been archived correctly?" If they respond "no", then the archive would be deleted, so they can try again. After a short period of time, this option no longer appears to the user.

    Anonymous

    A retry won’t help: the page address is invalid, it has “%3F” instead of “?”.

    • 3 years ago
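To illustrate the “%3F” case: a percent-encoded question mark becomes part of the path, so the server never sees a query string. The sketch below detects that pattern and rewrites the first “%3F” back to “?”. The function names and sample URL are hypothetical, and the repair is only a heuristic, since a literal “%3F” in a path can be legitimate on some sites.

```python
import re
from urllib.parse import urlsplit


def has_encoded_query_separator(url: str) -> bool:
    """True if the URL has no query string but its path contains %3F."""
    parts = urlsplit(url)
    return parts.query == "" and "%3f" in parts.path.lower()


def repair_url(url: str) -> str:
    """Replace the first %3F with '?' when it looks like a mangled separator."""
    if has_encoded_query_separator(url):
        return re.sub(r"%3[fF]", "?", url, count=1)
    return url


broken = "http://example.com/page%3Fid=1"  # hypothetical address
print(urlsplit(broken).path)               # the whole tail is treated as path
print(repair_url(broken))                  # query separator restored
```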
  • Thanks for looking at the Telegram link preview issue yesterday. I'm afraid they still don't work. Note that you can test them, by re-scraping a page, using the @WebpageBot bot.

    Anonymous

    Yes, it works unreliably, and I do not yet know why :(

    It is a different issue: the first was that /xxxx works and /xxxx/image does not, and it was reproducible with other previews (Twitter, …).

    The second is Telegram-only and affects all pages.

    • 3 years ago
    • 1 notes
  • I try to log in to Archive, but for some reason it keeps taking me to a Welcome to Nginx page without ever going to the actual website, is there some sort of way to fix it

    Anonymous

    There are no accounts and no way to log in.

    • 3 years ago
    • 1 notes
  • Is there any link between your website and the Internet Archive?

    Anonymous

    No

    • 3 years ago
  • Hello. Thank you for providing this amazing service. Are you aware that link previews for 'archive today' don't work in Telegram, even when using a link to the screenshot tab? Is there a reason why, and is it possible to fix it? Thanks a lot.

    Anonymous

    Looks like a robots.txt issue. It should work now.

    • 3 years ago
    • 1 notes
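As an illustration of how a robots.txt rule can break link previews, the sketch below uses Python's standard `urllib.robotparser` to test whether a preview fetcher may load a page. Both robots.txt samples, the example URL, and the `TelegramBot` user-agent token are assumptions made for the example; they are not the archive's actual configuration.

```python
from urllib.robotparser import RobotFileParser


def allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Check a URL against robots.txt text supplied as a string."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical rules that block every crawler, preview bots included.
BLOCKING = """\
User-agent: *
Disallow: /
"""

# Hypothetical rules with an exception for the preview bot.
PERMISSIVE = """\
User-agent: TelegramBot
Allow: /

User-agent: *
Disallow: /
"""

url = "https://example.org/abcd/image"
print(allowed(BLOCKING, "TelegramBot", url))    # blocked by the catch-all rule
print(allowed(PERMISSIVE, "TelegramBot", url))  # allowed by the specific rule
```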
  • There are at least two use cases for your service: 1. To archive pages for posterity. 2. To bypass paywalls or weekly article limits. For the second use case, the article might only need to be archived for a month, after which the content may no longer be of interest. I wonder if you could reduce storage requirements by running two services: one for permanent archives, and one for temporary archives. Also, I am curious: What other use cases are there?

    Anonymous

    Basic scenario: saving a page that can be edited or deleted.

    Your two are accidental: the first is because of the word “archive” in the domain and the false association with archive.org, and the second is a side effect of cookie isolation and incomplete JavaScript support.

    It is definitely not a free permanent infinite cloud storage for your hentai collections.

    • 3 years ago
    • 1 notes
  • Hello. I am developing an application that programmatically loads archived webpages from your service, but the captcha has destroyed this ability. Your service is one of the best. Is there an API available or is using your service in this way an impossibility? I am open for any discussion. Thank you.

    americansamizdat

    No, I can’t afford automated saving. The current hardware can barely handle manual. Of course, it is possible to create a paid API service to buy new servers. But… the current crisis of supply, payment, and trust has shown that limiting growth was the right thing to do. If the archive had gone that way, it would have to be shut down now.

    • 3 years ago
    • 2 notes
  • with the looming threat of russia cutting off their internet, how will the site manage their servers being accessible to the world, if they are located there?

    Anonymous

    They are not in Russia (although looking at energy prices, I would prefer that they were there).

    In any case, Internet fragmentation is already a thing: the Chinese segment has been around for years (the archiver is neither able to crawl most of it nor serve most people there), and from now on there will be the Russian segment. What’s next? Islamic? … So yes, you are right, one day every site will land in one of the segments, being inaccessible from the rest. “The world” is becoming obsolete.

    • 3 years ago
    • 4 notes