Is there any way we can donate our residential IP address to your project?
What would be very useful is a residential IP address in mainland China.
I cannot open many .gov.cn websites, even from Alibaba Cloud.
As you are aware, some sites do not load if they are visited from a European IP address because the site does not comply with GDPR. I found a US border news site that needs to be loaded from another region (/TWnDg). How do you normally handle these edge cases? Do you have to hand-code a list of overrides?
There is a multi-exit VPN which uses US IPs to access such newspapers. The bug here is that although the exit IP is in the USA, the MaxMind GeoIP database lists it as being in the Netherlands :(
The unfolding of Quora answers has not been working again since 2020-09-16 (examples: 9vnYb, 52nJS).
fixed
Correct me if I'm wrong, but I once read somewhere that the limit for the number of archives a site can have is 1000. Is this true?
It is probably the limit on search results. There are numbers like 1000, 10000, …
Hi, thanks again for your amazing service! I noticed that an increasing amount of news is communicated via data dashboards, for example the various COVID-19 dashboards. Archiving these seems to be difficult, as the dashboard visualization appears to have either a lazy-load feature or some DRM complexities. Could you look at /ohXL0? This is a wildfire dashboard for the United States, and the archive shows the dashboard but it's blank.
Fixed
Will you offer a paid service? I want to pay you to scrape/archive my bookmarks.
No. You need something like Evernote for that.
Is there a way to save something that is visible only to me (like a Facebook post in a private group)?
I think it is possible to add this feature to the browser extension.
Your Facebook account is banned again...
Yes
Why do the long answers on Quora no longer unfold (example: LGn9p)? A few days ago it still worked (example: 828E2). Some archives of the Japanese Quora have had the same problem for a few months already: the first few long answers won't unfold (including but not limited to: KABrn, LVsqV, NVRQb, Q26Mu).
Quora has changed design a little bit. Should be fixed.
The regional versions are my bug: the archiver clicks on a button with the text “continue reading”, which is localized in regional versions. I will fix it today.
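The localization bug described above can be illustrated with a small sketch. It assumes the archiver picks the button to click by comparing its visible text against known labels. The label table and the function name `is_expand_button` are hypothetical, and the non-English strings are illustrative translations, not necessarily the exact text Quora uses.

```python
# Hypothetical table of "continue reading" labels, keyed by language code.
# The non-English entries are illustrative translations only.
EXPAND_LABELS = {
    "en": "continue reading",
    "ja": "続きを読む",
    "es": "continuar leyendo",
}


def normalize(text: str) -> str:
    """Collapse whitespace and case-fold so minor markup differences do not matter."""
    return " ".join(text.split()).casefold()


# Precompute the normalized labels once.
_KNOWN_LABELS = {normalize(label) for label in EXPAND_LABELS.values()}


def is_expand_button(button_text: str) -> bool:
    """Return True if the button text matches any known localized label."""
    return normalize(button_text) in _KNOWN_LABELS
```

Matching only the English string would miss the Japanese button, which is consistent with the symptom reported for the Japanese Quora.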
If I mailed you a bunch of hard drives and donated to cover your time, could I get a backup of your entire archive?
No. Both archive mirrors are not local to me and there are COVID travel restrictions.
Why are Newegg product pages being redirected to the shopping cart? Example: archive is/JaQ5W. What a product page would normally look like: archive is/UKQx6. Is the redirect caused by some kind of bug, and can it be fixed?
Fixed (there was a bug: the click on [x] to close the modal popup was performed twice)
How has the COVID-19 Pandemic affected the popularity of your service? Have you noticed an increase in sites archived or retrieved?
I spotted two projects which use the archive to track COVID press coverage: https://ncovmemory.github.io/ (early days in China) and https://www.covid19-archive.com/ (worldwide news).
As for the numbers: we deployed a new system in December 2019, which is based on Chromium instead of the old PhantomJS. It improved the quality of snapshots but degraded performance. So we have to limit the rates of archival and retrieval, and that impacted the stats more than COVID did.
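The answer does not say how the rate limits are enforced. One common technique for this kind of limit is a token bucket. The following is a minimal sketch of that technique, not the archive's actual mechanism; the class name, parameters, and numbers are all assumptions.

```python
import time


class TokenBucket:
    """Allow about `rate` operations per second, with bursts up to `capacity`."""

    def __init__(self, rate: float, capacity: float, clock=time.monotonic):
        self.rate = rate            # tokens added per second
        self.capacity = capacity    # maximum number of stored tokens
        self.clock = clock          # injectable clock, useful for testing
        self.tokens = capacity      # start with a full bucket
        self.last = clock()

    def allow(self) -> bool:
        """Refill tokens for the elapsed time, then try to spend one."""
        now = self.clock()
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

A caller would check `bucket.allow()` before starting an archival or retrieval job and queue or reject the job when it returns `False`.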
I tried to archive the following page, but the subscribe popup still appeared, leaving the article darkened out. Can this be fixed? mississippitoday org-201-03-17-death-in-meridian-a-mystery-three-years-later
yes, fixed
Jesus Christ I just checked the GitHub page (via a Reddit link) on this issue, and they freaking paid your friend! What the hell?! Opera is run by the Chinese Communist Party, but that's fine with you, whereas Brave is run by a bunch of libertarians and they're evil? Holy shit. Look at the last post on this page, is he lying? (replace DOT with .) githubDOTcom/brave/brave-browser/issues/10219
They did not pay many other Chinese, Russian, Indonesian, … and still have (probably the only) cryptocurrency with racial segregation.
The point about Brave's userbase is also interesting: as it turned out after the block, their drive-by download program is popular among toxic websites, so the Brave block reduced the popularity of the archive in those communities, which is sort of a positive move too.
So a browser that allows its users to retain all the privacy they desire... that's your version of a "scam"? Whereas a browser that openly spies on its users for a massive megacorporation (Chrome) is *not* a scam. Right. Talk about screwy priorities. If I don't want to see any ads on Brave, I don't have to. And I know they aren't spying on me. But because they pissed you off, I have to use a browser with spyware to use your site. Brilliant. Just brilliant. Is Google paying you? Is China? Damn.
Neither of them provides enough privacy. Proof? When Facebook bans an account, it is not possible to use the same browser (whether it is Brave or Chrome, even in “incognito mode” and with a new IP address) to register new accounts for many days. Browsers leak enough information for Facebook to track you.
The problem with Brave was the racial rhetoric they use to steal money: some races can receive their cryptocurrency while others are not eligible.
How do NEW archives of twitter pages & tweets still show the real "GoodTwitter" layout? I thought the plugin stopped working and the GoodTwitter2 userscript isn't as close to the original "GoodTwitter" layout as the archives are (for example: the tweet overlaying the profile page instead of just a white background). Please, the new web 3.0 design of Twitter makes it so much harder to use.
Setting UserAgent to GoogleBot does the trick for most of the Twitter pages.
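As a rough illustration of the User-Agent trick, here is a minimal sketch using Python's standard library. The exact User-Agent string the archiver sends is not stated in the answer; the value below is the commonly documented Googlebot string and is used here as an assumption. The helper name `build_request` is hypothetical.

```python
import urllib.request

# Assumption: a Googlebot-style User-Agent string. The exact value used by
# the archiver is not given in the answer above.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def build_request(url: str, user_agent: str = GOOGLEBOT_UA) -> urllib.request.Request:
    """Build (but do not send) a request carrying the given User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})
```

The request can then be passed to `urllib.request.urlopen`. Whether a site serves a different layout for this User-Agent depends entirely on the site and may change at any time.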
I imagine you get some disapproval of what you do and of your archive. I would like to give you an example you could reference in case you need to counter criticism or censure. /hLMoL is an archive of a subreddit called "anime_titties". This isn't porn but world news. There is no porn at this URL. Censorship by word list would exclude this news discussion and community.
There is no keyword-based censorship
Hi, I have noticed that an increasing number of websites with interstitials will not load the page until the box is clicked. I can imagine this is a game of whack-a-mole and a huge headache for your team. An example is /am6I7. Could you take a peek at this site's implementation of lazy loading and see if solving this one is worth your time? I'm worried that their method or library might be used by other sites.
Fixed.
Yes, there is a big database, similar to adblockers’, which requires maintenance as websites change.
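To give an idea of what such an adblocker-like database might look like, here is a minimal sketch. The rule format, the domains, the selectors, and the function name `selectors_for` are all made up for illustration; the real database's structure is not described in the answer.

```python
from urllib.parse import urlparse

# Hypothetical rules: domain -> CSS selectors of overlays to remove before
# taking the snapshot. Domains and selectors are invented for illustration.
OVERLAY_RULES = {
    "example.com": ["#cookie-consent", ".modal-backdrop"],
    "news.example.org": [".subscribe-popup"],
}


def selectors_for(url: str, rules=OVERLAY_RULES) -> list:
    """Collect selectors from every rule whose domain matches the URL's host."""
    host = (urlparse(url).hostname or "").lower()
    matched = []
    for domain, selectors in rules.items():
        # Match the domain itself and any of its subdomains.
        if host == domain or host.endswith("." + domain):
            matched.extend(selectors)
    return matched
```

Each time a site changes its markup, the matching entry has to be updated by hand, which is the maintenance burden mentioned above.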
Hi, there is a cookies opt-in popup overlay on the site sfchronicle that hides the content underneath. Could you remove overlay for this website please?
Fixed
Can you do something about yandex cache pages leading to 404? Example: DpSGL
They can be opened only from the same IP.
If you send a link to yandex cache to somebody, they will get 404 too
:(
UPD: I rearchived it at https://archive.vn/J8tx9 Archiving Yandex Cache can be solved by simulating full user interaction with Yandex: entering the URL into the search form and then doing 2 clicks. I’ll try to implement it in a few days.