Projects
Projects status |
---|
Online (329) · Special cases (49) · Endangered (71) · Closing (17) · Offline (421)
|
Rescued Sites (497) · Self-Saved (17) · Partially Rescued Sites (210) · In Progress (43) · Upcoming (10) · Not Saved Yet (409) · On hiatus (12) · Lost Sites (90)
|
Unknown Status (64) |
This page should contain, or directly link to, almost all ArchiveTeam archiving endeavours, categorized.
- Current projects: currently active, upcoming and recently finished grandiose ArchiveTeam projects. (Extract of the next two categories.)
- Warrior projects: projects that utilize(d) ArchiveTeam's distributed archiving system.
- Manual projects: projects that need(ed) much more effort than just pushing a button.
- Small projects: small-scale website archiving projects usually done by a single individual.
- Early projects: the first archiving endeavours at the dawn of ArchiveTeam, in a format nobody apparently is able or dares to touch.
(The box at the top counts only projects that have dedicated wiki pages; those numbers are incomplete and by far do not cover all projects mentioned in the sections below.)
If you know of a website in danger, let us know on IRC. If it's a larger site, please also mention it on the Deathwatch page. After a decision is made on IRC, or if no decision is needed, you are encouraged to help keep things documented and up to date by adding projects or updating their status
- in the appropriate section(s),
- on the project's dedicated wiki page (if any),
- on Deathwatch and/or on Alive... OR ARE THEY.
The box at the top is generated automatically from the projects' dedicated wiki pages, so it should not be edited by hand.
Important: The contents of the sections below are embedded from other pages. Do not edit a section or this page directly; use the "Edit this list" link instead. (That opens the corresponding page for editing, and after saving you will be forwarded to a page containing only that list. Don't worry, you didn't delete the others.)
Current projects
Currently active team projects you can get involved in.
Archive Team recruiting
- Help us: ☞ Download and run your warrior ☜.
- What's on: online tracker.
- Donate to keep our projects going.
- Want to code for Archive Team? Here's a starting point.
Warrior-based projects
Short-term, urgent projects
- Vbox7: A Bulgarian video hosting site is getting rid of all user-uploaded videos on 2024-02-22. IRC Channel #vboxxy (on hackint).
Medium-term projects
(none currently)
Long-term projects
- Blogger: Grabbing inactive Blogger blogs since Google began a mass purge of inactive Google accounts on or after 2023-12-01. IRC Channel #frogger (on hackint).
- GitHub: Embraced-uh, I mean, bought by Microsoft. IRC Channel #gitgud (on hackint).
- Imgur: Unregistered users' "old" and "inactive" images will be purged, and all NSFW content is being shown the door on 2023-05-15. IRC Channel #imgone (on hackint).
- MediaFire: Not 'at-risk', but being grabbed speculatively to save historic files. IRC Channel #mediaonfire (on hackint).
- Pastebin: Archiving the pastas. IRC Channel #pastalavista (on hackint).
- Reddit: Banning communities that generate bad PR for Reddit Inc. Restricted access to APIs and data on 2023-06-19. IRC Channel #shreddit (on hackint).
- Telegram: Archiving public messages in various newsworthy and/or otherwise notable Telegram channels. IRC Channel #telegrab (on hackint).
- URLTeam: URL shorteners were a fucking awful idea. IRC Channel #urlteam (on hackint).
- URLs: A random collection of stuff. IRC Channel #// (on hackint).
- YouTube: Archiving selected videos. IRC Channel #down-the-tube (on hackint).
An updated Warrior virtual appliance (v3.2) is now available with better support for newer projects that utilize wget-at.
Manual projects
- ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
- Codearchiver: Dumping and archival of source code repositories and associated version control systems. IRC Channel #codearchiver (on hackint).
- Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. IRC Channel #archiveteam (on hackint).
- WikiTeam: Saving wiki dumps (XML) and their external links for the Wayback Machine (WARC), as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).
Upcoming & proposed projects
- Chrome Web Store: Google has announced a timeline of policy changes that will lead to content being removed between 2021-12-01 and 2025. IRC Channel #chromeweblore (on hackint).
- Photobucket: Finally following through on over a year of email threats that free accounts are going to be mass deactivated if they don't pay up. IRC Channel #photosucket (on hackint).
- Abandoned iOS App Store & Google Play apps: Both Apple and Google are slimming down on abandoned apps, with an estimated ~1.5M of them at risk. IRC Channel #appocalypse (on hackint).
- Twitter: General instability; deleting inactive accounts sometime (previously stated as 2019-12-11). IRC Channel #archiveteam-bs (on hackint).
- VKontakte: A Russian equivalent of Facebook carries the risk of tumbling down under the weight of sanctions as a result of the government's invasion of Ukraine. IRC Channel #lostkontakt (on hackint).
- JamiiForums: the Tanzanian government would like this gone. IRC Channel #archiveteam-bs (on hackint).
- LiveJournal: Very old, widely regarded as in decline, and has a lot of important stuff buried in it. IRC Channel #archiveteam-bs (on hackint).
- The Pirate Bay: Recently came back up, grabbing an archive for sanity's sake. IRC Channel #archiveteam-bs (on hackint).
- Valhalla: Where to store what even the Internet Archive doesn't have space for? IRC Channel #huntinggrounds (on hackint).
- Giphy: Bought by Facebook (later sold to Shutterstock), to be "integrated" (assimilated) into Instagram. https://news.knowyourmeme.com/news/facebook-to-buy-giphy
Recently finished projects
On Hiatus
- Angelfire: Angelfire is a web hosting service that contains big chunks of early WWW history and has no proper backup. IRC Channel #angelonfire (on hackint).
- Audit 2014: It's time to verify our shit. IRC Channel #auditteam (on hackint).
- Flickr: SmugMug (previously Yahoo!) decided to kill it after finding Yahoo!'s plans to do so before they were bought by Verizon. IRC Channel #flickrfckr (on hackint).
- FTP: Help us find and download all FTP sites! IRC Channel #effteepee (on hackint).
- Google Drive: Same as MediaFire. IRC Channel #googlecrash (on hackint). Currently on hiatus.
- Google Groups: "Gone within a year" (SketchCow, 2016-06-07).
- Google News Archive: Let's store all newspapers at Google, WCGW? IRC Channel #papersplease (on hackint).
- INTERNETARCHIVE.BAK: Grab a slice of the big cake of The Archive! IRC Channel #internetarchive.bak (on hackint).
- ISP Hosting: Finding ISP web hosting services before the Grim Reaper finds them. IRC Channel #webroasting (on hackint).
- Miraheze: Was to shut down sometime between 2023-09-01 and 2023-10-31. Rescued by new volunteers!
- Project Newsletter: Archiving e-newsletters, currently in development. IRC Channel #projectnewsletter (on hackint).
- Quizlet: Flashcards and other learning tools. IRC Channel #quizletusin (on hackint).
- Tinkercad: Autodesk announced its intent to put designs from inactive OAuth accounts back into minds around 2021-05-24. IRC Channel #tinkerhad (on hackint).
- Tumblr: Yahoo! considered killing it, now Yahoo has been acquired and Verizon declared war on NSFW blogs. Tumblr has since been sold to Automattic. IRC Channel #tumbledown (on hackint).
ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/
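For those who would rather script a connection than use the webchat, here is a minimal sketch of how the address above could be used from Python's standard library. It is only an illustration under stated assumptions: the helper names (`parse_irc_url`, `registration_lines`, `join_lines`, `connect_and_join`) and the nickname are hypothetical, the commands are the generic IRC `NICK`/`USER`/`JOIN`/`PONG` messages, and any extra requirements hackint may have (for example authentication) are not covered.

```python
import socket
import ssl
from urllib.parse import urlsplit

# Address taken from the line above; everything else here is illustrative.
HACKINT_URL = "ircs://irc.hackint.org:6697"


def parse_irc_url(url):
    """Split an irc:// or ircs:// URL into (host, port, use_tls)."""
    parts = urlsplit(url)
    if parts.scheme not in ("irc", "ircs"):
        raise ValueError(f"not an IRC URL: {url!r}")
    use_tls = parts.scheme == "ircs"
    # Conventional default ports, used only when the URL gives none.
    port = parts.port or (6697 if use_tls else 6667)
    return parts.hostname, port, use_tls


def registration_lines(nick):
    """Generic IRC registration commands for a (hypothetical) nickname."""
    return [f"NICK {nick}\r\n", f"USER {nick} 0 * :{nick}\r\n"]


def join_lines(channels):
    """One JOIN command per channel name."""
    return [f"JOIN {channel}\r\n" for channel in channels]


def connect_and_join(url, nick, channels, timeout=30):
    """Connect, register, and join once the server sends its 001 welcome.

    Returns True after the JOIN commands were sent, False if the server
    closed the connection first. Assumes no authentication is needed.
    """
    host, port, use_tls = parse_irc_url(url)
    sock = socket.create_connection((host, port), timeout=timeout)
    if use_tls:
        context = ssl.create_default_context()
        sock = context.wrap_socket(sock, server_hostname=host)
    with sock:
        for line in registration_lines(nick):
            sock.sendall(line.encode("utf-8"))
        buffer = b""
        while True:
            data = sock.recv(4096)
            if not data:
                return False
            buffer += data
            # Keep any trailing partial line in the buffer for the next read.
            *complete, buffer = buffer.split(b"\r\n")
            for raw in complete:
                message = raw.decode("utf-8", "replace")
                fields = message.split(" ")
                if fields[0] == "PING":
                    reply = "PONG" + message[4:] + "\r\n"
                    sock.sendall(reply.encode("utf-8"))
                elif len(fields) > 1 and fields[1] == "001":
                    for line in join_lines(channels):
                        sock.sendall(line.encode("utf-8"))
                    return True
```

A call such as `connect_and_join(HACKINT_URL, "examplenick", ["#archiveteam-bs"])` would attempt a real network connection; the function returns as soon as the JOIN commands are sent, so a longer-lived client would need to keep reading and answering `PING` messages afterwards.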
Warrior projects
ArchiveTeam's past, current and future Warrior projects with details, in a table form.
Manual projects
Difficult, discussion-intensive, human-resource-intensive and audit projects.
Small projects
List of smaller website rescuing projects, usually done by single individuals.
Early projects
List of ArchiveTeam's early endeavours, kept for historical interest and not edited.