One webpage for every book ever published!
Python 4.1k 945
The Internet Archive BookReader
JavaScript 790 364
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Java 2.4k 743
IA's public Wayback Machine (moved from SourceForge)
Python Client Library for the Archive.org OpenLibrary API
A repository of cleanup bots built on the openlibrary-client
Summarize and ask questions about items in the Internet Archive
Monorepo for Archive.org UX development and prototyping.
An interactive WARI JSON viewer
Import workflows for the Wikipedia Citations Database
Efficient hOCR tooling
A web component that highlights Democracy's Library