    Repositories list

    • Zeno

      Public
      State-of-the-art web crawler 🔱
      HTML
      GNU Affero General Public License v3.0
      1183265 Updated Dec 2, 2024
    • One webpage for every book ever published!
      Python
      GNU Affero General Public License v3.0
      1.4k5.3k817154 Updated Dec 2, 2024
    • The Internet Archive BookReader
      JavaScript
      GNU Affero General Public License v3.0
      41999813488 Updated Dec 1, 2024
    • heritrix3

      Public
      Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
      Java
      Other
      7622.8k335 Updated Nov 30, 2024
    • gocrawlhq

      Public
      Go client for Crawl HQ v3
      Go
      0001 Updated Nov 29, 2024
    • TypeScript
      GNU Affero General Public License v3.0
      15213 Updated Nov 29, 2024
    • rclone

      Public
      [vault fork] of "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Yandex Files
      Go
      MIT License
      4.2k200 Updated Nov 29, 2024
    • IAUX Typescript WebComponent Template
      TypeScript
      GNU Affero General Public License v3.0
      47311 Updated Nov 28, 2024
    • A Modal Manager WebComponent
      TypeScript
      GNU Affero General Public License v3.0
      11112 Updated Nov 28, 2024
    • TypeScript
      GNU Affero General Public License v3.0
      00111 Updated Nov 28, 2024
    • iaux

      Public
      Monorepo for Archive.org UX development and prototyping.
      JavaScript
      GNU Affero General Public License v3.0
      866788144 Updated Nov 27, 2024
    • Sparkling

      Public
      Internet Archive's Sparkling Data Processing Library
      Scala
      MIT License
      21110 Updated Nov 27, 2024
    • TypeScript
      2200 Updated Nov 27, 2024
    • iiif

      Public
      The official Internet Archive IIIF service
      JavaScript
      GNU General Public License v3.0
      422172 Updated Nov 26, 2024
    • brozzler

      Public
      brozzler - distributed browser-based web crawler
      Python
      Apache License 2.0
      976733313 Updated Nov 26, 2024
    • A repository of cleanup bots implementing the openlibrary-client
      Python
      Other
      4962278 Updated Nov 25, 2024
    • Components for the IA Wayback Machine to render legacy media and data in a human-friendly fashion
      Python
      0000 Updated Nov 25, 2024
    • www

      Public
      archive.org website prototype - using only javascript static files
      JavaScript
      GNU Affero General Public License v3.0
      0200 Updated Nov 24, 2024
    • newsum

      Public
      Daily TV News Summary using GPT
      Python
      GNU Affero General Public License v3.0
      42111 Updated Nov 23, 2024
    • JavaScript
      GNU Affero General Public License v3.0
      8720 Updated Nov 22, 2024
    • React components to render differences between captures at the Wayback Machine
      JavaScript
      GNU General Public License v3.0
      83211 Updated Nov 22, 2024
    • Add/remove item to userlists on Details page
      TypeScript
      GNU Affero General Public License v3.0
      1101 Updated Nov 22, 2024
    • A Streamlit application to visualize Wikipedia IABot statistics
      Python
      GNU Affero General Public License v3.0
      1200 Updated Nov 21, 2024
    • PHP
      GNU Affero General Public License v3.0
      3312702 Updated Nov 21, 2024
    • iare

      Public
      An interactive IARI JSON viewer
      JavaScript
      GNU Affero General Public License v3.0
      45331 Updated Nov 21, 2024
    • iari

      Public
      Import workflows for the Wikipedia Citations Database
      Python
      GNU General Public License v3.0
      911560 Updated Nov 21, 2024
    • IA lending bar controls for bookreader
      JavaScript
      GNU Affero General Public License v3.0
      0213 Updated Nov 20, 2024
    • An API wrapper to the Elasticsearch index of web archival collections and a web UI to explore those indexes.
      Python
      GNU Affero General Public License v3.0
      5820 Updated Nov 19, 2024
    • gospn

      Public
      Save Page Now client in Go
      Go
      GNU Affero General Public License v3.0
      1500 Updated Nov 18, 2024
    • gifcities

      Public
      gifcities.org web app
      Go
      GNU Affero General Public License v3.0
      0210 Updated Nov 15, 2024