Repository Scraper Tool 📜

This tool is designed to scrape and combine the text from all files in a GitHub repository into a single text file. It supports both cloning a repository directly from GitHub (Online Version) and processing a repository that has already been downloaded to your local machine (Offline Version).

Prerequisites 📋

Git must be installed on your system.
Python 🐍 must be installed on your system.
Ensure you have internet access and the necessary permissions to clone the target repository.

Usage [online]🌐

Open online-scraper.py in your python development software (such as PyCharm)
Replace https://github.com/GithubName/RepoName.git with the URL of the GitHub repository you want to scrape.
Run the script: python online-scraper.py
The script will clone the repository and combine the contents of all files into scraped.txt.

Usage [offline]🔍

download the repo that you want to scrape
Open offline-scraper.py in your python development software (such as PyCharm)
Replace C:\Users\SomeRandomAssFolder\Downloads\YourDownloadedRepoFolder with the path to the repo you want to scrape.
Run the script: python offline-scraper.py
all your shit should be scraped into a file called scraped.txt that is located in the same directory as the python script

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Repository Scraper Tool 📜

Prerequisites 📋

Usage [online]🌐

Usage [offline]🔍

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
offline-scraper.py		offline-scraper.py
online-scraper.py		online-scraper.py

stormcoph/RepoScraper

Folders and files

Latest commit

History

Repository files navigation

Repository Scraper Tool 📜

Prerequisites 📋

Usage [online]🌐

Usage [offline]🔍

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages