A Python script to scrape GitHub profiles and repositories. Given a GitHub username, the script can obtain the profile name, profile URL, users followed, followers, and recent repositories with their URLs. It can also obtain repository information, including the name, description, and number of stars, watchers, and forks.
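The fields listed above can be pictured as a small data model. The following is only an illustrative sketch, not the script's actual code: the names `Profile`, `Repository`, and `repository_url` are hypothetical, and it assumes that "users followed" and "followers" are stored as counts.

```python
from dataclasses import dataclass, field
from typing import List

GITHUB_BASE = "https://github.com"


@dataclass
class Repository:
    """Repository details named in the description (hypothetical layout)."""
    name: str
    url: str
    description: str = ""
    stars: int = 0
    watchers: int = 0
    forks: int = 0


@dataclass
class Profile:
    """Profile details named in the description (hypothetical layout)."""
    username: str
    name: str = ""
    following: int = 0  # assumption: stored as a count
    followers: int = 0  # assumption: stored as a count
    repositories: List[Repository] = field(default_factory=list)

    @property
    def url(self) -> str:
        # A profile page lives directly under the GitHub base URL.
        return f"{GITHUB_BASE}/{self.username}"


def repository_url(username: str, repo_name: str) -> str:
    # A repository page lives under its owner's profile URL.
    return f"{GITHUB_BASE}/{username}/{repo_name}"
```

Grouping the values this way keeps the profile and its repositories together, which makes later steps such as printing or exporting simpler.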
It is important to read the GitHub terms and policies before using this script, as it may be against their terms to use this service for collecting and selling users' personal information.
The author of this script neither encourages nor accepts responsibility for such actions. Please read the GitHub Terms regarding Information Usage Restrictions before using the script.
- Python 3.x
- requests library
- BeautifulSoup library
- Clone the repository:
git clone https://github.com/pragadeshbs/GitHubScraper.git
- Install the required libraries:
pip install -r requirements.txt
- Run the script:
python gitScrap.py
The script can be configured to scrape additional data or to output the data in a different format. Please refer to the code comments for more information.
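As one possible way to produce a different output format, the sketch below serializes scraped repository records to JSON or CSV using only the standard library. It is an assumption about how such a change could look, not a description of gitScrap.py: the functions `to_json` and `to_csv` and the dictionary keys in the usage example are hypothetical.

```python
import csv
import io
import json
from typing import Dict, List


def to_json(repos: List[Dict[str, object]]) -> str:
    """Serialize repository records to an indented JSON string."""
    return json.dumps(repos, indent=2)


def to_csv(repos: List[Dict[str, object]]) -> str:
    """Serialize repository records to CSV text, one row per repository."""
    if not repos:
        return ""
    buffer = io.StringIO()
    # Column order follows the keys of the first record.
    writer = csv.DictWriter(
        buffer, fieldnames=list(repos[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(repos)
    return buffer.getvalue()
```

For example, calling `to_csv([{"name": "demo", "stars": 3, "forks": 1}])` returns a header line followed by one data row, which can then be written to a file.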
Contributions to the project are always welcome! Feel free to submit a pull request or open an issue if you find any bugs or have suggestions for improvement.
This project is licensed under the MIT License - see the LICENSE file for details.