BlindCrawler - Beta v1.0

A tool for web crawling & content discovery.

Installation

git clone https://github.com/AhmedConstant/BlindCrawler.git

cd BlindCrawler

sudo pip3 install -r requirements.txt

Usage

domain

python3 BlindCrawler.py -s https://domain.com

subdomain

python3 BlindCrawler.py -s https://sub.domain.com/path

random agents

python3 BlindCrawler.py -s https://sub.domain.com/path --random-agents

with cookies

python3 BlindCrawler.py -s https://sub.domain.com/path -c "key: value; key:value"
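
As a rough illustration of the cookie string format shown above, the sketch below turns a "key: value; key:value" string into a dictionary and then into a standard Cookie header value. This is an assumption about how such a string could be handled, not BlindCrawler's actual parsing code; the function names `parse_cookie_arg` and `cookie_header` are hypothetical.

```python
def parse_cookie_arg(raw):
    """Turn a 'key: value; key:value' string into a dict (hypothetical helper)."""
    cookies = {}
    for pair in raw.split(";"):
        if ":" not in pair:
            continue  # skip empty or malformed fragments
        key, value = pair.split(":", 1)
        if key.strip():
            cookies[key.strip()] = value.strip()
    return cookies


def cookie_header(cookies):
    """Build the value of an HTTP Cookie header from a dict (hypothetical helper)."""
    return "; ".join(f"{key}={value}" for key, value in cookies.items())


parsed = parse_cookie_arg("session: abc123; theme:dark")
header = cookie_header(parsed)
```

Whitespace around keys and values is ignored, so both "key: value" and "key:value" give the same result.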

Features

Process
- Crawl the subdomains to expand the discovery surface.
- Crawl /robots.txt for more URLs to crawl.
- Crawl /sitemap.xml for more URLs to crawl.
- Use the web archive CDX API to get more URLs to crawl.
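
The discovery steps above can be sketched with the Python standard library alone. This is a minimal illustration under stated assumptions, not the code BlindCrawler actually runs: the function names `urls_from_robots`, `urls_from_sitemap` and `cdx_query_url` are hypothetical, and the CDX query parameters are one reasonable choice among several the Wayback Machine CDX server accepts.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode, urljoin


def urls_from_robots(base_url, robots_text):
    """Collect URLs from Allow, Disallow and Sitemap lines of a robots.txt body."""
    found = []
    for line in robots_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        if value and field.lower() in ("allow", "disallow", "sitemap"):
            # urljoin leaves absolute URLs (Sitemap lines) unchanged;
            # wildcard patterns such as /*.php are joined as-is, not expanded
            found.append(urljoin(base_url, value))
    return found


def urls_from_sitemap(sitemap_xml):
    """Return every <loc> value from a sitemap or sitemap index document."""
    root = ET.fromstring(sitemap_xml)
    return [
        el.text.strip()
        for el in root.iter()
        # compare the local tag name so the XML namespace does not matter
        if el.tag.rsplit("}", 1)[-1] == "loc" and el.text
    ]


def cdx_query_url(domain):
    """Build a Wayback Machine CDX query for archived URLs of a domain."""
    params = {
        "url": domain,
        "matchType": "domain",  # include subdomains
        "output": "json",
        "fl": "original",       # return only the original URL field
        "collapse": "urlkey",   # one row per distinct URL
    }
    return "https://web.archive.org/cdx/search/cdx?" + urlencode(params)
```

Each function takes text that has already been downloaded, so the fetching layer (headers, cookies, retries) stays separate from the parsing.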
Output
- A file with all crawled URLs.
- A file with all crawled paths.
- A file with the subdomains discovered.
- A file with the schemes discovered.
- A file with the emails discovered.
- A file with the comments discovered.
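
To make the output categories above concrete, the sketch below shows one way crawled URLs and page bodies could be split into paths, hosts, schemes, emails and HTML comments. It is an assumption for illustration only: the helper names `split_urls` and `emails_and_comments` and the regular expressions are hypothetical, and BlindCrawler's own extraction rules may differ.

```python
import re
from urllib.parse import urlsplit

# Deliberately simple patterns; real-world email and comment matching is messier
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
COMMENT_RE = re.compile(r"<!--(.*?)-->", re.DOTALL)


def split_urls(urls):
    """Group URLs into sorted lists of paths, hostnames and schemes."""
    paths, hosts, schemes = set(), set(), set()
    for url in urls:
        parts = urlsplit(url)
        if parts.scheme:
            schemes.add(parts.scheme)
        if parts.hostname:
            hosts.add(parts.hostname)
            if parts.path:
                paths.add(parts.path)  # query string and fragment are dropped
    return {
        "paths": sorted(paths),
        "subdomains": sorted(hosts),  # every hostname seen, including the root domain
        "schemes": sorted(schemes),
    }


def emails_and_comments(html):
    """Return the unique emails and the HTML comments found in a page body."""
    emails = sorted(set(EMAIL_RE.findall(html)))
    comments = [text.strip() for text in COMMENT_RE.findall(html)]
    return emails, comments
```

Each list in the result could then be written to its own file, one entry per line.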
Performance
- Performance will be improved continuously to keep the crawler as fast as possible.
Design
- Object-oriented design.
- Good documentation.
- Script code that is easy to edit.

To-Do List

The Author

Ahmed Constant Twitter

