Hyper-fast HTTP Scraping Tool

Go to file

acidvegas dc7a66fadb added preview		2025-02-10 01:14:37 -05:00
.screens	added preview	2025-02-10 01:14:37 -05:00
httpz.py	clean output	2025-02-10 01:11:30 -05:00
LICENSE	Title and body cleanup	2025-02-10 00:24:28 -05:00
README.md	added preview	2025-02-10 01:14:37 -05:00
requirements.txt	REAL BAY SHIT MOTHER FUCKER	2025-02-09 23:56:46 -05:00
setup.sh	REAL BAY SHIT MOTHER FUCKER	2025-02-09 23:56:46 -05:00

README.md

HTTPZ Web Scanner

A high-performance concurrent web scanner written in Python. HTTPZ efficiently scans domains for HTTP/HTTPS services, extracting valuable information like status codes, titles, SSL certificates, and more.

Requirements

Python
- aiohttp
- apv
- beautifulsoup4
- cryptography
- dnspython
- mmh3
- python-dotenv
- tqdm

Installation

git clone https://github.com/acidvegas/httpz
cd httpz
chmod +x setup.sh
./setup.sh

Usage

python httpz.py domains.txt [options]

Arguments

Argument	Long Form	Description
`file`	-	File containing domains (one per line), use `-` for stdin
`-d`	`--debug`	Show error states and debug information
`-c N`	`--concurrent N`	Number of concurrent checks (default: 100)
`-o FILE`	`--output FILE`	Output file path (JSONL format)
`-j`	`--jsonl`	Output JSON Lines format to console
`-all`	`--all-flags`	Enable all output flags

Output Field Flags

Flag	Long Form	Description
`-sc`	`--status-code`	Show status code
`-ct`	`--content-type`	Show content type
`-ti`	`--title`	Show page title
`-b`	`--body`	Show body preview
`-i`	`--ip`	Show IP addresses
`-f`	`--favicon`	Show favicon hash
`-hr`	`--headers`	Show response headers
`-cl`	`--content-length`	Show content length
`-fr`	`--follow-redirects`	Follow redirects (max 10)
`-cn`	`--cname`	Show CNAME records
`-tls`	`--tls-info`	Show TLS certificate information

Other Options

Option	Long Form	Description
`-to N`	`--timeout N`	Request timeout in seconds (default: 5)
`-mc CODES`	`--match-codes CODES`	Only show specific status codes (comma-separated)
`-ec CODES`	`--exclude-codes CODES`	Exclude specific status codes (comma-separated)
`-p`	`--progress`	Show progress counter

Examples

Scan domains with all flags enabled and output to JSONL:

python httpz.py domains.txt -c 100 -o output.jsonl -j -all -to 10 -mc 200,301 -ec 404,500 -p

Scan domains from stdin:

cat domains.txt | python httpz.py - -c 100 -o output.jsonl -j -all -to 10 -mc 200,301 -ec 404,500 -p