IP-based Website Scanner for Sensitive Content Detection and Automatic Classification
Project Description:
IP Content Scanner is a Python-based tool designed to scan a list of IP addresses and identify the type of content hosted—such as gambling 🎲, pornographic 🔞, or general ✅. This tool assists network administrators, security researchers, and service providers in auditing suspicious or unclassified IPs, with clean, color-coded output and optional Telegram notifications.
✨ Key Features
🎲 Gambling Site Detection: Identifies gambling-related websites using keyword matching.
🔞 Pornographic Site Detection: Flags adult sites based on accessible content.
✅ General Category: Marks safe content if no sensitive keywords are detected.
🌍 Geolocation Info: Displays city, region, and country of each IP.
💥 HTTP/HTTPS Access Check: Tests socket connection and web accessibility.
📊 Color-coded Output: Organized, readable output in the terminal.
📩 Telegram Notification: Sends result summaries via Telegram (optional).
⚙️ Quick Installation: Includes a setup script for dependencies.
⚙️ How to Use
1️⃣ Install Dependencies
chmod +x install_dependencies.sh
./install_dependencies.sh
2️⃣ Run the Scanner
python3 scan_situs.py ip_list.txt
💡 Make sure
ip_list.txt
contains the target IPs, one per line.
📲 Telegram Notification (Optional)
Enable by setting the following variables in the script:
TELEGRAM_TOKEN = "YOUR_BOT_TOKEN"
TELEGRAM_CHAT_ID = "YOUR_CHAT_ID"
🧠 Technologies Used
Python: requests, BeautifulSoup (bs4), socket, tqdm, colorama
API: ipinfo.io for IP location lookup
Technique: Regex for keyword detection, basic HTML parsing
🛡️ Notes
No data is saved to any file or database.
Scans are based solely on publicly accessible website content.
Encrypted or dynamic content (SPA/JS-heavy) is not supported.
🚀 Project Status
🟢 Stable – Ready for use in network supervision, content audit, or public infrastructure monitoring.
👨💻 This project is open-source and can be further developed as needed.