8 Commits

Author SHA1 Message Date
fabriziosalmi
e8da4c3d28 nginx snippets generation fix + others minor improvements. 2025-01-16 14:02:19 +01:00
fab
5cf6e75096
Update badbots.py
feat: Integrate tqdm for progress tracking during bot list fetching
feat: Add regex filters to exclude IP addresses and domains from bot lists
refactor: Remove IP and domain-specific sources from BOT_LIST_SOURCES
refactor: Update parse_bot_list to skip lines matching IP or domain regex
refactor: Improve logging for better debugging and clarity
refactor: Use ThreadPoolExecutor with tqdm for concurrent fetching with progress
docs: Add comments and docstrings for better code understanding
chore: Ensure output directories exist before generating WAF configurations
2025-01-03 21:09:00 +01:00
fab
61e1a856c9
Update badbots.py
Improved error handling, logging and code structure.
2025-01-03 13:20:01 +01:00
fab
9f1266f81f
Update badbots.py
parsing fixes
2025-01-01 12:56:12 +01:00
fab
f347ac2817
Update badbots.py
- Introduced concurrency with ThreadPoolExecutor for faster bot list fetching.
- Added rate limiting and exponential backoff for resilient fetch operations.
- Expanded bot list sources to include new datasets for improved coverage.
- Implemented GitHub token authentication for rate-limited endpoints.
- Refactored file generation to use pathlib for better path management.
- Replaced os.path with Path for consistency and readability.
- Modularized retry logic and enhanced error handling for robustness.
- Ensured bot lists are aggregated and deduplicated across sources.
- Improved logging for better visibility into fetch and write operations.
2024-12-30 00:08:12 +01:00
fab
2763bbc13b
Update badbots.py
Fix for fallback #1 URL which lands nowhere (404).
2024-12-21 11:29:45 +01:00
fab
df34f762c3
Update badbots.py 2024-12-21 01:57:55 +01:00
fab
c9ef0751e1
Create badbots.py 2024-12-21 01:46:33 +01:00