- FastAPI backend with DuckDB pushdown queries on 72M parquet - Async enrichment worker: HTTP, SSL, DNS MX, CMS fingerprint, ip-api.com - Resumable parquet download with HTTP Range support - Lead scoring engine (max 100 pts, target countries ES,GB,DE,FR,RO,PT,AD,IT) - Single-file Alpine.js + Chart.js dashboard on port 6677 - SQLite enrichment DB with job queue and scores tables - Dockerized with persistent /data volume Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
7 lines
222 B
Bash
7 lines
222 B
Bash
DATA_DIR=/data
|
|
PARQUET_URL=https://github.com/digitalcortex/72m-domains-dataset/raw/refs/heads/master/domains.parquet
|
|
CONCURRENCY_LIMIT=50
|
|
SCORE_THRESHOLD=60
|
|
TARGET_TLDS=es,com,net
|
|
TARGET_COUNTRIES=ES,GB,DE,FR,RO,PT,AD,IT
|