Cloudflare Worker API proxy for index.commoncrawl.org.
All proxied routes are exposed under:
/api/v1/index/*
pnpm --filter @commoncrawl.cc/api devThis runs wrangler dev src/index.ts.
pnpm --filter @commoncrawl.cc/api buildThis runs a dry-run deploy build via Wrangler.
Generated spec endpoint:
/openapi.json
Defined in wrangler.toml:
CORS_ALLOW_ORIGIN(default:https://commoncrawl.cc)INDEX_UPSTREAM_BASE_URL(default:https://index.commoncrawl.org)INDEX_UPSTREAM_TIMEOUT_MS(default:20000)
- Supports
GET+OPTIONS. - Adds CORS headers for browser fetch/XHR.
- Forwards safe request headers (
Accept,Accept-Datetime, cache/range headers). - Uses Valibot +
@hono/standard-validatorfor route-level input validation. - Includes CORS + CSRF protections at the proxy layer.