Crawler for exporting Workback issue pages into per-issue markdown dossiers in issues/<id>/issue-<id>.md.
- Node.js 20+
- Valid Workback cookies from
app.workback.ai(pass via--cookie)
Discover new issue IDs from the Workback issues index, then crawl only IDs that are not already present in issues/:
npm run crawl-new -- --cookie "csrftoken=...; sessionid=..."Optional tuning:
node crawl-issues.js \
--discover-new \
--discover-pages 8 \
--discover-stop-after-stale-pages 2 \
--cookie "csrftoken=...; sessionid=..."Dry-run (discovery without writing files):
node crawl-issues.js --discover-new --dry-run --cookie "csrftoken=...; sessionid=..."- Crawl from CSV export IDs:
node crawl-issues.js --from-csv --missing-only --cookie "..." - Crawl from explicit IDs file:
node crawl-issues.js --ids-file /tmp/ids.txt --missing-only --cookie "..." - Fix broken downloaded images:
node crawl-issues.js --fix-images --cookie "..."