Hiring Cafe Extractor
Browser-backed extractor for Hiring Cafe search APIs.
Special thanks: initial implementation inspiration came from umur957/hiring-cafe-job-scraper.
Environment
- `HIRING_CAFE_SEARCH_TERMS` (JSON array, or `|` / comma / newline-delimited)
- `HIRING_CAFE_COUNTRY` (default: `united kingdom`)
- `HIRING_CAFE_MAX_JOBS_PER_TERM` (default: `200`)
- `HIRING_CAFE_DATE_FETCHED_PAST_N_DAYS` (default: `7`)
- `HIRING_CAFE_OUTPUT_JSON` (default: `storage/datasets/default/jobs.json`)
- `JOBOPS_EMIT_PROGRESS=1` to emit `JOBOPS_PROGRESS` events
- `HIRING_CAFE_HEADLESS=false` to run headed
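As a rough illustration of how these variables could be read, here is a minimal sketch. The names `parseSearchTerms`, `readConfig`, and `ExtractorConfig` are hypothetical and not taken from the extractor. The fallback for invalid numbers and the assumption that any value other than `false` means headless are guesses, not documented behavior.

```typescript
// Hypothetical config shape; the real extractor may use different names.
interface ExtractorConfig {
  searchTerms: string[];
  country: string;
  maxJobsPerTerm: number;
  dateFetchedPastNDays: number;
  outputJson: string;
  emitProgress: boolean;
  headless: boolean;
}

// Accepts either a JSON array or a |, comma, or newline-delimited string.
function parseSearchTerms(raw: string | undefined): string[] {
  const value = (raw ?? "").trim();
  if (value === "") return [];
  if (value.startsWith("[")) {
    try {
      const parsed: unknown = JSON.parse(value);
      if (Array.isArray(parsed)) {
        return parsed
          .map((item) => String(item).trim())
          .filter((term) => term.length > 0);
      }
    } catch {
      // Not valid JSON: fall through to delimiter splitting.
    }
  }
  return value
    .split(/[|,\n]/)
    .map((term) => term.trim())
    .filter((term) => term.length > 0);
}

// Assumption: non-numeric or non-positive values fall back to the default.
function parsePositiveInt(raw: string | undefined, fallback: number): number {
  const parsed = Number.parseInt(raw ?? "", 10);
  return Number.isFinite(parsed) && parsed > 0 ? parsed : fallback;
}

function readConfig(env: Record<string, string | undefined>): ExtractorConfig {
  return {
    searchTerms: parseSearchTerms(env.HIRING_CAFE_SEARCH_TERMS),
    country: env.HIRING_CAFE_COUNTRY?.trim() || "united kingdom",
    maxJobsPerTerm: parsePositiveInt(env.HIRING_CAFE_MAX_JOBS_PER_TERM, 200),
    dateFetchedPastNDays: parsePositiveInt(
      env.HIRING_CAFE_DATE_FETCHED_PAST_N_DAYS,
      7,
    ),
    outputJson:
      env.HIRING_CAFE_OUTPUT_JSON?.trim() ||
      "storage/datasets/default/jobs.json",
    emitProgress: env.JOBOPS_EMIT_PROGRESS === "1",
    // Assumption: headless unless explicitly set to "false".
    headless: env.HIRING_CAFE_HEADLESS !== "false",
  };
}
```

In a Node process this would typically be called as `readConfig(process.env)`.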
Notes
- The extractor uses `s = base64(url-encoded JSON search state)`.
- `worldwide` and `usa/ca` are treated as broad search modes without hard country location filters.
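The `s` encoding noted above can be sketched as follows. This assumes "url-encoded" means `encodeURIComponent` applied to the `JSON.stringify` output, and that standard (not URL-safe) base64 is used. The helper names and the fields in the example state are hypothetical; the real Hiring Cafe search state schema is not documented here.

```typescript
// Hypothetical helpers illustrating s = base64(url-encoded JSON search state).
// Assumes a runtime with global btoa/atob (browsers, Node 16+).

function encodeSearchState(state: unknown): string {
  const json = JSON.stringify(state);
  // encodeURIComponent yields ASCII only, so btoa can encode it safely.
  const urlEncoded = encodeURIComponent(json);
  return btoa(urlEncoded);
}

function decodeSearchState(s: string): unknown {
  const urlEncoded = atob(s);
  return JSON.parse(decodeURIComponent(urlEncoded));
}

// Illustrative state only; these field names are assumptions.
const exampleState = {
  searchQuery: "data engineer",
  dateFetchedPastNDays: 7,
};

const s = encodeSearchState(exampleState);
const roundTripped = decodeSearchState(s);
```

URL-encoding before base64 keeps non-ASCII characters in search terms (for example accented job titles) from breaking `btoa`, which only accepts Latin-1 input.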