ilia c840f289e1
feat(extractors): expand catalog, smoke coverage, and sourcing docs
Adds Arc.dev, BC T-Net, Eluta, iCIMS tenants, QAJobsBoard, and SmartRecruiters
manifests with registry/settings/UI wiring; registers full extractor list in
smoke-extractors and documents supplementary board access paths. Aligns Careerjet
v4 with the url query parameter and fixes strict typing in QAJobsBoard.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-05-15 22:36:23 -04:00

arcdev-extractor

Reads Arc remote-job listings from server-rendered (SSR) HTML: each page embeds a __NEXT_DATA__ payload containing arcJobs (Arc talent network) and externalJobs (partner postings).
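As a rough illustration of the approach described above, the payload can be pulled out of the page's __NEXT_DATA__ script tag and searched for the two arrays. This is a minimal sketch, not the extractor's actual code: the helper names extractNextData and findArray are hypothetical, and because the README does not say where arcJobs and externalJobs sit inside the payload, the sketch searches for the keys instead of assuming a fixed path.

```typescript
// Hypothetical helpers; only __NEXT_DATA__, arcJobs, and externalJobs come from the README.
type JsonValue =
  | null
  | boolean
  | number
  | string
  | JsonValue[]
  | { [key: string]: JsonValue };

/** Parse the JSON body of the <script id="__NEXT_DATA__"> tag, if present. */
function extractNextData(html: string): JsonValue | undefined {
  const match = html.match(
    /<script[^>]*\bid="__NEXT_DATA__"[^>]*>([\s\S]*?)<\/script>/,
  );
  const raw = match?.[1];
  if (raw === undefined) return undefined;
  try {
    return JSON.parse(raw) as JsonValue;
  } catch {
    return undefined; // malformed payload: treat the page as having no data
  }
}

/** Depth-first search for the first non-empty array stored under `key`. */
function findArray(node: JsonValue | undefined, key: string): JsonValue[] {
  if (node === null || node === undefined || typeof node !== "object") {
    return [];
  }
  if (Array.isArray(node)) {
    for (const item of node) {
      const found = findArray(item, key);
      if (found.length > 0) return found;
    }
    return [];
  }
  const direct = node[key];
  if (Array.isArray(direct) && direct.length > 0) return direct;
  for (const value of Object.values(node)) {
    const found = findArray(value, key);
    if (found.length > 0) return found;
  }
  return [];
}
```

Searching by key keeps the sketch tolerant of changes in how the page nests its props, at the cost of a full walk over the payload.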

Configure arcRemoteJobsPaths as URL paths on https://arc.dev, for example:

  • /remote-jobs/playwright
  • /remote-jobs/cypress
  • /remote-jobs/selenium

Alternatively, set ARC_REMOTE_JOBS_PATHS to a comma- or newline-separated list of paths. The defaults include the Playwright and Cypress stacks.
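One way the comma/newline-separated value might be turned into request URLs is sketched below. The function names parseArcPaths and toArcUrl are hypothetical, and DEFAULT_PATHS lists only the two defaults the README mentions, so the real default list may differ.

```typescript
const ARC_BASE_URL = "https://arc.dev";

// Assumption: the README only says the defaults "include" these two paths.
const DEFAULT_PATHS: string[] = [
  "/remote-jobs/playwright",
  "/remote-jobs/cypress",
];

/** Split a raw ARC_REMOTE_JOBS_PATHS value on commas and newlines. */
function parseArcPaths(raw: string | undefined): string[] {
  const paths = (raw ?? "")
    .split(/[,\n]/)
    .map((part) => part.trim()) // also strips a trailing \r from CRLF input
    .filter((part) => part.length > 0)
    .map((part) => (part.startsWith("/") ? part : `/${part}`));
  const unique = Array.from(new Set(paths)); // drop repeated paths, keep order
  return unique.length > 0 ? unique : DEFAULT_PATHS.slice();
}

/** Build the absolute URL for one configured path. */
function toArcUrl(path: string): string {
  return `${ARC_BASE_URL}${path}`;
}
```

For example, `parseArcPaths("/remote-jobs/playwright, remote-jobs/selenium")` yields the Playwright and Selenium paths, and an empty or unset value falls back to the assumed defaults.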

Employer names: external jobs include company.name. Arc-managed listings omit company names from the payload, so those rows use the employer "Arc talent network" while preserving titles and skill categories.
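The employer fallback described above could look roughly like the following. The RawArcListing shape and the resolveEmployer helper are hypothetical; only the company.name field and the "Arc talent network" label come from the README.

```typescript
// Hypothetical listing shape; real payloads carry more fields.
interface RawArcListing {
  title: string;
  company?: { name?: string | null } | null;
}

const ARC_NETWORK_EMPLOYER = "Arc talent network";

/** Use company.name when present, otherwise the Arc talent network label. */
function resolveEmployer(listing: RawArcListing): string {
  const name = listing.company?.name?.trim();
  return name ? name : ARC_NETWORK_EMPLOYER;
}
```

Treating a blank or whitespace-only name as missing is a choice made for this sketch, not documented behavior.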

Cap the merged matches from each fetch via arcMaxJobsPerPath. The limit is applied separately to each configured path and defaults to 120.
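A per-path cap of this kind can be sketched as a merge followed by a slice. The mergeAndCap helper is hypothetical, and placing Arc-managed listings before external ones is an assumption; the README does not state the merge order.

```typescript
/**
 * Merge the two listing arrays for ONE path and keep at most maxJobsPerPath.
 * Assumption: Arc-managed listings come first; the README gives no order.
 */
function mergeAndCap<T>(
  arcJobs: T[],
  externalJobs: T[],
  maxJobsPerPath: number = 120,
): T[] {
  const limit = Math.max(0, Math.floor(maxJobsPerPath)); // guard bad input
  return arcJobs.concat(externalJobs).slice(0, limit);
}
```

Calling the helper once per configured path means each path gets its own budget: with a cap of 120 and two paths, up to 240 rows could be returned in total.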