ilia c840f289e1
Some checks failed
CI / Linting (Biome) (push) Failing after 40s
CI / Tests (push) Successful in 5m54s
CI / Type Check (adzuna-extractor) (push) Successful in 1m8s
CI / Type Check (gradcracker-extractor) (push) Successful in 1m11s
CI / Type Check (hiringcafe-extractor) (push) Successful in 1m8s
CI / Type Check (orchestrator) (push) Successful in 1m23s
CI / Type Check (startupjobs-extractor) (push) Successful in 1m6s
CI / Type Check (ukvisajobs-extractor) (push) Successful in 1m7s
CI / Documentation (push) Successful in 1m54s
feat(extractors): expand catalog, smoke coverage, and sourcing docs
Adds Arc.dev, BC T-Net, Eluta, iCIMS tenants, QAJobsBoard, and SmartRecruiters
manifests with registry/settings/UI wiring; registers full extractor list in
smoke-extractors and documents supplementary board access paths. Aligns Careerjet
v4 with the url query parameter and fixes strict typing in QAJobsBoard.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-05-15 22:36:23 -04:00

1.7 KiB

id, title, description, sidebar_position
id title description sidebar_position
arcdev Arc.dev Extractor Remote tech roles from Arc.dev listing pages via embedded Next.js data. 17

What it is

Arc.dev exposes remote job listings on paths such as /remote-jobs/playwright and /remote-jobs/cypress. The extractor downloads SSR HTML and parses the embedded __NEXT_DATA__ payload (Arc-managed and external rows).

Implementation: extractors/arcdev/manifest.ts.

Why it exists

Curated remote hiring with explicit tooling-oriented feeds; many roles are open to North America when labeled that way on the site.

How to use it

  1. Enable Arc.dev in pipeline sources (no credentials).
  2. Configure arcRemoteJobsPaths as a JSON array of path strings (defaults include Playwright and Cypress remote feeds). Optionally seed defaults from ARC_REMOTE_JOBS_PATHS (comma-separated paths).
  3. Set arcMaxJobsPerPath (default 120, max 300) to cap rows per listing URL after deduplication.
  4. Align searchTerms with titles or stacks you care about; empty-term behavior is handled inside the manifest per path.

Common problems

  • HTML changes: If Arc ships a new payload shape, parsing may need an update; smoke-test with npx tsx scripts/smoke-extractors.ts arcdev, or run the full extractor suite with npx tsx scripts/smoke-extractors.ts.
  • Arc talent network employer: Some Arc-managed rows omit a company name; the mapper uses that placeholder.