ilia f5179304c1
Some checks failed
CI / Linting (Biome) (push) Failing after 41s
CI / Tests (push) Successful in 5m27s
CI / Type Check (adzuna-extractor) (push) Successful in 1m9s
CI / Type Check (gradcracker-extractor) (push) Successful in 1m13s
CI / Type Check (hiringcafe-extractor) (push) Successful in 1m9s
CI / Type Check (orchestrator) (push) Successful in 1m24s
CI / Type Check (startupjobs-extractor) (push) Successful in 1m8s
CI / Type Check (ukvisajobs-extractor) (push) Successful in 1m9s
CI / Documentation (push) Successful in 1m59s
feat(discovery): blocked countries filter and smoke subprocess fixes
Add blockedCountries in Settings so pipeline discovery drops jobs whose
location mentions listed countries (existing discovered rows are kept).
Document the feature, fix smoke tsconfig inheritance for nested extractors,
and run smoke via an absolute-tsconfig wrapper.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-05-16 11:41:29 -04:00

1.6 KiB

id, title, description, sidebar_position
id title description sidebar_position
arcdev Arc.dev Extractor Remote tech roles from Arc.dev listing pages via embedded Next.js data. 17

What it is

Arc.dev exposes remote job listings on paths such as /remote-jobs/playwright and /remote-jobs/cypress. The extractor downloads SSR HTML and parses the embedded __NEXT_DATA__ payload (Arc-managed and external rows).

Implementation: extractors/arcdev/manifest.ts.

Why it exists

Curated remote hiring with explicit tooling-oriented feeds; many roles are open to North America when labeled that way on the site.

How to use it

  1. Enable Arc.dev in pipeline sources (no credentials).
  2. Configure arcRemoteJobsPaths as a JSON array of path strings (defaults include Playwright and Cypress remote feeds). Optionally seed defaults from ARC_REMOTE_JOBS_PATHS (comma-separated paths).
  3. Set arcMaxJobsPerPath (default 120, max 300) to cap rows per listing URL after deduplication.
  4. Align searchTerms with titles or stacks you care about; empty-term behavior is handled inside the manifest per path.

Common problems

  • HTML changes: If Arc ships a new payload shape, parsing may need an update; smoke-test with npm run smoke:extractors -- arcdev, or run the full suite with npm run smoke:extractors.
  • Arc talent network employer: Some Arc-managed rows omit a company name; the mapper uses that placeholder.