Some checks failed
CI / Linting (Biome) (push) Failing after 41s
CI / Tests (push) Successful in 5m27s
CI / Type Check (adzuna-extractor) (push) Successful in 1m9s
CI / Type Check (gradcracker-extractor) (push) Successful in 1m13s
CI / Type Check (hiringcafe-extractor) (push) Successful in 1m9s
CI / Type Check (orchestrator) (push) Successful in 1m24s
CI / Type Check (startupjobs-extractor) (push) Successful in 1m8s
CI / Type Check (ukvisajobs-extractor) (push) Successful in 1m9s
CI / Documentation (push) Successful in 1m59s
Add blockedCountries in Settings so pipeline discovery drops jobs whose location mentions listed countries (existing discovered rows are kept). Document the feature, fix smoke tsconfig inheritance for nested extractors, and run smoke via an absolute-tsconfig wrapper. Co-authored-by: Cursor <cursoragent@cursor.com>
35 lines
1.6 KiB
Markdown
35 lines
1.6 KiB
Markdown
---
|
|
id: arcdev
|
|
title: Arc.dev Extractor
|
|
description: Remote tech roles from Arc.dev listing pages via embedded Next.js data.
|
|
sidebar_position: 17
|
|
---
|
|
|
|
## What it is
|
|
|
|
[Arc.dev](https://arc.dev) exposes remote job listings on paths such as `/remote-jobs/playwright` and `/remote-jobs/cypress`. The extractor downloads SSR HTML and parses the embedded `__NEXT_DATA__` payload (Arc-managed and external rows).
|
|
|
|
Implementation: `extractors/arcdev/manifest.ts`.
|
|
|
|
## Why it exists
|
|
|
|
Curated remote hiring with explicit tooling-oriented feeds; many roles are open to North America when labeled that way on the site.
|
|
|
|
## How to use it
|
|
|
|
1. Enable **Arc.dev** in pipeline sources (no credentials).
|
|
2. Configure **`arcRemoteJobsPaths`** as a JSON array of path strings (defaults include Playwright and Cypress remote feeds). Optionally seed defaults from **`ARC_REMOTE_JOBS_PATHS`** (comma-separated paths).
|
|
3. Set **`arcMaxJobsPerPath`** (default `120`, max `300`) to cap rows per listing URL after deduplication.
|
|
4. Align **`searchTerms`** with titles or stacks you care about; empty-term behavior is handled inside the manifest per path.
|
|
|
|
## Common problems
|
|
|
|
- **HTML changes:** If Arc ships a new payload shape, parsing may need an update; smoke-test with `npm run smoke:extractors -- arcdev`, or run the full suite with `npm run smoke:extractors`.
|
|
- **`Arc talent network` employer:** Some Arc-managed rows omit a company name; the mapper uses that placeholder.
|
|
|
|
## Related pages
|
|
|
|
- [Extractors overview](/docs/next/extractors/overview)
|
|
- [Canadian QA contracting firms](/docs/next/extractors/qa-contract-staffing-canada)
|
|
- [Manual Import](/docs/next/extractors/manual)
|