Some checks failed
CI / Linting (Biome) (push) Failing after 41s
CI / Tests (push) Successful in 5m27s
CI / Type Check (adzuna-extractor) (push) Successful in 1m9s
CI / Type Check (gradcracker-extractor) (push) Successful in 1m13s
CI / Type Check (hiringcafe-extractor) (push) Successful in 1m9s
CI / Type Check (orchestrator) (push) Successful in 1m24s
CI / Type Check (startupjobs-extractor) (push) Successful in 1m8s
CI / Type Check (ukvisajobs-extractor) (push) Successful in 1m9s
CI / Documentation (push) Successful in 1m59s
Add blockedCountries in Settings so pipeline discovery drops jobs whose location mentions listed countries (existing discovered rows are kept). Document the feature, fix smoke tsconfig inheritance for nested extractors, and run smoke via an absolute-tsconfig wrapper. Co-authored-by: Cursor <cursoragent@cursor.com>
1.6 KiB
1.6 KiB
id, title, description, sidebar_position
| id | title | description | sidebar_position |
|---|---|---|---|
| arcdev | Arc.dev Extractor | Remote tech roles from Arc.dev listing pages via embedded Next.js data. | 17 |
What it is
Arc.dev exposes remote job listings on paths such as /remote-jobs/playwright and /remote-jobs/cypress. The extractor downloads SSR HTML and parses the embedded __NEXT_DATA__ payload (Arc-managed and external rows).
Implementation: extractors/arcdev/manifest.ts.
Why it exists
Curated remote hiring with explicit tooling-oriented feeds; many roles are open to North America when labeled that way on the site.
How to use it
- Enable Arc.dev in pipeline sources (no credentials).
- Configure
arcRemoteJobsPathsas a JSON array of path strings (defaults include Playwright and Cypress remote feeds). Optionally seed defaults fromARC_REMOTE_JOBS_PATHS(comma-separated paths). - Set
arcMaxJobsPerPath(default120, max300) to cap rows per listing URL after deduplication. - Align
searchTermswith titles or stacks you care about; empty-term behavior is handled inside the manifest per path.
Common problems
- HTML changes: If Arc ships a new payload shape, parsing may need an update; smoke-test with
npm run smoke:extractors -- arcdev, or run the full suite withnpm run smoke:extractors. Arc talent networkemployer: Some Arc-managed rows omit a company name; the mapper uses that placeholder.