---
id: gradcracker
title: Gradcracker Extractor
description: How the Gradcracker crawler builds search URLs and extracts jobs.
sidebar_position: 2
---

A plain-English walkthrough of the Gradcracker extractor in `extractors/gradcracker`.

Original website: [gradcracker.com](https://www.gradcracker.com)

## Big picture

The crawler builds search URLs, scrapes listing pages, then opens job details for descriptions and apply URLs.

## 1) Build search URLs

- Combines UK regions with role terms.
- Defaults include roles such as `web-development` and `software-systems`.
- `GRADCRACKER_SEARCH_TERMS` overrides defaults.

## 2) Crawl list pages

- Waits for job cards (`article[wire:key]`).
- Extracts title, employer, discipline, deadline, salary, location, degree, start date.
- Queues job detail pages.

Controls:

- `GRADCRACKER_MAX_JOBS_PER_TERM`
- `JOBOPS_SKIP_APPLY_FOR_EXISTING=1`
- `JOBOPS_EXISTING_JOB_URLS` / `JOBOPS_EXISTING_JOB_URLS_FILE`

## 3) Crawl detail pages

- Waits for `.body-content`
- Captures full description text
- Clicks apply button to resolve final application URL
- Handles popup and same-tab redirects

## 4) Progress reporting

Set `JOBOPS_EMIT_PROGRESS=1` for structured progress lines consumable by orchestrator UI.

## Notes

- Uses Playwright + Crawlee via Camoufox.
- Low concurrency and longer timeouts for stability.