Firecrawl Inc. logo

Product Engineer — Scrape

Job Overview

Location

San Francisco, CA (Hybrid) OR Remote (Americas, UTC-3 to UTC-10)

Job Type

Full-time

Category

Software Engineering

Date Posted

May 22, 2026

Full Job Description

📋 Description

  • Own Firecrawl's flagship scrape endpoint end-to-end, the API that converts any URL into clean, LLM-ready markdown or structured data with a single call — the product used by 100k+ developers and central to Firecrawl’s growth.
  • Make the scrape product unbeatable by relentlessly improving latency, reliability, error messaging, response format, and markdown quality — every detail that affects developer experience is your responsibility.
  • Solve the messy web’s edge cases: JavaScript-heavy SPAs, anti-bot systems, dynamic content, infinite scrolls, broken HTML, and unusual charsets — ensuring the API "just works" without exposing complexity to users.
  • Design and refine structured extraction features (schema-based, JSON mode, prompt-based) so developers can skip LLM calls entirely, building trust that the output is reliable enough to depend on in production.
  • Treat output quality as a product decision: determine what to preserve, strip, flatten, or restructure in scraped content to optimize it for LLM context windows — balancing fidelity with usability.
  • Dogfood the product daily: use the API in your own side projects, read every GitHub issue, Discord thread, and support ticket related to scrape, and fix friction before users report it.
  • Run fast product experiments: form hypotheses about improvements, instrument them, ship changes quickly, measure impact, and iterate — no waiting for perfect data or perfect scope.
  • Elevate developer experience by ensuring API ergonomics are intuitive, response formats are consistent, documentation matches behavior, and error codes are actionable — technical users notice and expect excellence.
  • Operate under real production load: understand what breaks first under scale, instrument key metrics, and make tradeoffs between latency, quality, and cost that serve thousands of daily users.
  • Ship features from design to deployment without needing a product manager — define what good looks like, prioritize based on user feedback, and own outcomes end-to-end.
  • Maintain deep technical instincts for browser automation (Playwright, Puppeteer, Chromium), crawling infrastructure, and data extraction systems — you’ve felt the pain of flaky scrapers at 3am and built solutions that don’t break.
  • Have hands-on experience building developer-facing APIs used by thousands, with a strong backend bias and full-stack capability to implement both infrastructure and user-facing interfaces.
  • Bring production experience with data infrastructure or developer tools, having shipped products that others rely on daily — not just built features, but maintained critical infrastructure.
  • Understand what makes data "LLM-ready": you’ve built on top of LLMs and know how noisy, poorly structured context degrades model performance — you intuitively structure scraped content to maximize prompt effectiveness.
  • Work at high velocity: move from customer pain point to shipped fix in days, not sprints — the feedback loop is fast, and slow iteration directly impacts adoption and trust.
  • Use the product yourself: you’re the kind of engineer who reads documentation critically, builds side projects with APIs like Firecrawl’s, and notices when something feels off.

🎯 Requirements

  • 3+ years shipping developer-facing products, ideally in scraping, crawling, browser automation, or data infrastructure
  • Deep instincts for handling the messy web — including anti-bot systems, JavaScript rendering, and dynamic content extraction
  • Obsessive focus on developer experience — API ergonomics, response format, error messages, and documentation matter as much as functionality
  • Hands-on builder who owns features from design to deployment without needing a PM or perfectly scoped ticket
  • Experience with LLMs and understanding of what makes scraped data clean, structured, and optimal for LLM context windows
  • Production experience operating systems under real load with strong instincts for latency, reliability, and cost-quality tradeoffs

🏖️ Benefits

  • Salary range of $180,000 to $290,000/year (U.S.-based); adjusted fairly for remote employees outside the U.S. based on cost of living
  • Up to 0.15% equity in Firecrawl
  • 15 days mandatory PTO; any time beyond 24 days can be taken with approval (holidays excluded)
  • 12 weeks fully paid parental leave for all parents
  • $100/month wellness stipend for gym, therapy, massages, or other wellness expenses
  • $1,000/year learning and development budget for professional growth
  • Full medical, dental, and vision coverage (100% for employees, 50% for dependents) for U.S.-based employees
  • 401(k) plan with employer contribution for U.S.-based employees
  • E-bike loaner for San Francisco-based employees
  • Team offsites and a 3-month paid sabbatical after 4 years

Skills & Technologies

JavaScript
Go
GitHub
Remote
$180k-290k

Ready to Apply?

You will be redirected to an external site to apply.

Firecrawl Inc. logo
Firecrawl Inc.
Visit Website

About Firecrawl Inc.

Firecrawl Inc. provides an API that converts entire websites into clean markdown or structured data. Designed for AI applications, the service crawls all accessible subpages, renders dynamic content, and returns LLM-ready output without requiring sitemaps. It includes built-in scraping, search, and extraction capabilities for building knowledge bases, fine-tuning datasets, or powering chatbots. The company targets developers and data teams who need reliable web content ingestion at scale, offering cloud-hosted endpoints and self-hosted options under a usage-based pricing model.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Expired
Any Location / Remote
Full-time
Expired May 2, 2026
GitLab
Remote
Degree Required

3 months ago

Apply
Expires soon
Remote: ANZ
Full-time
Expires Jun 13, 2026 (Soon)
Python
JavaScript
GCP
+5 more

2 months ago

Apply
Expires soon
Sedgwick Claims Management Services, Inc. logo

Sedgwick Claims Management Services, Inc.

US Telecommuter
Full-time
Expires Jun 11, 2026 (Soon)
Remote
$18-32/hr
Degree Required

2 months ago

Apply
Expires soon
Remote
Full-time
Expires Jun 14, 2026 (Soon)
Remote

2 months ago

Apply