
Job Overview
Location: San Francisco
Job Type: Full-time
Category: Software Engineering
Date Posted: June 4, 2026
Full Job Description
📋 Description
- Lead development of the core agent intelligence layer that executes multi-step workflows across complex desktop engineering software including CAD, CAE, and PLM systems.
- Serve as technical lead for a small team of AI engineers, a user researcher, and domain expert contractors, reporting directly to the CTO.
- Own the full product loop: define agent capabilities from user stories, build implementations, and benchmark against real-world engineering workflows.
- Drive agent task success rate by defining evaluation frameworks, establishing baselines, and systematically improving completion metrics.
- Set and enforce per-task token budgets and track cost per completed workflow to ensure commercial viability of AI-driven workflows.
- Build rigorous, reproducible evaluation infrastructure grounded in validated user stories, applying SWE-bench-level rigor to engineering workflows.
- Lead user story mapping and validation through direct interviews and collaboration with mechanical engineering domain experts.
- Translate validated user stories into testable evaluations, closing the loop between user research and benchmarking.
- Own agent architecture decisions including tool-calling strategies, state management, error recovery, model routing, and context management.
- Act as a player-coach: write production code, review technical designs, unblock team members, and raise overall engineering standards.
- Collaborate cross-functionally with integrations, product, and customer teams during proof-of-concept (POC) deployments to align agent behavior with real-world usage.
- Ensure agent systems operate reliably under enterprise constraints, including locked-down corporate workstations and restricted environments.
- Implement and maintain observability tooling using platforms such as Logfire or LangSmith for tracing, monitoring, and debugging agent behavior.
- Apply deep expertise in LLM application architectures: model selection, context/window management, retrieval strategies, and orchestration patterns.
- Deliver AI systems with measurable outcomes, including task success rate and cost efficiency, not just prototypes or demos.
- Maintain high standards for reproducibility, scalability, and reliability in agent systems deployed to Fortune 100 hardware engineering customers.
- Contribute to the strategic technical direction of the company’s agentic AI platform as a senior technical leader in an early-stage, well-funded startup.
🎯 Requirements
- 7+ years in software engineering, including at least 2 years building LLM-based agents that act in the real world (tool-calling, multi-step workflows, failure handling, cost constraints)
- Deep experience designing LLM application architectures: model selection, context/window management, retrieval strategies, tool-calling frameworks, and orchestration patterns
- Strong evaluation and benchmarking instincts for agentic systems (task completion, cost efficiency, failure mode analysis); familiarity with SWE-bench, GAIA, or τ-bench
- Proven track record shipping AI systems with measurable outcomes (e.g., agent task success rate, cost efficiency), not just demos
- Strong Python skills and hands-on experience with LLM tooling (function calling, tool use APIs, tracing/observability tools such as Logfire or LangSmith, evaluation frameworks)
- Experience leading a small technical team (3–6 engineers): setting direction, performing code reviews, driving architecture decisions
🏖️ Benefits
- Salary: $160,000 – $250,000 USD annually, depending on experience
- Equity participation in an early-stage, Series A company
- Visa sponsorship is not available for this role
About Clera
Clera, Inc. operates as an AI Talent Agent, leveraging artificial intelligence to enhance and streamline various aspects of the talent acquisition and management process. The company's core offering appears to center on an AI-powered platform designed to connect skilled individuals with suitable opportunities or to assist organizations in efficiently sourcing and evaluating candidates. While comprehensive details regarding specific features, target industries, or the scale of its operations are not yet publicly available, the current online presence indicates that Clera's services are actively being prepared for launch. This suggests an upcoming introduction of innovative AI solutions aimed at optimizing the ecosystem where talent meets demand.