
Job Overview
Location
New York, NY
Job Type
Full-time
Category
Software Engineering
Date Posted
June 4, 2026
Full Job Description
đź“‹ Description
- • Design and execute structured, statistically rigorous experiments to evaluate and improve Mia, an AI voice agent handling inbound patient calls for healthcare providers.
- • Build and maintain regression test coverage to detect unintended side effects of prompt changes across multi-turn, voice-first workflows.
- • Create a validated pipeline of AI improvements that Engineering can confidently deploy, replacing intuition-based changes with evidence-backed decisions.
- • Own the design, testing, and iteration of prompts across all Mia agent workflows, ensuring alignment with healthcare context, compliance, and user intent.
- • Stay current with emerging research in prompting techniques, agentic architectures, and LLM evaluation, translating academic and industry advances into testable prototypes.
- • Prototype new prompting and agent behavior approaches locally before any changes are introduced to production systems.
- • Serve as the primary resource for Product and Engineering teams when resolving complex prompting challenges or ambiguous AI behaviors.
- • Translate experimental results into clear, actionable recommendations for both technical and non-technical stakeholders.
- • Develop reusable evaluation templates, behavioral test suites, and validation frameworks to institutionalize rigorous AI assessment practices.
- • Collaborate with cross-functional teams to prioritize experiments based on impact, feasibility, and alignment with clinical and operational goals.
- • Evaluate and interpret LLM behavior to understand why systems succeed or fail in specific patient interactions, using both quantitative metrics and qualitative analysis.
- • Work with RAG systems and modern LLM tooling to enhance Mia’s contextual accuracy and reliability in high-stakes healthcare scenarios.
- • Ensure all AI improvements are validated for safety, consistency, and compliance within regulated healthcare environments.
- • Communicate findings through written reports and presentations that bridge the gap between research insights and product implementation.
- • Maintain a deep focus on real-world patient outcomes, ensuring AI enhancements improve both clinical efficiency and patient experience.
🎯 Requirements
- • 5+ years of experience in a prompting, AI research, or applied AI role
- • Advanced degree in a research-oriented field (PhD preferred) in CS, linguistics, cognitive science, stats, or similar
- • Real prompt engineering experience — deliberately designing, testing, and improving prompts to change system behavior
- • Solid experimental design fundamentals: controls, statistical significance, understanding when results are meaningful
- • Hands-on experience working with LLMs in applied contexts
- • Comfort with RAG, agentic architectures, and modern LLM tooling
🏖️ Benefits
- • Base salary: $180,000 - $230,000
- • Meaningful equity ownership in a fast-growing healthtech startup
- • Work on an AI voice agent used in real healthcare settings with direct patient impact
- • Collaborate with a founding team with proven experience scaling technology at Carbon Health
Skills & Technologies
About Hellopatient Inc.
Hellopatient is a digital health company focused on improving patient care coordination and communication. They offer a platform that connects patients with their healthcare providers, streamlining appointment scheduling, prescription refills, and access to medical records. The company aims to reduce administrative burdens for clinics and empower patients with more control over their health journey. Their services are designed for various medical practices, enhancing efficiency and patient engagement within the healthcare ecosystem. Hellopatient operates within the rapidly growing digital health and healthtech industries, leveraging technology to make healthcare more accessible and manageable.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Workato, Inc.
4 days ago

Nebius Group N.V.
3 months ago

Deepgram Inc.
2 months ago