
Job Overview
Location
San Francisco, California, USA
Job Type
Full-time
Category
Data Engineer
Date Posted
February 28, 2026
Full Job Description
📋 Description
- • At Parallel Bio, we are at the forefront of revolutionizing drug discovery by harnessing the power of the human immune system. We are building a groundbreaking platform that integrates best-in-class human immune organoids with massive scale and advanced computational methods, including AI and machine learning. This unique approach allows us to generate unprecedented, population-scale insights into human health and disease, enabling the rapid discovery of safer, more effective drugs that are designed to work for patients from the outset and across diverse populations.
- • As a Data Application Engineer for our Foundry platform, you will play a pivotal role in extending leadership capabilities within the Data & Infrastructure team. This strategic and operational position requires you to possess a comprehensive understanding of the department's workstreams, acting as a trusted proxy to make informed decisions, remove obstacles for teams, and drive execution with a high degree of autonomy. The ideal candidate will demonstrate strong Palantir Foundry fluency, a genuine curiosity for the underlying science, and the operational acumen to proactively identify and address the highest-value tasks without explicit direction.
- • This is not a typical individual contributor role. It is a dynamic position situated at the critical intersection of technical execution, project management, and strategic planning. You will fluidly adapt your approach across these modes to meet the evolving needs of the department, ensuring seamless integration and efficient progress.
- • Your primary ownership will encompass the North of Ontology Data Systems & Platform Infrastructure. This involves building and continuously evolving the ontology – the core structure of actions, objects, and links – that accurately represents our complex biological workflows within Foundry. You will also be responsible for developing bespoke, user-friendly React applications that empower our scientists and customers, making sophisticated data accessible and actionable.
- • You will drive the comprehensive buildout of our experimental data pipelines, meticulously architecting our storage solutions, and developing advanced analytical tooling. A key aspect of this role is to develop a deep, working understanding of our existing data landscape: what data we possess, its inherent value, and any critical gaps. Based on this understanding, you will architect and implement workflows designed to unlock the full potential of this data.
- • As our dataset volume and complexity grow, you will define and rigorously enforce data standards, schemas, and governance protocols. This will involve close collaboration with our Automation and Science teams to ensure these standards accurately reflect real-world experimental workflows, proactively preventing the accumulation of data debt and maintaining data integrity from the source.
- • You will act as a champion for data-driven discovery across the entire organization. This includes raising Foundry literacy among our scientific staff, empowering them to transition from raw data to actionable insights with increasing independence and confidence.
- • A crucial part of your role will be to proactively identify technical debt and infrastructure gaps within the platform. You will then be responsible for scoping, prioritizing, and leading the remediation efforts to ensure the long-term health and scalability of our data systems.
- • Your responsibilities will extend to fostering strong Science Team Partnerships. This requires developing a genuine, working-level understanding of the science teams' priorities, their experimental roadmaps, and their active book of work. You will achieve this by actively participating in their discussions and processes, rather than relying on secondhand summaries, ensuring our data infrastructure directly supports their critical research needs.
- • You will ensure that the Data & Infrastructure team builds solutions that align with the actual requirements of the science teams, avoiding the creation of systems that are logically sound from a purely technical perspective but disconnected from practical scientific application.
- • You will be instrumental in identifying where data capture or pipeline inefficiencies create friction for researchers, treating these issues with the same urgency and priority as internal engineering challenges.
- • Building sufficient trust with science leadership will enable you to anticipate future needs and proactively scope work, ensuring our data capabilities remain ahead of the curve.
- • Coordination with the Automation Team is also a key component. You will stay abreast of the automation team's roadmap to ensure our data infrastructure remains fully compatible with the evolving physical platform, particularly as new instruments and processes are introduced.
- • At the critical intersection points – including instrument integration, data ingestion, and metadata standards – you will manage sequencing and dependencies effectively, ensuring smooth operational flow without creating bottlenecks.
- • You will ensure the data layer consistently keeps pace with expanding automation capabilities, guaranteeing that increased experimental volume translates into well-structured, valuable datasets rather than creating burdensome cleanup backlogs.
- • Finally, you will be responsible for Keeping Things Moving by self-organizing around the department's highest-value work. You will proactively seek out, sequence, and prioritize tasks, demonstrating initiative rather than waiting for a defined task list.
- • You will own the operating rhythm of the team, including sprint planning, roadmap reviews, cross-functional syncs, and dependency tracking, ensuring efficient project execution.
- • You will be adept at surfacing risks and tradeoffs early in the infrastructure delivery process, allowing for timely mitigation and informed decision-making.
- • You will translate complex technical constraints into clear, concise business terms for discussions with BD, finance, and partnership teams, particularly when data infrastructure or security posture is a relevant consideration.
🎯 Requirements
- • 2 to 6 years of hands-on experience with Palantir Foundry, specifically in ontology design, pipeline development, or application building.
- • Proficiency with React, with a demonstrated ability to build Foundry applications and user-facing tools.
- • Familiarity with cloud infrastructure (AWS) and modern data engineering practices.
- • Experience in a startup or scale-up environment where scope is fluid and resourcefulness is essential.
- • Exposure to life sciences data (e.g., assay data, LIMS, genomics) is desirable, with the ability to understand scientific discussions and translate them into data and infrastructure requirements.
🏖️ Benefits
- • Competitive salary and equity package.
- • Comprehensive health, dental, and vision insurance.
- • Generous paid time off and holidays.
- • Opportunities for professional development and continuous learning.
- • A dynamic and collaborative work environment at the cutting edge of biotechnology.
Skills & Technologies
About Parallel Biologics
Parallel Biologics is a biotechnology company focused on developing novel therapeutic antibodies. They utilize a proprietary platform to discover and engineer antibodies with enhanced potency, specificity, and drug-like properties. Their pipeline targets a range of diseases, including oncology and autoimmune disorders. The company aims to accelerate the development of life-changing medicines by leveraging advanced biological insights and cutting-edge protein engineering technologies. Parallel Biologics collaborates with academic institutions and pharmaceutical partners to advance its antibody candidates through preclinical and clinical development, ultimately seeking to bring innovative treatments to patients in need.
Similar Opportunities
4 days ago



