Zillow Group, Inc.

Principal Machine Learning Engineer

Job Overview

Location

Remote-USA

Job Type

Full-time

Category

Machine Learning Engineer

Date Posted

May 17, 2026

Full Job Description

📋 Description

  • Set the multi-quarter technical roadmap for agentic data foundations including context engineering, agentic memory, and AI workflows that power Zillow’s customer-facing agentic experiences.
  • Architect, prototype, and ship scalable systems handling hundreds of millions of agent interactions with high availability, low latency, and predictable cost, remaining hands-on in code and production when critical.
  • Lead complex, cross-organizational initiatives across Agentic AI and Platform teams, aligning on architecture, surfacing dependencies, and driving outcomes through influence rather than direct authority.
  • Translate ambiguous customer problems, complex platform trade-offs, and emerging agentic paradigms into clear, actionable insights for engineering peers, product partners, Directors, and VPs.
  • Mentor Senior and Staff engineers, elevate technical judgment and architecture decisions, and shape the engineering culture of the Agentic Data Platform organization.
  • Design and implement production-grade agentic systems with built-in observability, evaluation, safety, latency control, and cost efficiency, prioritizing minimal viable foundations before scaling.
  • Own the end-to-end lifecycle of agentic AI platforms, from concept and design to deployment and operational excellence, ensuring systems meet strict SLOs under massive scale.
  • Collaborate with data science, engineering, and product teams to integrate retrieval systems (embeddings, hybrid search, ranking), tool use, orchestration, and memory mechanisms into core agentic workflows.
  • Maintain deep expertise in distributed systems architecture and operational best practices for large-scale ML infrastructure, including feature stores, model-serving layers, and data pipelines.
  • Drive adoption of production-grade defaults for LLM-based systems, including tracing, safety frameworks, and evaluation metrics, leaving behind documentation and systems that raise the technical bar across the organization.
  • Operate effectively in ambiguity, leading through whiteboarding, design docs, and production code to earn alignment across science, engineering, and product stakeholders.

🎯 Requirements

  • 10+ years building, scaling, and operating large-scale data and ML infrastructure, with 1–2 recent years shipping agent-based or LLM-powered systems to production and 3+ years as a technical leader across multiple organizations.
  • Hands-on experience designing and shipping agentic AI in production, including orchestration, tool use, memory and context engineering, retrieval systems (embeddings, hybrid search, ranking), and evaluation.
  • Platform engineering background with expertise in distributed systems architecture, scaling large-scale ML infrastructure, and operational excellence under massive scale and tight SLOs.
  • Expert-level Python and deep experience with agentic frameworks (LangGraph, LangChain, Agents SDK, AutoGen), large-scale data processing tools (Spark, Databricks, Airflow, Temporal), vector stores, and AWS cloud infrastructure.
  • Proven ability to set technical direction across organizational boundaries, build trust with engineering, science, and product leaders, and articulate complex trade-offs clearly to engineering peers and executives.
  • Demonstrated capacity to resist premature platform building by shipping the smallest viable foundation, then hardening patterns only after validating real need in production.

🏖️ Benefits

  • Competitive base salary range of $194,200.00 – $326,600.00 annually, depending on location.
  • Equity awards based on experience, performance, and location.
  • Remote work flexibility with the ability to work from any of the 50 United States (with limited exceptions).
  • Opportunity to work at the intersection of cutting-edge agentic AI and large-scale real estate data platforms at a Fortune 100 Best Company to Work For®.
  • Access to an innovative, inclusive culture recognized for employee empowerment and career growth.
  • Compliance with state-specific salary thresholds and employment laws, including those in California, New York, Colorado, and others.

Skills & Technologies

Python
AWS
Apache Spark
Senior
Remote

Zillow Group, Inc.

About Zillow Group, Inc.

Zillow Group operates the largest digital real-estate marketplace in the United States, connecting buyers, sellers, renters, landlords and agents through websites and mobile apps. Founded in 2006 and headquartered in Seattle, the company provides property listings, valuation estimates via its Zestimate algorithm, comparative market analytics, mortgage origination and title services, and iBuying through its Zillow Instant Offers program. Revenue is generated primarily from Premier Agent advertising, lead generation, and real-estate services. The platform covers more than 110 million U.S. homes and is publicly traded on NASDAQ under the ticker ZG.

