
Job Overview
Location
2 Locations
Job Type
Full-time
Category
Machine Learning Engineer
Date Posted
April 2, 2026
Full Job Description
đź“‹ Description
- • As a Staff Software Engineer on Cloudera’s Anywhere Cloud team, you will lead the architecture and delivery of a cloud-native AI platform that bridges cutting-edge AI research with production-grade Kubernetes environments, enabling organizations to run AI workloads anywhere—public cloud, private data centers, or the edge—without vendor lock-in.
- • You will design and implement enterprise-grade AI services in Go/Node.js that wrap open-source foundation models (Llama, Qwen, etc.) for secure, scalable consumption by internal product teams and external customers, forming the "nervous system" of Cloudera’s AI stack.
- • You will lead the deployment and optimization of AI inference servers (vLLM, Triton) using KServe, KubeRay, or Knative to achieve serverless-style scaling, ensuring efficient GPU utilization through MIG and fractional GPU scheduling in Kubernetes.
- • You will architect robust Retrieval-Augmented Generation (RAG) pipelines and prompt management services that integrate with vector databases and enterprise data sources, enabling context-aware, accurate AI responses grounded in proprietary data.
- • You will build internal developer tooling, SDKs, and "AI Gateways" that streamline the integration of foundation models into product features, significantly improving developer velocity and reducing time-to-market for AI-powered capabilities.
- • You will collaborate closely with UI/UX engineers, product managers, and data scientists to ensure the AI platform is not only powerful and performant but also intuitive, usable, and aligned with real-world developer workflows.
- • You will contribute to a unified, scalable, and future-ready platform powered by Kubernetes and enhanced by the Taikun acquisition, simplifying hybrid cloud management and enabling consistent, secure, and compliant AI workloads across any infrastructure.
- • You will help shape Cloudera’s vision of giving organizations 100% access to 100% of their data, anywhere, by eliminating friction in AI deployment and enabling true hybrid AI innovation at enterprise scale.
🎯 Requirements
- • Bachelor’s degree with 6+ years of software engineering experience (or equivalent Masters/PhD tenure), including at least 2+ years focused on AI/ML systems.
- • Expert proficiency in Python for AI/ML ecosystems and strong competence in a systems language such as Go, Rust, or C++ for building high-performance serving layers and infrastructure components.
- • Deep understanding of LLM deployment challenges and runtimes (vLLM, Triton, TorchServe, ONNX), including hands-on experience with quantization techniques (AWQ, GPTQ) to optimize model size, latency, and throughput.
- • Proven experience building complex AI workflows using LangChain or LlamaIndex and deploying them on containerized infrastructure (Docker/Kubernetes) in production environments.
- • Ability to navigate the rapidly evolving AI landscape, distinguish practical engineering solutions from hype, and drive technical alignment across cross-functional teams.
🏖️ Benefits
- • Generous PTO policy supporting work-life balance and employee well-being.
- • Flexible WFH policy enabling remote and hybrid work arrangements.
- • Comprehensive mental and physical wellness programs, including access to wellness resources and initiatives.
- • Phone and internet reimbursement program to support remote work setup and connectivity.
- • Access to continued career development opportunities, including training, conferences, and skill-building resources.
- • Comprehensive benefits and competitive compensation packages aligned with market standards.
- • Paid volunteer time and employee resource groups fostering community engagement and inclusion.
- • Unplugged Days initiative encouraging employees to disconnect and recharge.
Skills & Technologies
About Cloudera, Inc.
Cloudera, Inc. provides an enterprise data cloud platform for analytics and machine learning. Its software combines data engineering, data warehousing, and AI workloads on hybrid and multi-cloud environments. Built around open-source technologies like Apache Hadoop, Spark, and Kafka, it offers unified security, governance, and metadata management. Customers use Cloudera Data Platform to ingest, store, analyze, and model large-scale data for business intelligence and real-time insights. The company serves financial services, healthcare, telecommunications, and public sector organizations worldwide.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Heidi Health Pty Ltd
1 month ago

Heidi Health Pty Ltd
1 month ago

