This job has expired
This position was posted on November 6, 2025 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Job Overview
Location
Remote
Job Type
Full-time
Category
Software Engineering
Date Posted
November 6, 2025
Full Job Description
đź“‹ Description
- • Architect and own the end-to-end MLOps lifecycle for Simpplr’s AI-powered employee-experience platform, ensuring that every model—from recommendation engines to natural-language understanding services—can be trained, validated, deployed, monitored, and rolled back in minutes, not days.
- • Design cloud-native, Kubernetes-orchestrated pipelines that span data ingestion, feature engineering, distributed training, hyper-parameter tuning, model registry, A/B testing, and real-time inference at petabyte scale while keeping costs predictable and security airtight.
- • Champion infrastructure-as-code best practices using Terraform, Helm, and GitOps workflows so that environments (dev, staging, prod) are reproducible, version-controlled, and self-healing—freeing data scientists to focus on algorithms instead of YAML.
- • Build automated model-drift, data-drift, and performance-monitoring dashboards that surface anomalies before they impact 1M+ daily active users, triggering rollback or retraining jobs without human intervention.
- • Partner with product, data science, and customer-success teams to translate business KPIs (e.g., “increase intranet search relevance by 12%”) into measurable ML objectives, then deliver the observability stack that proves ROI to Fortune 500 executives.
- • Optimize GPU/TPU utilization and multi-region latency to ensure sub-100 ms inference for real-time personalization while keeping cloud spend under 5% of ARR through intelligent auto-scaling and spot-instance orchestration.
- • Create secure, privacy-compliant pipelines that satisfy SOC 2, GDPR, and HIPAA requirements—encrypting PII at rest and in transit, implementing role-based access control, and maintaining audit trails that make compliance reviews a breeze.
- • Mentor junior MLOps and platform engineers through pair programming, design reviews, and lunch-and-learn sessions, raising the bar for code quality, testing, and documentation across the entire R&D organization.
- • Evaluate emerging technologies (e.g., vector databases, LLM fine-tuning frameworks, feature stores) in quarterly spikes, then socialize findings via internal tech talks and proof-of-concept repos that accelerate innovation.
- • Own the 24×7 on-call rotation for production ML services once per quarter, using blameless post-mortems to turn incidents into durable system improvements and reducing MTTR by 30% year over year.
- • Collaborate with DevOps and Security to integrate ML pipelines into the broader CI/CD ecosystem, ensuring that every pull request triggers automated unit, integration, and load tests before promotion to staging.
- • Drive cost-allocation tagging and FinOps practices so that every experiment’s cloud bill is transparently attributed to the requesting team, enabling data-driven decisions about model complexity versus business value.
- • Establish SLIs/SLOs for model latency, throughput, and accuracy, then build canary and blue-green deployment strategies that let new models earn traffic gradually while protecting user experience.
- • Contribute to open-source MLOps projects (Kubeflow, MLflow, Feast) and represent Simpplr at meetups and conferences, strengthening our employer brand and attracting top-tier talent.
🎯 Requirements
- • 5+ years production-grade MLOps or platform engineering experience with Kubernetes, Docker, and at least one major cloud (AWS, GCP, or Azure)
- • Expert-level Python and proficiency in Infrastructure-as-Code (Terraform, CloudFormation, or Pulumi)
- • Hands-on experience with distributed training frameworks (Ray, Horovod, or PyTorch DDP) and model-serving stacks (KServe, TensorFlow Serving, or Triton)
- • Nice-to-have: advanced degree in Computer Science, Data Science, or related quantitative field
🏖️ Benefits
- • Fully remote-first culture with quarterly in-person retreats in destinations like Lisbon or Austin
- • Competitive salary plus equity in a fast-growing, profitable SaaS company
- • $2,000 annual learning stipend for courses, conferences, or certifications
- • Flexible PTO policy and company-wide mental-health days every quarter
Skills & Technologies
About Simpplr Inc.
Simpplr provides cloud-based intranet software that unifies employee communications, knowledge management, and engagement in one platform. The company’s AI-driven platform personalizes content delivery, consolidates corporate news and documents, and supports collaboration across distributed teams. Founded in 2014 and headquartered in Redwood City, California, Simpplr serves mid-market to Fortune 500 organizations, integrating with enterprise systems like Salesforce, Google Workspace, and Microsoft 365 to streamline internal communication and improve employee experience.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.



