This job has expired

This position was posted on November 6, 2025 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Simpplr Inc. logo

Senior AI MLOps Engineer

Job Overview

Location

Remote

Job Type

Full-time

Category

Software Engineering

Date Posted

November 6, 2025

Full Job Description

đź“‹ Description

  • • Architect and own the end-to-end MLOps lifecycle for Simpplr’s AI-powered employee-experience platform, ensuring that every model—from recommendation engines to natural-language understanding services—can be trained, validated, deployed, monitored, and rolled back in minutes, not days.
  • • Design cloud-native, Kubernetes-orchestrated pipelines that span data ingestion, feature engineering, distributed training, hyper-parameter tuning, model registry, A/B testing, and real-time inference at petabyte scale while keeping costs predictable and security airtight.
  • • Champion infrastructure-as-code best practices using Terraform, Helm, and GitOps workflows so that environments (dev, staging, prod) are reproducible, version-controlled, and self-healing—freeing data scientists to focus on algorithms instead of YAML.
  • • Build automated model-drift, data-drift, and performance-monitoring dashboards that surface anomalies before they impact 1M+ daily active users, triggering rollback or retraining jobs without human intervention.
  • • Partner with product, data science, and customer-success teams to translate business KPIs (e.g., “increase intranet search relevance by 12%”) into measurable ML objectives, then deliver the observability stack that proves ROI to Fortune 500 executives.
  • • Optimize GPU/TPU utilization and multi-region latency to ensure sub-100 ms inference for real-time personalization while keeping cloud spend under 5% of ARR through intelligent auto-scaling and spot-instance orchestration.
  • • Create secure, privacy-compliant pipelines that satisfy SOC 2, GDPR, and HIPAA requirements—encrypting PII at rest and in transit, implementing role-based access control, and maintaining audit trails that make compliance reviews a breeze.
  • • Mentor junior MLOps and platform engineers through pair programming, design reviews, and lunch-and-learn sessions, raising the bar for code quality, testing, and documentation across the entire R&D organization.
  • • Evaluate emerging technologies (e.g., vector databases, LLM fine-tuning frameworks, feature stores) in quarterly spikes, then socialize findings via internal tech talks and proof-of-concept repos that accelerate innovation.
  • • Own the 24Ă—7 on-call rotation for production ML services once per quarter, using blameless post-mortems to turn incidents into durable system improvements and reducing MTTR by 30% year over year.
  • • Collaborate with DevOps and Security to integrate ML pipelines into the broader CI/CD ecosystem, ensuring that every pull request triggers automated unit, integration, and load tests before promotion to staging.
  • • Drive cost-allocation tagging and FinOps practices so that every experiment’s cloud bill is transparently attributed to the requesting team, enabling data-driven decisions about model complexity versus business value.
  • • Establish SLIs/SLOs for model latency, throughput, and accuracy, then build canary and blue-green deployment strategies that let new models earn traffic gradually while protecting user experience.
  • • Contribute to open-source MLOps projects (Kubeflow, MLflow, Feast) and represent Simpplr at meetups and conferences, strengthening our employer brand and attracting top-tier talent.

🎯 Requirements

  • • 5+ years production-grade MLOps or platform engineering experience with Kubernetes, Docker, and at least one major cloud (AWS, GCP, or Azure)
  • • Expert-level Python and proficiency in Infrastructure-as-Code (Terraform, CloudFormation, or Pulumi)
  • • Hands-on experience with distributed training frameworks (Ray, Horovod, or PyTorch DDP) and model-serving stacks (KServe, TensorFlow Serving, or Triton)
  • • Nice-to-have: advanced degree in Computer Science, Data Science, or related quantitative field

🏖️ Benefits

  • • Fully remote-first culture with quarterly in-person retreats in destinations like Lisbon or Austin
  • • Competitive salary plus equity in a fast-growing, profitable SaaS company
  • • $2,000 annual learning stipend for courses, conferences, or certifications
  • • Flexible PTO policy and company-wide mental-health days every quarter

Skills & Technologies

Senior
Remote

Ready to Apply?

You will be redirected to an external site to apply.

Simpplr Inc. logo
Simpplr Inc.
Visit Website

About Simpplr Inc.

Simpplr provides cloud-based intranet software that unifies employee communications, knowledge management, and engagement in one platform. The company’s AI-driven platform personalizes content delivery, consolidates corporate news and documents, and supports collaboration across distributed teams. Founded in 2014 and headquartered in Redwood City, California, Simpplr serves mid-market to Fortune 500 organizations, integrating with enterprise systems like Salesforce, Google Workspace, and Microsoft 365 to streamline internal communication and improve employee experience.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

San Francisco
Full-time
Expires Jun 2, 2026
Go
Onsite

1 month ago

Apply
PactFi Inc. logo

PactFi Inc.

New York, NY
Full-time
Expires May 27, 2026
Python
JavaScript
TypeScript
+4 more

2 months ago

Apply
Kernel Medical Devices, Inc. logo

Kernel Medical Devices, Inc.

London
Full-time
Expires Jun 2, 2026
JavaScript
Senior
Remote
+1 more

1 month ago

Apply
Poland
Full-time
Expires May 31, 2026
Senior
Onsite

1 month ago

Apply