
Job Overview
Location: Remote Nationwide
Job Type: Full-time
Category: Software Engineering
Date Posted: May 27, 2026
Full Job Description
📋 Description
- As a Decision Intelligence Engineer at Humana, you will design, train, and improve the reinforcement learning policy at the core of the Next Best Action platform. Your work directly affects healthcare decisions for millions of members by ensuring recommendations are clinically appropriate, safe, and effective.
- Day to day, you will design and evaluate decision-making algorithms (including PPO, A3C, DQN, CQL, and Decision Transformer), frame member decisioning as MDPs or POMDPs, and build simulation and backtesting environments. You will also own the nightly Databricks training workflow using PySpark and Ray RLlib, manage the model lifecycle in MLflow, and collaborate with data, platform, and rules engine teams to ensure compliance with clinical eligibility and program objectives.
- You will join Humana’s caring community. Humana is a leading U.S. healthcare company dedicated to putting health first through its insurance and CenterWell services, serving millions of people, including Medicare and Medicaid members, families, and military personnel. Your work contributes to better health outcomes and quality of life.
- In this role, you will deepen your expertise in reinforcement learning, constrained optimization, and production ML systems at scale. You will gain experience in healthcare-specific AI governance and explainability, and advance your ability to ship reliable, auditable decision systems that operate under real-world constraints and regulatory scrutiny.
🎯 Requirements
- 8+ years of software engineering or quantitative research experience building and operating large-scale production systems, with emphasis on data-intensive platforms, recommendation systems, optimization engines, or simulation frameworks serving millions of users.
- 3+ years of hands-on experience implementing reinforcement learning, operations research methods, or simulation-driven decision systems in production, including PPO, A3C, DQN, CQL, stochastic dynamic programming, or constrained optimization.
- Proficiency in Python 3.x; experience with PyTorch or TensorFlow for policy network implementation.
- Experience with Ray RLlib or equivalent distributed computation frameworks for large-scale training or optimization.
- Experience with Databricks, PySpark, and Delta Lake for large-scale ML or data pipelines processing tens of millions of records.
- Experience with MLflow for experiment tracking, model registry, and artifact management.
🏖️ Benefits
- Medical, dental, and vision benefits
- 401(k) retirement savings plan
- Paid time off (including company and personal holidays, volunteer time off, paid parental and caregiver leave)
- Short-term and long-term disability, life insurance
- Bonus incentive plan based on company and/or individual performance
- Bi-weekly internet expense reimbursement for eligible remote employees in CA, IL, MT, or SD
About Humana Inc.
Humana Inc. is a for-profit health and well-being company headquartered in Louisville, Kentucky. Founded in 1961, it provides health insurance, Medicare Advantage plans, Medicaid services, pharmacy benefit management, and clinical care through primary care centers. Serving millions of members across the United States, Humana focuses on integrated care delivery, home health, and wellness programs aimed at improving health outcomes and reducing costs for individuals, employers, and government partners.



