This job has expired
This position was posted on March 25, 2026 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Job Overview
Location
India
Job Type
Full-time
Category
Software Engineering
Date Posted
March 25, 2026
Full Job Description
📋 Description
- • Atlan is at the forefront of building the essential context layer for data and AI, addressing the critical challenge where a significant majority of AI pilots fail due to a lack of understanding of the data's context, including its meaning, governance, and appropriate usage. This role is pivotal in ensuring that AI systems can effectively leverage enterprise data by providing the necessary contextual foundation.
- • As a Staff Software Engineer on the Foundation Platform team, you will be instrumental in developing and maintaining the production infrastructure that underpins Atlan’s context layer for AI. This infrastructure spans across major cloud providers including AWS, Azure, and GCP. A key aspect of this role involves the natural integration and utilization of AI-assisted development tools, such as Claude Code and Cursor, into your daily engineering workflow, enhancing productivity and innovation.
- • Your day-to-day responsibilities will encompass owning and continuously evolving our multi-tenant infrastructure, which is built on Kubernetes. This includes managing dedicated clusters for each customer and overseeing the entire tenant lifecycle, from initial provisioning and dynamic scaling to seamless migration and efficient offboarding processes. You will be a key player in ensuring the robustness and scalability of our multi-tenant architecture.
- • A significant part of your work will involve enhancing our GitOps deployment processes. This includes making the ArgoCD and Helm-based pipeline more robust, faster, and safer, especially considering it deploys over 1,000 applications across hundreds of tenants. You will identify bottlenecks and implement improvements to streamline these critical deployment operations.
- • You will be tasked with transforming manual infrastructure runbooks into automated, reliable solutions. This includes automating processes for Kubernetes upgrades, Private Link setups, Disaster Recovery (DR) drills, and cluster onboarding. The primary tools for this automation will be Infrastructure-as-Code (IaC) principles and workflow engines, ensuring consistency and reducing human error.
- • Strengthening the platform's observability and efficiency will be a core focus. This involves improving our logging, metrics, and alerting stack. By leveraging these enhanced systems, you will drive improvements in reliability, provide greater visibility into system performance, and achieve meaningful reductions in cloud costs through optimized resource utilization.
- • You will take ownership of customer-facing infrastructure work and lead incident response from beginning to end. The insights gained from these experiences will be translated into clear, actionable runbooks, comprehensive dashboards, and custom Claude Skills. These resources will empower both human operators and AI agents to manage and operate the platform effectively.
- • About the team and company: Atlan is a rapidly growing SaaS company building the missing context layer for data and AI. We are trusted by global enterprises like Mastercard, Workday, and General Motors, and backed by prominent investors such as GIC and Insight Partners. Our mission is to help data teams do their life’s best work by providing the infrastructure for businesses to become AI-forward, ensuring AI operates on trusted, governed context.
- • In this role, you will have the opportunity to lead major, cross-cutting infrastructure initiatives, significantly impacting the reliability, cost-efficiency, and developer experience of our platform. You will gain deep expertise in operating complex Kubernetes and GitOps platforms at scale, and develop advanced skills in transforming manual processes into automated, resilient workflows. You will also hone your ability to leverage observability and cost signals to drive strategic improvements and gain invaluable experience in owning customer-facing infrastructure and incidents end-to-end, acting as a technical multiplier for the entire team.
Skills & Technologies
See exactly how your profile matches this role — strengths, skill gaps, and what to do about them.
About Atlan Data Technologies Private Limited
Atlan Data Technologies provides a cloud-native data collaboration workspace that unifies data catalog, lineage, quality, and governance for analytics and AI teams. The platform integrates with modern data stacks, enabling data discovery, context sharing, and automated quality checks across warehouses, lakes, and BI tools. It serves enterprises aiming to democratize data while ensuring security and compliance.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

DoiT International
3 months ago

Ddome Inc.
3 months ago

Stedi, Inc.
4 months ago

DoiT International
3 months ago