
Job Overview
Location
Remote
Job Type
Full-time
Category
DevOps & SysAdmin
Date Posted
February 17, 2026
Full Job Description
đź“‹ Description
- • As a DevOps Team Lead at Fundamental, you will be at the forefront of shaping and scaling the infrastructure that powers our groundbreaking AI company. You will lead a talented team of DevOps engineers, fostering an environment of technical excellence, continuous learning, and collaborative problem-solving. Your leadership will be instrumental in defining and executing our infrastructure roadmap, ensuring it aligns seamlessly with Fundamental's ambitious objectives and the rapid evolution of our AI technologies.
- • You will architect, implement, and oversee the design of our cloud infrastructure, with a primary focus on AWS, GCP, and Azure. This includes ensuring our systems are robust, scalable, and secure, capable of supporting the demanding workloads of large language model training and enterprise-grade deployment. A key aspect of this role involves establishing and enforcing best practices, standards, and streamlined processes for all aspects of infrastructure development and operations. This ensures consistency, efficiency, and maintainability across our entire technology stack.
- • Collaboration is central to this role. You will work closely with our Engineering, Research, and Field Deployment Engineering (FDE) teams to ensure our infrastructure capabilities are perfectly aligned with their needs and the broader business objectives. This cross-functional partnership is crucial for delivering cutting-edge AI solutions to our Fortune 100 clients.
- • A significant focus will be placed on the evolution and optimization of our Kubernetes clusters. You will drive initiatives to enhance these clusters for GPU workloads, ensuring efficient resource utilization for model training and inference. Furthermore, you will optimize them for Production SaaS hosting and the diverse deployment models required by our enterprise clients, ensuring high availability and performance.
- • Championing modern DevOps methodologies is essential. You will champion GitOps practices, leveraging tools like ArgoCD to implement robust continuous integration and continuous deployment (CI/CD) pipelines. This ensures rapid, reliable, and automated software delivery.
- • You will also establish and enforce stringent infrastructure as code (IaC) standards, primarily using Terraform. This approach guarantees that our infrastructure is version-controlled, repeatable, and auditable, reducing manual errors and increasing agility.
- • Developing and implementing a comprehensive monitoring and observability strategy for our distributed systems is a critical responsibility. You will ensure we have the tools and processes in place to gain deep insights into system performance, identify potential issues proactively, and maintain optimal operational health.
- • You will collaborate closely with our Machine Learning engineers to fine-tune and optimize the infrastructure specifically for the unique demands of model training and serving. This includes understanding the computational requirements, data pipelines, and deployment patterns specific to AI/ML workloads.
- • Ultimately, you will own the infrastructure's reliability, performance, and security posture. This involves implementing proactive measures, incident response protocols, and continuous improvement initiatives to maintain the highest standards.
- • Implementing and maintaining effective cost optimization strategies (FinOps) for our cloud resources will be a key performance indicator. You will identify opportunities to reduce expenditure without compromising performance or reliability, ensuring efficient use of company resources.
- • This role requires a strategic thinker who can balance immediate operational needs with long-term infrastructure vision. You will be instrumental in building a scalable, resilient, and cost-effective infrastructure that supports Fundamental's rapid growth and its mission to transform enterprise decision-making with AI.
- • You will also be responsible for mentoring and developing your team members, providing guidance on technical challenges, career development, and fostering a culture of innovation and continuous improvement. Your leadership will empower the team to achieve their full potential and contribute significantly to the company's success.
- • The ideal candidate will possess a deep understanding of cloud-native technologies and a passion for building and operating highly available, scalable, and secure systems. You will be a hands-on leader, comfortable diving into technical details while also maintaining a strategic overview of the infrastructure landscape.
- • You will play a pivotal role in ensuring our infrastructure can support the deployment of our proprietary Large Tabular Model (LTM), NEXUS, to Fortune 100 companies, meeting their stringent security, performance, and compliance requirements.
- • This is an opportunity to join a category-defining company at a pivotal stage, working with world-class talent and building technology that has a tangible impact on how businesses operate globally. You will be part of a mission-driven team that values ownership, bias toward action, and a deep commitment to pushing the boundaries of AI.
🎯 Requirements
- • 7+ years of experience in cloud infrastructure and DevOps, with a minimum of 3 years in a technical leadership role, demonstrating a proven track record of building and leading high-performing infrastructure teams.
- • Deep expertise in Kubernetes, including multi-cluster management, GPU workload optimization, resource scheduling and autoscaling, and network policies and security.
- • Extensive experience with cloud networking, encompassing VPC design, load balancer configuration, network security and segmentation, and cross-cloud networking solutions.
- • Strong proficiency in Infrastructure as Code (IaC) using Terraform and GitOps practices with tools like ArgoCD.
- • Proficiency in at least one major cloud provider (AWS, GCP, Azure) and experience with multiple is highly desirable.
🏖️ Benefits
- • Competitive compensation package including salary and equity.
- • Comprehensive health coverage: medical, dental, vision, and 401K.
- • Generous paid parental leave for all new parents, including adoptive and surrogate journeys, and fertility support.
- • Relocation assistance for those joining the team at one of our office locations.
- • A mission-driven, low-ego culture that values diversity of thought, ownership, and a bias toward action.
Skills & Technologies
About Fundamental
Fundamental is a company focused on providing innovative solutions and services. They aim to empower businesses by leveraging cutting-edge technology and expert insights. Their offerings span various sectors, addressing complex challenges with tailored approaches. The company is committed to driving growth and efficiency for its clients through a blend of strategic planning and practical execution. With a strong emphasis on research and development, Fundamental continuously seeks to advance its capabilities and deliver value-added services. Their client-centric model ensures that solutions are aligned with specific business objectives, fostering long-term partnerships and mutual success. Fundamental strives to be a reliable partner in navigating the evolving business landscape.
Similar Opportunities

Saronic Technologies Inc.
5 months ago


