DraftKings Inc. logo

Lead Site Reliability Engineer

Job Overview

Location

Indiana, USA

Job Type

Full-time

Category

Software Engineering

Date Posted

March 5, 2026

Full Job Description

đź“‹ Description

  • • As a Lead Site Reliability Engineer at DraftKings, you will be at the forefront of ensuring the unwavering reliability, exceptional scalability, and optimal efficiency of our critical infrastructure. This pivotal role involves spearheading key initiatives that directly impact the performance and stability of our rapidly growing platform, which is increasingly powered by cutting-edge AI technologies.
  • • You will collaborate intimately with cross-functional teams, including engineering, product, and operations, to architect and implement robust infrastructure automation solutions. Your expertise will be instrumental in shaping the future of our platform, ensuring it can handle massive user growth and complex workloads.
  • • A significant aspect of this leadership position involves mentoring and guiding fellow engineers. You will foster a culture of continuous learning, knowledge sharing, and innovation, empowering the team to tackle complex challenges and adopt best practices in site reliability engineering.
  • • You will be responsible for architecting and automating self-healing, fault-tolerant infrastructure. This includes leveraging declarative configurations, GitOps principles, and event-driven automation to enable scalable and resilient deployments across both public cloud environments (like GCP and AWS) and on-premise data centers.
  • • The role demands the design, development, and maintenance of sophisticated software-driven infrastructure automation. This involves building internal tools and utilities to streamline operations, eliminate repetitive manual tasks, and enhance overall engineering productivity.
  • • You will take ownership of critical decisions related to product deployment strategies, performance tuning of systems, comprehensive monitoring, and proactive alerting. The ultimate goal is to guarantee the highest levels of availability and system efficiency in our production environments.
  • • A key responsibility will be defining and tracking key performance metrics and Service Level Agreements (SLAs) for new web services. This is crucial for supporting our rapid traffic growth and ensuring a seamless user experience.
  • • You will design and implement advanced monitoring and alerting strategies. These systems will be critical for enforcing application SLAs, detecting potential issues before they impact users, and enabling rapid response to incidents.
  • • This role offers a unique opportunity to influence the technical direction of our infrastructure and play a vital part in shaping the future of DraftKings' technology stack, especially as AI becomes more integrated into our operations and product offerings.
  • • You will be expected to stay abreast of the latest trends and technologies in SRE, cloud computing, and automation, bringing that knowledge to bear on our challenges.
  • • The Lead SRE will contribute to the strategic planning of infrastructure roadmaps, ensuring alignment with business objectives and technological advancements.
  • • You will be a champion for reliability and operational excellence throughout the engineering organization, promoting a proactive approach to system health.
  • • This position requires a deep understanding of distributed systems and the ability to troubleshoot complex issues across the entire stack, from the network layer to application performance.
  • • You will work with infrastructure that supports a dynamic and fast-paced environment, requiring adaptability and a problem-solving mindset.
  • • The impact of your work will be directly visible in the performance and availability of DraftKings' services, contributing to a positive customer experience and the company's overall success.
  • • You will be involved in capacity planning and performance analysis to ensure our infrastructure scales effectively with business demands.
  • • The role involves collaborating with development teams to embed reliability best practices early in the software development lifecycle.
  • • You will contribute to incident management processes, including post-mortems and the implementation of preventative measures.
  • • This is a hands-on leadership role where you will not only guide strategy but also contribute technically to the solutions you help implement.
  • • You will be instrumental in ensuring the security and compliance of our infrastructure, working closely with relevant teams.
  • • The opportunity to work with a leading technology company in the rapidly evolving sports and gaming industry, with a strong focus on innovation and AI, makes this a compelling career move.

Skills & Technologies

Python
Java
Ruby
AWS
GCP
Senior
Remote

Ready to Apply?

You will be redirected to an external site to apply.

DraftKings Inc. logo
DraftKings Inc.
Visit Website

About DraftKings Inc.

DraftKings Inc. operates regulated digital sports entertainment and gaming products in North America. Founded in 2012, it offers daily fantasy sports, sports betting, iGaming, and media content through web and mobile platforms. The company holds gaming licenses in multiple U.S. states and provinces, providing real-money wagering on professional and collegiate sports, online casino games, and live-dealer tables. Revenue is generated via entry fees, rake, and sportsbook hold. DraftKings has partnered with major leagues and teams, and it trades on the Nasdaq under DKNG.

Similar Opportunities

Indiana, USA
Full-time
Expires Apr 13, 2026
Python
JavaScript
AWS
+3 more

1 month ago

Apply
Indiana, USA
Full-time
Expires Apr 13, 2026
Python
JavaScript
AWS
+3 more

1 month ago

Apply
SHI International Corp. logo

SHI International Corp.

Indiana, USA
Full-time
Expires Apr 29, 2026
AWS
Azure
Remote
+2 more

14 days ago

Apply
Indiana, USA
Full-time
Expires Apr 13, 2026
Remote

1 month ago

Apply