GoDaddy Inc. logo

Senior Site Reliability Engineer

Job Overview

Location

India

Job Type

Full-time

Category

Software Engineering

Date Posted

March 5, 2026

Full Job Description

đź“‹ Description

  • • As a Senior Site Reliability Engineer at GoDaddy, you will be at the forefront of ensuring the unwavering availability, optimal performance, and robust operational integrity of GoDaddy's critical security platforms. This pivotal role involves taking ownership of the reliability outcomes for sophisticated security infrastructure, including Intrusion Detection/Prevention Systems (IDS/IPS), Distributed Denial of Service (DDoS) mitigation services, and other vital security technologies that safeguard GoDaddy's extensive global footprint.
  • • Your primary responsibility will be to meticulously define Service Level Indicators (SLIs) and Service Level Objectives (SLOs), establishing clear error budgets that guide our reliability efforts. You will be instrumental in building actionable alerting mechanisms, comprehensive dashboards, and detailed runbooks to provide real-time insights and facilitate swift responses to any potential issues.
  • • You will architect, implement, and maintain high availability solutions, conduct thorough capacity planning, and develop robust disaster recovery strategies for the core security platforms. This includes ensuring the resilience of IDS/IPS, DDoS mitigation systems, and all supporting services, guaranteeing continuous protection against threats.
  • • A key aspect of this role involves designing and executing zero or minimal-downtime maintenance and upgrade strategies. This encompasses seamless OS updates, firmware patches, and signature deployments across the security infrastructure, minimizing any impact on service availability.
  • • You will leverage your expertise in automation to streamline deployments, manage configurations, and ensure compliance using tools like SaltStack and Python. This focus on automation is crucial for reducing manual effort and increasing operational efficiency.
  • • You will operate and continuously improve a complex and heterogeneous technology stack. This includes hands-on experience with industry-leading platforms such as TrendMicro TippingPoint IPS, Suricata, NetScout/Arbor Sightline/TMS, HAProxy, Nginx, Juniper, Palo Alto, and Kentik/KProxy, ensuring their optimal performance and reliability.
  • • You will play a critical role in building and evolving our observability capabilities. This involves working with Icinga for alerting, Grafana for dashboarding, InfluxDB for metrics collection, and rsyslog for log pipelines. A significant focus will be on driving SLO-based alerting and actively working to reduce alert noise, ensuring that our teams are alerted to meaningful events.
  • • You will lead incident response efforts as part of a 24/7 on-call rotation. In this capacity, you will act as the incident commander, driving rapid mitigation strategies, and conducting blameless postmortems to identify root causes and implement durable fixes, thereby preventing recurrence.
  • • A core tenet of this role is the reduction of toil through the development of self-service tooling, APIs, and automated health checks. You will champion reliability reviews and actively participate in game days and chaos testing to proactively identify and address potential weaknesses in our systems.
  • • You will ensure that all operations are audit-ready and aligned with stringent industry standards such as WebTrust and PCI-DSS. This includes upholding rigorous change management processes, maintaining configuration baselines, and enforcing strict access controls to protect sensitive systems.
  • • Collaboration is key. You will work closely with cross-functional teams, including Network Engineering, Security Architecture, Hosting, and Product teams, to ensure seamless integration and operation of security platforms. You will also mentor and provide technical guidance to a team of 2-3 contractors, fostering their growth and ensuring high-quality output.
  • • Maintaining high-quality operational documentation, Standard Operating Procedures (SOPs), and architectural diagrams is essential for knowledge sharing and operational consistency.
  • • This role offers the opportunity to work remotely from India, with occasional visits to a GoDaddy office for team events or meetings, providing flexibility while maintaining strong team collaboration.

Skills & Technologies

Python
Git
Linux
Grafana
Senior
Remote
Degree Required

Ready to Apply?

You will be redirected to an external site to apply.

GoDaddy Inc. logo
GoDaddy Inc.
Visit Website

About GoDaddy Inc.

GoDaddy Inc. is a publicly traded internet domain registrar and web hosting company headquartered in Tempe, Arizona. Established in 1997, it provides domain name registration, website hosting, SSL certificates, website building tools, email hosting, and cloud-based products to individuals and small-to-medium businesses worldwide. The company operates a large network of data centers and supports over 84 million domains for approximately 21 million customers. GoDaddy generates revenue through subscription-based services, domain renewals, and add-on products, and serves markets in North America, Europe, Asia-Pacific, Latin America, and the Middle East.

Similar Opportunities

❌ EXPIRED
Scale to Win LLC logo

Scale to Win LLC

Remote
Full-time
Expired Jan 22, 2026
Senior
Remote

3 months ago

Apply
Remote - USA
Full-time
Expires May 2, 2026
Senior
Remote

4 days ago

Apply
Dandy Technology, Inc. logo

Dandy Technology, Inc.

USA - Remote
Full-time
Expires May 3, 2026
REST
Remote

3 days ago

Apply
Remote - Canada
Full-time
Expires May 2, 2026
Go
MongoDB
Redis
+3 more

4 days ago

Apply