Full Job Description
đź“‹ Description
• Architect, standardize and govern infrastructure-as-code across JioHotstar’s multi-cloud, multi-region footprint (AWS, GCP, on-prem) so that every service—from live-streaming micro-services to ad-serving pipelines—deploys with the same reliability, security and observability guarantees used during the IPL finals or a global movie premiere.
• Design and maintain reusable Terraform modules, Helm charts and Python libraries that abstract the complexity of VPCs, IAM, EKS/GKE, storage classes, ingress controllers and autoscaling policies, enabling 200+ engineering squads to ship features without ever touching raw cloud APIs.
• Own capacity planning and cost-optimization end-to-end: build predictive models that ingest real-time traffic, CDN logs and business calendars to pre-warm clusters before marquee events, then automatically scale back down to save millions in cloud spend.
• Champion a culture of reliability by defining SLOs, error budgets and chaos-engineering playbooks; run game-days that simulate region loss, AZ failure or sudden 10× traffic spikes and ensure the platform recovers in < 90 seconds without customer impact.
• Evaluate bleeding-edge platform technologies—service meshes, eBPF-based observability, confidential computing—and run controlled pilots that prove ROI before rolling them out org-wide.
• Act as the final escalation point for critical incidents, performing deep Linux-level debugging (tcpdump, eBPF, strace, cgroups) and post-mortems that turn outages into systemic fixes and runbooks.
• Partner with security, compliance and finance teams to codify guardrails (OPA policies, SCPs, budget alerts) that prevent misconfigurations while still giving developers the autonomy to move fast.
• Mentor senior and staff engineers, conduct architecture reviews and run internal guilds that raise the bar for code quality, documentation and operational excellence across the company.
• Influence the long-term technical roadmap by presenting data-driven proposals to the CTO and VPs, ensuring infrastructure investments align with JioStar’s ambition to reach 1 billion weekly viewers.
🎯 Requirements
• 4-8 years building and operating large-scale, multi-account cloud infrastructure on AWS and GCP with deep expertise in Kubernetes, Terraform and Python.
• Proven track record of designing highly-available, low-latency systems that serve > 1 million concurrent users or 10 Gbps+ sustained traffic.
• Strong Linux internals and networking fundamentals—TCP/IP, HTTP/2, gRPC, load-balancing, iptables, kernel tuning—and the ability to debug live production issues under pressure.
• Experience codifying security, compliance and cost controls into CI/CD pipelines and infrastructure modules.
• Excellent written and verbal communication skills; able to author crisp design docs and present to both engineers and executives.
• B.Tech/B.E or Masters in Computer Science or related field from a reputed institution.
🏖️ Benefits
• Competitive compensation plus annual performance bonus and equity in one of the world’s fastest-growing media-tech companies.
• Premium health insurance for you and your dependents, including mental-wellness and tele-medicine programs.
• Flexible leave policy, quarterly recharge weeks and the option to work from anywhere for up to 30 days a year.
• Annual learning stipend, global conference passes and dedicated time for open-source contributions.
Skills & Technologies
Python
AWS
GCP
Docker
Kubernetes
Senior
Onsite
Degree Required