
Job Overview
Location
Melbourne
Job Type
Full-time
Category
Product Management
Date Posted
June 3, 2026
Full Job Description
đź“‹ Description
- • Design, architect, and maintain scalable, resilient infrastructure that supports Kraken’s AI-driven energy management platform serving millions of customers globally.
- • Ensure high availability, performance, and scalability of core platform products by implementing robust monitoring, alerting, and incident response systems.
- • Collaborate with product and engineering teams to identify reliability bottlenecks and implement proactive solutions that prevent system degradation and outages.
- • Develop and enforce best practices for infrastructure as code, automated testing, and deployment pipelines to enhance platform stability and reduce manual toil.
- • Optimize resource utilization across cloud and on-premises environments to support renewable energy optimization, smart grid operations, and customer-facing systems including CIS, billing, meter data management, and CRM.
- • Lead root cause analysis of production incidents and drive implementation of permanent fixes to improve mean time to recovery (MTTR) and mean time between failures (MTBF).
- • Integrate observability tools (logging, tracing, metrics) across the platform to provide actionable insights into system health and customer experience impact.
- • Partner with DevOps and SRE teams to evolve platform reliability standards and ensure alignment with industry benchmarks for uptime and performance.
- • Contribute to capacity planning and forecasting to anticipate growth in customer demand and ensure infrastructure scales efficiently with minimal latency or disruption.
- • Maintain and improve disaster recovery and failover mechanisms to guarantee business continuity during regional outages or system failures.
- • Document system architectures, runbooks, and operational procedures to enable knowledge sharing and reduce dependency on individual team members.
- • Stay current with emerging technologies in cloud infrastructure, automation, and reliability engineering to continuously elevate the platform’s resilience and efficiency.
- • Advocate for reliability as a shared responsibility across all engineering teams, promoting a culture of ownership and proactive system improvement.
- • Support the development of AI-driven communications and customer experience systems by ensuring underlying infrastructure meets strict performance and compliance requirements.
- • Work within a globally distributed team to align reliability strategies across time zones and regional infrastructure constraints while maintaining consistent service levels.
- • Participate in on-call rotations to respond to critical incidents and ensure rapid resolution of platform-wide issues impacting customer-facing services.
- • Translate business goals around sustainable energy delivery into technical reliability requirements that prioritize efficiency, scalability, and environmental impact.
🎯 Requirements
- • Proven experience as a Platform Engineer, Site Reliability Engineer, or similar role in a high-availability, cloud-native environment
- • Expertise in designing and managing scalable infrastructure using cloud platforms (AWS, Azure, or GCP)
- • Strong proficiency in infrastructure as code tools (Terraform, Ansible, or similar)
- • Experience with monitoring and observability tools (Prometheus, Grafana, Datadog, ELK stack, etc.)
- • Solid understanding of CI/CD pipelines and automated deployment practices
- • Excellent communication skills with the ability to collaborate across engineering, product, and operations teams
🏖️ Benefits
- • Opportunity to work on a platform transforming global energy systems toward sustainability
- • Flexible work arrangements with options for remote work within Australia
- • Collaborative, impact-driven culture focused on innovation in renewable energy technology
- • Professional development support for advancing skills in reliability engineering and cloud architecture
Skills & Technologies
About Kraken Technologies Limited
Kraken builds the Kraken platform — a cloud-native, AI-powered operating system for utilities and energy companies that automates the energy supply chain (customer lifecycle, billing/CRM, trading and asset optimisation, migration from legacy systems) to enable faster product innovation, lower operating costs, and support distributed/renewable energy use.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Real Chemistry LLC
2 months ago

Wealth Enhancement Group, LLC
7 hours ago

Synchrony Financial
7 hours ago
