Level 2 (L2) Cloud Operations Engineer

Job Overview

Location

Indiana, USA

Job Type

Full-time

Category

DevOps & SysAdmin

Date Posted

February 12, 2026

Full Job Description

📋 Description

  • As a Level 2 (L2) Cloud Operations Engineer at Knox Systems, you will be at the forefront of managing and securing critical federal cloud environments. This role is pivotal in ensuring the stability, reliability, and compliance of the largest managed federal cloud infrastructure, supporting the U.S. government's most vital missions. You will operate within a dynamic 24x7 Network and Cloud Operations Center (NOC), contributing to national security, public safety, and essential public services.
  • Your primary responsibility will be advanced troubleshooting, system administration, and comprehensive application environment support across Knox's cloud infrastructure. This position acts as a crucial bridge between operations, automation, and development support, ensuring the integrity of systems, executing complex changes, and rigorously adhering to compliance standards within FedRAMP Moderate, High, and IL4 environments.
  • The ideal candidate possesses hands-on experience operating compliance-controlled cloud environments, ideally within a NOC/SOC setting. You should have a deep understanding of cloud infrastructure services and proven experience in responding to real-time alerts, managing incidents, and handling escalations in production environments. This is a shift-based role, requiring dedication to maintaining continuous operational coverage during assigned shifts, including participation in a rotating on-call schedule for after-hours incidents and holiday support.
  • Key responsibilities include performing advanced troubleshooting for a wide range of infrastructure, operating system, and application issues. You will meticulously analyze system logs, metrics, and telemetry data from sophisticated monitoring platforms such as Grafana, Datadog, Wiz, and CrowdStrike to diagnose problems and identify root causes.
  • You will collaborate closely with Platform DevOps Engineers to conduct thorough root cause analysis (RCA) and implement long-term remediation strategies, ensuring timely resolution of escalated incidents in strict accordance with Service Level Agreements (SLAs).
  • A significant part of your role will involve managing and maintaining AWS, Azure, and hybrid cloud environments, ensuring strict adherence to NIST 800-53 controls and other relevant federal compliance frameworks.
  • You will execute essential system maintenance tasks, including patching, upgrades, and configuration changes, leveraging automation and scripting where possible to enhance efficiency and reduce manual error.
  • Performing health checks, validating deployments, and conducting post-change verifications will be critical to ensure the stability and integrity of the production environment.
  • Maintaining up-to-date infrastructure documentation and accurate system configuration inventories is essential for operational clarity and audit readiness.
  • You will engage in advanced application troubleshooting for web-based applications and common application architectures, diagnosing and resolving app-layer issues such as API failures, integration errors, or misconfigurations.
  • Working in tandem with DevOps Platform teams, you will help optimize CI/CD deployment workflows and refine rollback plans to minimize downtime and risk during deployments.
  • Ensuring strict adherence to change management protocols and deployment authorization processes is paramount to maintaining a secure and compliant operational posture.
  • You will contribute to the creation or modification of automation scripts using Bash, Python, or PowerShell to streamline maintenance tasks and reporting.
  • Leveraging infrastructure-as-code tools like Terraform and configuration management tools like Ansible, or cloud-native equivalents, will be key to provisioning resources and ensuring environment consistency.
  • Proactively identifying opportunities to automate recurring operational processes will be highly valued, driving continuous improvement within the operations team.
  • Documenting system changes and detailed incident response procedures is crucial for supporting FedRAMP audits and demonstrating compliance.
  • You will actively support Continuous Monitoring (ConMon) activities by generating vulnerability reports, tracking patch compliance, and assisting in the maintenance of logs, baselines, and access control evidence.
  • This position is customer-facing, requiring professional and clear communication with clients during incident response. This includes answering support phone calls and participating in customer meetings via Zoom or other collaboration tools, ensuring a high level of customer satisfaction and trust.
  • By joining Knox Systems, you become part of a mission-driven organization where your expertise directly impacts national security and public services. You will work with cutting-edge technology in a highly secure and compliant environment, solving complex challenges at federal scale. Your contributions will be visible, your expertise will be relied upon, and the impact of your work will be immediate and measurable.

🎯 Requirements

  • 3-5 years of experience in cloud operations, system administration, or infrastructure support, with a strong emphasis on cloud environments.
  • Hands-on experience with CrowdStrike Falcon endpoint protection, including analyzing detections, reviewing IOM/IOA telemetry, assessing endpoint vulnerability exposure, and executing or supporting SOAR-based automated response actions.
  • Proven experience using Grafana or Datadog for operational monitoring and incident response, including building and maintaining dashboards, analyzing time-series metrics, and correlating alerts to identify performance degradation, availability issues, and system failures in production environments.
  • Proficiency in command-line troubleshooting and strong working knowledge of AWS and/or Azure infrastructure services.
  • Familiarity with CI/CD pipelines, deployment automation tools, and experience writing and maintaining scripts (Bash, Python, PowerShell).
  • Familiarity with FedRAMP, NIST 800-53, or similar compliance environments is highly desirable.
  • U.S. Citizenship is required due to the nature of our work with federal government clients and compliance with applicable regulations.

🏖️ Benefits

  • Comprehensive Medical, Dental, and Vision insurance plans.
  • Life and Disability insurance coverage.
  • Unlimited Paid Time Off (PTO) through a Professional Employer Organization (PEO).
  • Employee-funded 401(k) plan for retirement savings.

Skills & Technologies

Python
AWS
Azure
Terraform
Grafana
Remote

About Knox Systems

Knox Systems is a technology company focused on providing secure and reliable solutions for data management and protection. They specialize in developing advanced software and hardware that ensures the integrity, confidentiality, and availability of critical information for businesses across various sectors. Their offerings often include robust encryption, secure storage, and comprehensive data recovery services. Knox Systems aims to empower organizations to safeguard their digital assets against evolving threats and compliance challenges, enabling them to operate with confidence and maintain business continuity. The company is dedicated to innovation and customer-centric support, striving to deliver peace of mind through superior technology and expertise.
