
Job Overview
Location
Florianópolis/ Remote
Job Type
Full-time
Category
DevOps
Date Posted
May 23, 2026
Full Job Description
📋 Description
- • Design, build, and maintain scalable, resilient, and secure infrastructure systems that support Loadsmart’s logistics technology platform.
- • Collaborate with engineering squads across platform engineering to improve system reliability, performance, and observability.
- • Implement and optimize monitoring, alerting, and incident response systems to ensure high availability and rapid recovery from failures.
- • Automate operational tasks using infrastructure-as-code tools to reduce manual toil and increase deployment velocity.
- • Analyze system bottlenecks and failure modes to propose and implement safer, more efficient architectures and processes.
- • Participate in on-call rotations to respond to production incidents, conduct post-mortems, and drive preventive improvements.
- • Work closely with developers to embed SRE practices into the software development lifecycle, including CI/CD pipeline optimization.
- • Ensure compliance with security best practices and infrastructure standards across all environments.
- • Document system architectures, runbooks, and operational procedures to enable knowledge sharing and reduce dependency on individuals.
- • Continuously evaluate emerging tools and technologies to enhance platform stability, scalability, and efficiency.
- • Advocate for reliability as a shared responsibility across engineering teams, fostering a culture of ownership and proactive problem-solving.
- • Contribute to capacity planning and resource forecasting to align infrastructure growth with business objectives.
- • Support the migration and modernization of legacy systems to cloud-native architectures with minimal disruption.
- • Maintain a strong focus on operational excellence by measuring and improving key SRE metrics such as SLIs, SLOs, and error budgets.
- • Engage with cross-functional teams to translate business requirements into technical solutions that enhance system reliability and user experience.
🎯 Requirements
- • Proven experience as a Site Reliability Engineer or similar role in a high-availability production environment
- • Strong proficiency in Linux/Unix systems and shell scripting
- • Hands-on experience with cloud platforms (AWS, GCP, or Azure)
- • Experience with infrastructure-as-code tools such as Terraform or CloudFormation
- • Familiarity with containerization and orchestration technologies (Docker, Kubernetes)
- • Experience with monitoring and observability tools (Prometheus, Grafana, Datadog, or similar)
🏖️ Benefits
- • Fully remote work opportunity within Brazil
- • Competitive salary and performance-based bonuses
- • Health and dental insurance coverage
- • Flexible work hours and paid time off
Skills & Technologies
About Loadsmart Inc.
Loadsmart Inc. provides digital freight brokerage and capacity-matching technology for shippers and carriers in North America. The cloud platform automates pricing, booking, and shipment execution for dry van, refrigerated, flatbed, and drayage freight. Integrations with TMS and ELD systems enable instant quotes, real-time tracking, and predictive analytics to optimize routes, reduce empty miles, and improve service levels. Founded in 2014, the company leverages AI and machine-learning models to balance supply and demand across its network of carriers and shippers.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Web.com Group, Inc.
14 days ago


