
Job Overview
Location
San Francisco, US
Job Type
Full-time
Category
Backend Engineer
Date Posted
March 23, 2026
Full Job Description
đź“‹ Description
- • As a Backend Engineer, AI at Bjak Sdn. Bhd., you will own the inference and orchestration layer that powers every AI interaction in the product, turning model capability into fast, stable, observable APIs used across mobile and desktop clients. Your work directly impacts real-world user experience by ensuring low latency, high correctness, reliability, and cost efficiency in production AI systems.
- • Day to day, you will build and operate production backend systems that serve AI-powered features, design inference pipelines and orchestration layers around LLMs and embeddings, own production concerns including monitoring, logging, alerting, and incident response, optimize latency and throughput through caching, batching, and streaming strategies, debug distributed systems under load, and collaborate with ML and frontend teams to ensure seamless API integration and seamless user experiences.
- • You will join a high-talent-density, hands-on team in San Francisco that values collective decision-making, rapid execution, and balancing high-quality output with continuous learning. The team operates with autonomy and judgment, focused on shipping meaningful AI-driven products that deliver practical benefits to users globally.
- • In this role, you will deepen your expertise in production AI infrastructure, gain hands-on experience with cutting-edge LLM serving patterns (including open-source and proprietary models like OpenAI and Anthropic), master observability and reliability engineering at scale, and contribute to systems that handle real-world AI traffic with measurable impact on performance, cost, and user satisfaction—positioning you at the forefront of applied AI engineering.
🎯 Requirements
- • Strong backend engineering fundamentals with proven experience building and operating high-throughput, low-latency production services
- • Proficiency in Python and/or Node.js, with experience deploying and managing services in Kubernetes and Docker environments
- • Familiarity with AI inference patterns, including working with LLMs (such as OpenAI, Anthropic, or open-source models), embeddings, and multimodal systems
- • Comfort debugging distributed systems under load, with strong skills in monitoring, logging, alerting, and incident response
- • Bias toward shipping, learning from production behavior, and iterating based on real-world usage and performance data
🏖️ Benefits
- • Opportunity to work on cutting-edge AI infrastructure that powers real-user-facing AI features at scale
- • High autonomy and ownership in a small, high-talent-density team where your impact is immediate and measurable
- • Exposure to modern AI/ML tooling including PyTorch, Kubernetes, Docker, and leading LLM providers
- • Collaborative, transparent culture with rapid feedback loops and a focus on learning from production behavior
- • Based in San Francisco with access to a vibrant tech ecosystem and opportunities to engage with industry leaders in AI and infrastructure
Skills & Technologies
About Bjak Sdn. Bhd.
Bjak operates Malaysia’s largest digital auto-insurance marketplace, enabling instant price comparison and online purchase of motor coverage from leading insurers. The platform uses proprietary technology to simplify complex tariffs, deliver personalised quotes and e-policy issuance within minutes, eliminating paperwork and agent visits. Licensed by Bank Negara Malaysia, Bjak also offers road-tax renewal, accident assistance and claims support, serving millions of drivers nationwide while partnering with insurers to increase digital distribution efficiency and customer reach.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Silver.com LLC
1 month ago

