
Job Overview
Location: Israel
Job Type: Full-time
Category: Software Engineering
Date Posted: May 26, 2026
Full Job Description
📋 Description
- Own the inference and orchestration layer that powers every AI interaction in the product, sitting between large language models and end users across mobile and desktop clients.
- Build and operate production backend systems that serve AI-powered features with high reliability, low latency, and high throughput under real-world usage conditions.
- Design and implement inference pipelines, orchestration layers, and service boundaries around LLMs, embeddings, and multimodal models to ensure stable and scalable AI behavior.
- Optimize latency and throughput across inference, caching, batching, and streaming mechanisms to improve user experience and system efficiency.
- Own end-to-end production concerns including monitoring, logging, alerting, and incident response for AI-driven services handling continuous, non-deterministic model outputs.
- Develop and maintain stable, well-documented APIs that enable seamless integration with frontend applications and machine learning systems.
- Debug and resolve distributed system failures under load, ensuring system resilience despite unpredictable model behavior and external tool dependencies.
- Implement and iterate on systems that support persistent context, multi-step reasoning, and long-running workflows required for real-world task completion.
- Collaborate with ML and frontend teams to align system design with product goals, ensuring APIs are clear, consistent, and support evolving AI capabilities.
- Continuously improve system performance and reliability through iterative, data-driven enhancements based on real usage patterns and production incidents.
- Operate in a high-talent-density, hands-on team environment where decisions are made collectively, speed is prioritized, and independent execution with judgment is expected.
- Work with a tech stack including Python, Node.js, PyTorch, OpenAI, Anthropic, open-source LLMs, SQL and NoSQL databases, Kubernetes, and Docker to deliver production-grade AI systems.
- Ensure systems are observable, maintainable, and capable of scaling to support global user demand while minimizing user impact from failures.
- Contribute to a product vision focused on bringing practical, reliable AI intelligence to everyday conversations, errands, organization, and workflows.
🎯 Requirements
- Strong backend engineering fundamentals in production environments
- Experience running high-throughput, low-latency services
- Familiarity with AI inference patterns (LLMs, embeddings, multimodal)
- Comfortable debugging distributed systems under load
- Bias toward shipping and learning from production behavior
- Proficiency in Python, Node.js, PyTorch, Kubernetes, Docker, SQL/NoSQL
🏖️ Benefits
- Opportunity to join a high-talent-density, hands-on team building a globally impactful AI product
- Collaborative decision-making culture with rapid iteration and learning
- Transparent and efficient interview process with prompt decision-making
- Direct impact on bringing practical AI benefits to billions of users
About Bjak Sdn. Bhd.
Bjak operates Malaysia’s largest digital auto-insurance marketplace, enabling instant price comparison and online purchase of motor coverage from leading insurers. The platform uses proprietary technology to simplify complex tariffs, deliver personalised quotes and e-policy issuance within minutes, eliminating paperwork and agent visits. Licensed by Bank Negara Malaysia, Bjak also offers road-tax renewal, accident assistance and claims support, serving millions of drivers nationwide while partnering with insurers to increase digital distribution efficiency and customer reach.