
Job Overview
Location
San Francisco
Job Type
Full-time
Category
Software Engineering
Date Posted
April 28, 2026
Full Job Description
📋 Description
- • Lead Member of Technical Staff role focused on designing and operating high-performance, scalable, and reliable machine learning infrastructure for Cohere's AI platform serving large language models via API endpoints.
- • Provide technical leadership across multiple teams, driving architecture and strategy for deploying optimized NLP models in low latency, high throughput, and high availability environments; serve as key customer contact for customized deployments and mentor engineers to raise technical standards.
- • Cohere is a team of researchers, engineers, designers, and more passionate about scaling intelligence to serve humanity through frontier models for developers and enterprises building AI systems for content generation, semantic search, RAG, and agents.
- • Opportunity to shape the next generation of AI platforms, lead complex infrastructure initiatives, mentor engineers, and drive technical excellence in distributed systems and accelerator utilization at scale.
Skills & Technologies
About Cohere Inc.
Cohere provides large language models and retrieval-augmented generation APIs for enterprise developers to embed conversational AI, search, summarization, and content generation into applications. Founded in 2021 by former Google Brain researchers, the company offers cloud and on-premise deployment, fine-tuning tools, and multilingual support to help organizations automate workflows, improve customer support, and analyze unstructured data while maintaining data privacy and security controls.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Fair Isaac Corporation
1 month ago


