Cohere Inc. logo

Lead Member of Technical Staff, Inference Infrastructure

Job Overview

Location

San Francisco

Job Type

Full-time

Category

Software Engineering

Date Posted

April 28, 2026

Full Job Description

📋 Description

  • Lead Member of Technical Staff role focused on designing and operating high-performance, scalable, and reliable machine learning infrastructure for Cohere's AI platform serving large language models via API endpoints.
  • Provide technical leadership across multiple teams, driving architecture and strategy for deploying optimized NLP models in low latency, high throughput, and high availability environments; serve as key customer contact for customized deployments and mentor engineers to raise technical standards.
  • Cohere is a team of researchers, engineers, designers, and more passionate about scaling intelligence to serve humanity through frontier models for developers and enterprises building AI systems for content generation, semantic search, RAG, and agents.
  • Opportunity to shape the next generation of AI platforms, lead complex infrastructure initiatives, mentor engineers, and drive technical excellence in distributed systems and accelerator utilization at scale.

Skills & Technologies

Go
AWS
Azure
GCP
Kubernetes
DevOps
Senior
Remote

Ready to Apply?

You will be redirected to an external site to apply.

Cohere Inc. logo
Cohere Inc.
Visit Website

About Cohere Inc.

Cohere provides large language models and retrieval-augmented generation APIs for enterprise developers to embed conversational AI, search, summarization, and content generation into applications. Founded in 2021 by former Google Brain researchers, the company offers cloud and on-premise deployment, fine-tuning tools, and multilingual support to help organizations automate workflows, improve customer support, and analyze unstructured data while maintaining data privacy and security controls.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Remote - US
Full-time
Expires Jul 7, 2026
Python
AWS
Azure
+4 more

29 days ago

Apply
Fermi AI, Inc. logo

Fermi AI, Inc.

Bangalore, India
Full-time
Expires Jul 24, 2026
Onsite
Degree Required

13 days ago

Apply
Expired
London (GB)
Full-time
Expired Jun 3, 2026
Junior
Onsite

2 months ago

Apply
Work from Home, United States
Full-time
Expires Jun 24, 2026
Java
Spring
AWS
+4 more

1 month ago

Apply