Speech Research Intern-2

Centific Global Technologies Pte. Ltd.

Job Overview

Location

Remote Work( USA)

Job Type

Full-time

Full Job Description

📋 Description

• Design and evaluate speech-first models, with a focus on Spoken Language Models (SLMs) that reason over audio and engage in conversational interactions.
• Develop end-to-end speech dialogue systems that process speech input and generate speech output.
• Align speech encoders with text backbones using lightweight adapters to enable multimodal reasoning.
• Design efficient speech tokenization and temporal compression techniques optimized for long-form audio processing.
• Build and execute reliable evaluation frameworks covering speech recognition, understanding, and generation tasks, including robustness and safety metrics.
• Optimize inference pipelines for low-latency, streaming, and real-time user experiences.
• Prototype conversational SLMs using self-supervised learning (SSL) speech encoders and compact adapters on existing LLMs, comparing against established baselines.
• Create data recipes that combine conversational speech datasets with instruction-following corpora, conducting targeted ablations and documenting findings.
• Construct an evaluation harness that measures performance across ASR, ST, SLU, and speech QA tasks, including streaming-specific metrics like latency, stability, and endpointing.
• Ship minimal production-ready demos with streaming inference and logging capabilities, documenting setup procedures, metrics, and reliability checks.
• Author clear internal technical write-ups outlining project goals, design decisions, results, and actionable next steps for productionization.
• Work closely with applied scientists and engineers to translate research prototypes into practical, scalable AI solutions.
• Utilize PyTorch, CUDA, torchaudio, and librosa for model development and training on GPU infrastructure.
• Implement experiment tracking using tools such as Weights & Biases to ensure reproducibility and iterative improvement.
• Integrate LLM backbones with lightweight adapters and apply neural audio codecs or vocoders as needed for speech generation.
• Deploy efficient inference systems using FastAPI/gRPC, ONNX, TensorRT, and quantization techniques.
• Operate in a remote-first environment with flexible scheduling, aligned to team collaboration needs.
• Contribute to a research culture focused on fast, responsible experimentation with access to cutting-edge AI infrastructure.
• Engage in mentorship opportunities with senior scientists and engineers, with potential pathways to publish findings or present at internal/external forums.

Skills & Technologies

Python

FastAPI

gRPC

PyTorch

SSL

Junior

Remote

Degree Required

Ready to Apply?

Apply Externally

You will be redirected to an external site to apply.

AI Job Fit Analysis

Pro

See exactly how your profile matches this role — strengths, skill gaps, and what to do about them.

Centific Global Technologies Pte. Ltd.

Visit Website

About Centific Global Technologies Pte. Ltd.

Centific is a data-centric AI services company providing data collection, annotation, and model validation solutions to enterprises and technology vendors. It operates a global crowd platform that combines human intelligence with automation to prepare, curate, and test datasets for computer vision, NLP, and generative AI applications. The company supports full AI lifecycle needs, from training data to reinforcement learning and model safety, serving industries including retail, automotive, healthcare, and technology. Headquartered in Singapore, Centific maintains delivery centers across Asia, Europe, and North America.

View Company Profile

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.