Apiphany Inc. logo

Associate Data Scientist

Job Overview

Location

Remote

Job Type

Full-time

Category

Data Scientist

Date Posted

February 26, 2026

Full Job Description

đź“‹ Description

  • • Apiphany Inc. is at the forefront of innovation, leveraging cutting-edge AI and Machine Learning to redefine industry standards. We are seeking a motivated and detail-oriented Associate Data Scientist to join our dynamic team and play a pivotal role in shaping the future of our LLM-driven systems. This is a unique opportunity to contribute to real-world data processing, pipeline development, and model evaluation, directly impacting the performance and efficacy of our advanced AI solutions. As an Associate Data Scientist, you will be instrumental in transforming raw data into high-quality, structured datasets that fuel our sophisticated machine learning models.
  • • Your primary responsibility will be the meticulous processing and cleaning of both structured and unstructured data. This involves identifying and rectifying inconsistencies, handling missing values, and transforming disparate data sources into a cohesive format suitable for AI/ML pipelines. You will be deeply involved in preparing training-ready datasets, a critical step for the successful fine-tuning and rigorous evaluation of Large Language Models (LLMs). This hands-on work ensures that our models learn from the most accurate and relevant information, leading to superior performance and reliability.
  • • A significant aspect of your role will be supporting our Retrieval Augmented Generation (RAG) and Natural Language to SQL (NL SQL) systems. This involves not only preparing the data that these systems consume but also rigorously validating its quality and consistency. You will ensure that the data fed into these systems is accurate, up-to-date, and formatted correctly, which is crucial for their effective operation and for generating reliable outputs. Your efforts will directly contribute to the intelligence and responsiveness of these advanced AI components.
  • • Data quality is paramount in AI/ML development, and you will be responsible for performing comprehensive data quality checks. This includes verifying data completeness, ensuring consistency across various datasets, and identifying any anomalies or errors that could negatively impact model training or performance. Establishing and adhering to strict data quality protocols will be a key part of your contribution.
  • • You will also assist in the development and maintenance of robust data pipelines and APIs. This may involve working with tools and frameworks like FastAPI to create efficient data ingestion and retrieval mechanisms. By contributing to the infrastructure that supports our data workflows, you will help ensure seamless data flow and accessibility for the wider engineering team.
  • • Collaboration is at the heart of our success. You will work closely with experienced data scientists and machine learning engineers to troubleshoot complex data-related issues, optimize existing data workflows, and implement new data processing strategies. Your ability to communicate technical findings clearly and work effectively within a team will be essential.
  • • This role offers a fantastic learning environment where you can deepen your understanding of data science principles and gain practical experience with state-of-the-art AI technologies. You will have the opportunity to learn about prompt engineering, model evaluation techniques, and the intricacies of deploying LLMs in production environments. Your contributions will be visible and valued, as you help build the data foundations for next-generation AI applications.
  • • We are looking for individuals who are passionate about data and its potential to drive innovation. If you are eager to roll up your sleeves, tackle challenging data problems, and contribute to a team that is pushing the boundaries of AI, this role is an excellent fit for you. You will be an integral part of a forward-thinking company that values continuous learning and professional growth.

🎯 Requirements

  • • 1-3 years of experience in data processing, data analysis, or other data-focused roles.
  • • Strong proficiency in Python, including extensive experience with data manipulation and analysis libraries such as Pandas, NumPy, and Scikit-learn.
  • • Demonstrated experience supporting LLM workflows, which may include fine-tuning, prompt engineering, or model evaluation.
  • • Familiarity with both structured data (e.g., SQL databases) and unstructured text data.
  • • Solid understanding of data preparation principles and best practices for AI/ML systems.
  • • Bachelor's or Master's degree in Computer Science, Data Science, Statistics, or a related quantitative field.

🏖️ Benefits

  • • Competitive salary and performance-based bonuses.
  • • Comprehensive health, dental, and vision insurance.
  • • Generous paid time off (PTO) and company holidays.
  • • Remote work flexibility, allowing you to work from anywhere.
  • • Opportunities for professional development, including training, conferences, and certifications.
  • • Collaborative and innovative work environment with a focus on continuous learning.

Skills & Technologies

Python
FastAPI
Docker
TensorFlow
PyTorch
Data Science
Junior
Remote

Ready to Apply?

You will be redirected to an external site to apply.

Apiphany Inc. logo
Apiphany Inc.
Visit Website

About Apiphany Inc.

Apiphany is a cloud-based platform designed to help businesses manage and monetize their APIs. It provides tools for API discovery, design, security, and analytics, enabling companies to create robust API ecosystems. The platform supports various industries, including finance, healthcare, and e-commerce, by facilitating seamless integration and data exchange. Apiphany's solutions aim to accelerate digital transformation and drive innovation through effective API management strategies. They focus on empowering developers and businesses to build, secure, and scale their API offerings efficiently, fostering new revenue streams and improving operational agility.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

London
Full-time
Expires May 14, 2026
Python
Data Science
Senior
+1 more

11 days ago

Apply
Brazil - Sao Paolo
Full-time
Expires Apr 25, 2026
Data Science
Junior
Remote

1 month ago

Apply
SĂŁo Paulo, Brazil
Full-time
Expires Apr 25, 2026
Python
Apache Spark
Onsite
+1 more

1 month ago

Apply
Canada
Full-time
Expires May 9, 2026
Python
Spring
TensorFlow
+4 more

16 days ago

Apply