Innodata Sr. Language Data Scientist, Search Specialization

Job Overview

Location

Remote job

Job Type

Full-time

Category

Data Scientist

Date Posted

February 23, 2026

Full Job Description

Description

  • As a Senior Language Data Scientist specializing in Search, you will be at the forefront of innovation, driving the advancement of search and information retrieval applications powered by cutting-edge Generative AI (GenAI) technologies. Innodata, a leading data engineering company with a global presence and a strong track record of serving major technology firms, is building a specialized team to tackle complex search challenges. This role offers a unique opportunity to work hands-on with diverse, multi-modal, and multilingual search-specific datasets, including queries, documents, and relevance judgments, collaborating closely with search engineers, product teams, and directly with clients.
  • Your core mission will involve leveraging your deep expertise in query understanding, semantic matching, and ranking systems to enhance search relevance and user experience. You will be instrumental in developing and refining human and synthetic data workflows, crucial for training and evaluating advanced AI models. This includes a strong focus on understanding and addressing the nuances of search data, such as inter-rater disagreement on relevance, context-dependent query interpretation, and the temporal and geographic aspects of information needs.
  • You will lead long-term, high-complexity projects from inception to completion, setting the strategic plan and driving them towards success. This involves not only technical execution but also strategic thinking and process design excellence. Your ability to navigate ambiguity, utilize your technical knowledge, and effectively manage multiple stakeholders will be key to delivering innovative solutions.
  • A significant aspect of this role is the design and improvement of workflows for creating, validating, and annotating search-specific data. This encompasses query-document pairs, relevance judgments, query intent classifications, and assessments of search result quality across various domains like web search, e-commerce, and specialized vertical search applications. You will also engage with multimodal search scenarios, such as image and product search.
  • You will consult directly with customers to understand their business objectives and translate them into actionable data strategies and processes. Generating actionable insights from client processes and products will be vital for driving continuous improvement and innovation. Furthermore, you will advise and support business unit heads on client engagement, ensuring a clear understanding of the upstream activities that leverage Innodata's services.
  • Key responsibilities include designing and refining search data annotation frameworks. This involves creating detailed relevance judging guidelines that can effectively handle subtle query-document relationships, the inherent ambiguity of user queries, and domain-specific search challenges like the need for freshness in news search or understanding user intent in product search.
  • You will dive deep into existing workflows and processes, gathering data and insights to identify areas for improvement. Your recommendations will drive innovation and foster cross-functional collaboration with both internal teams and external clients.
  • A critical part of the role involves assessing and optimizing search-specific evaluation approaches. This includes designing and implementing A/B testing frameworks, defining and tracking key ranking metrics, and conducting human evaluation studies to ensure the quality and effectiveness of search results.
  • You will critically assess annotation tooling and workflows, ensuring they are efficient, scalable, and produce high-quality data. This involves quantitative analysis of large datasets, performing statistical analysis, calculating performance metrics, and making data-driven recommendations to enhance accuracy and overall system performance.
  • Close collaboration with client stakeholders is essential. You will work with them to understand their goals, gather detailed requirements, propose tailored solutions, and oversee their execution. Your work will directly impact the development of next-generation search capabilities.
  • You are expected to set an ambitious research agenda focused on continuous improvement of our products and services. This includes contributing to the establishment of best practices and standards for generative AI development, both within Innodata and for our clients, ensuring we remain at the forefront of the industry.
  • This role requires a strong understanding of machine learning, Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), neural ranking architectures, and dense retrieval methods. Your ability to critically assess how GenAI techniques can be applied to improve search relevance, ranking, and user experience will be paramount.

Skills & Technologies

Data Science
Senior
Remote

About Innodata Inc.

Innodata Inc. is a global digital services company that provides data engineering, data annotation, and content transformation solutions. They specialize in helping businesses leverage artificial intelligence and machine learning by preparing and structuring large datasets for training AI models. Their services cater to various industries, including technology, finance, healthcare, and automotive, enabling clients to improve operational efficiency, enhance customer experiences, and drive innovation through data-driven insights. Innodata's expertise lies in managing complex data challenges and delivering high-quality, scalable solutions.
