
Job Overview
Location
Remote
Job Type
Full-time
Category
Data Engineer
Date Posted
February 22, 2026
Full Job Description
đź“‹ Description
- • Join a pioneering AI-driven technology company at the forefront of building innovative forecasting and attribution intelligence products. This is a unique opportunity to contribute to the core of their offerings by developing and maintaining robust, scalable data pipelines that are essential for high-quality, analytics-ready data. As a Data Engineer, you will play a pivotal role in ensuring the reliability and efficiency of the data infrastructure that powers advanced analytics, predictive forecasting, and sophisticated AI use cases.
- • This is a hands-on, execution-focused contract role where your primary mission will be to build and maintain the foundational data pipelines. You will be instrumental in ensuring data quality, guaranteeing pipeline reliability, and fostering seamless collaboration across data engineering, analytics, data science, and product teams. Your work will directly impact customer onboarding, reporting workflows, and the development of cutting-edge AI applications within a dynamic and fast-paced environment.
- • You will be working within a modern analytics engineering stack, leveraging powerful tools such as Python for scripting and data manipulation, dbt (Data Build Tool) for transforming data in your warehouse, and Dagster for orchestrating complex data workflows. This stack is designed for efficiency, maintainability, and scalability, allowing you to build sophisticated data solutions.
- • A significant aspect of your role will involve the development and orchestration of data pipelines. This includes building scalable, fault-tolerant ELT (Extract, Load, Transform) pipelines using Python, ensuring that data can be efficiently moved and prepared for analysis. You will also be responsible for orchestrating and monitoring these data workflows using Dagster, a modern data orchestrator that provides robust scheduling, dependency management, and execution capabilities.
- • Troubleshooting pipeline failures, performance bottlenecks, and data inconsistencies will be a critical part of your day-to-day responsibilities. You will need to proactively monitor pipeline health using observability tools and key metrics, identifying and resolving issues swiftly to maintain data flow and integrity.
- • Beyond pipeline development, you will engage in analytics engineering and data modeling. This involves developing, optimizing, and meticulously documenting dbt models. These models are crucial for transforming raw data into clean, analytics-ready datasets that can be readily consumed by business intelligence tools, forecasting algorithms, and machine learning models. You will contribute to the continuous improvement of existing data workflows, adapting them to evolving product requirements and business needs.
- • A strong emphasis will be placed on data quality and reliability. You will implement and maintain comprehensive data quality checks and testing strategies, ensuring that the data is accurate, complete, and trustworthy. Adhering to established team standards for Service Level Agreements (SLAs), code quality, and deployment processes is paramount to maintaining a high standard of operational excellence.
- • Cross-functional collaboration is key to success in this role. You will work closely with data scientists to support their forecasting and AI-driven use cases, providing them with the data they need to build and refine their models. Furthermore, you will collaborate with analytics and product teams to ensure that the data infrastructure effectively meets all business and product requirements, bridging the gap between data capabilities and business objectives.
- • This contract role offers a compelling opportunity to make a direct impact on AI-driven products. You will be building high-impact data infrastructure that is fundamental to forecasting, attribution, and reporting functionalities. By working within a cutting-edge analytics engineering stack and collaborating closely with talented technical teams, you will gain invaluable hands-on experience with real-world AI and analytics use cases in a fast-paced, product-driven environment. This role is ideal for a self-starter who thrives on delivering results in a contract setting and is eager to contribute to the success of a forward-thinking technology company.
- • The role requires availability during U.S. business hours, specifically 9 AM - 5 PM EST, to align with client time zones. This ensures seamless communication and collaboration with U.S.-based clients and teams. Candidates are sought from LATAM, Africa, and Eastern Europe, bringing diverse perspectives and talent to the organization.
Skills & Technologies
Python
Pandas
Remote
About Scale AI
Scale AI provides data infrastructure for artificial intelligence, offering data labeling, model evaluation, and generative AI application development tools for enterprises and government agencies. Founded in 2016 and headquartered in San Francisco, the company supplies high-quality training datasets to automotive, defense, e-commerce, and technology customers, enabling deployment of accurate machine-learning models at scale.



