This job has expired

This position was posted on February 22, 2026 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Oceansxyz Inc. logo

G2 19 NB D - Data Engineer

Job Overview

Location

Remote

Job Type

Full-time

Category

Data Engineer

Date Posted

February 22, 2026

Full Job Description

đź“‹ Description

  • • As a Data Engineer at Oceansxyz, you will be at the forefront of building and architecting the complete data foundation for an innovative internal LLM-driven analytics platform. This platform is directly integrated with Amazon Seller Central, empowering businesses with advanced insights.
  • • Your core responsibility will be to own the entire data lifecycle, from ingestion and normalization to warehousing, semantic modeling, and ensuring query-ready access. This encompasses a wide array of critical datasets, including commerce, advertising, operational, and financial data, spanning multiple client brands.
  • • This role is a unique blend of deep data engineering expertise, specialized knowledge of Amazon's SP-API, analytics architecture design, and enabling Large Language Models (LLMs) for data analysis. It demands a strong sense of ownership and a holistic systems-thinking approach to deliver a robust, scalable, and reliable Business Intelligence (BI) platform.
  • • The ultimate goal is to support natural-language analytics for Amazon sellers, making complex data accessible and actionable through intuitive interfaces.
  • • Your success will be measured by the scalability, reliability, and auditability of the Amazon data platform you construct. This platform will be the bedrock for deterministic analytics, providing trusted business insights and paving the way for future product expansions.
  • • **Amazon Data Ingestion & Integration:**
  • • Establish and meticulously manage authenticated SP-API connections across a diverse portfolio of client brands.
  • • Ingest data from a comprehensive range of Amazon endpoints, including Orders, Reports, Advertising, Inventory, and Financial data.
  • • Implement robust strategies to handle API throttling, pagination, automatic retries, and strict rate-limit constraints, ensuring uninterrupted data flow.
  • • Develop and deploy efficient incremental data loading processes, manage historical data backfills, and establish reliable failure recovery mechanisms.
  • • Continuously work to expand the coverage and usability of Amazon datasets within the platform.
  • • **Data Pipeline & Warehouse Architecture:**
  • • Design, implement, and maintain scalable ELT/ETL pipelines that transform raw ingested data into analytics-ready schemas.
  • • Set up and manage a centralized data warehouse solution (e.g., Snowflake, BigQuery, Redshift, Postgres, or a comparable technology).
  • • Normalize multi-client and multi-brand datasets, incorporating historical data retention and versioning for comprehensive analysis.
  • • Model core e-commerce domains, including sales performance, advertising metrics, FBA inventory data, and profitability analysis.
  • • Ensure the architecture is designed for long-term maintainability, enhanced observability, and readiness for future scale.
  • • **Semantic Layer & LLM Query Enablement:**
  • • Define and standardize key e-commerce metrics such as sales, net revenue, ad spend, TACoS (Total Advertising Cost of Sale), and their underlying drivers.
  • • Create curated semantic or data mart layers specifically designed for deterministic querying by LLMs.
  • • Implement essential guardrails for filters, date logic, and metric definitions to ensure consistency and accuracy.
  • • Guarantee that query outputs are reproducible, traceable, and directly grounded in the source-of-truth tables.
  • • Collaborate closely with the client's founders on critical LLM model and architecture decisions.
  • • **Data Quality, Monitoring & Reliability:**
  • • Implement comprehensive data validation checks, anomaly detection systems, and establish Service Level Agreements (SLAs) for data freshness.
  • • Maintain robust logging, alerting, and incident-response readiness across all data pipelines.
  • • Ensure the auditability and accuracy of all analytics outputs, building trust in the data.
  • • Proactively identify and resolve reliability risks and operational gaps before they impact users.
  • • **Documentation & Operational Continuity:**
  • • Create detailed documentation for API integrations, data schemas, data refresh cadences, and critical assumptions.
  • • Produce clear and concise handoff materials to facilitate internal continuity and knowledge transfer.
  • • Prepare the architecture for secure multi-tenant capabilities and future evolution into an external-facing product.
  • • Foster a documentation-first mindset and a commitment to long-term ownership and system understanding.
  • • You will work directly with the client's founders, acting as the primary technical owner of the data platform. This will involve close collaboration on architecture, LLM strategy, and overall product readiness.
  • • Within Oceans, you will receive dedicated guidance from your Operations Manager to support your delivery excellence, professional growth, and long-term success in this impactful role.
  • • This role emphasizes T-shaped individuals: deep expertise in data architecture combined with broad curiosity across e-commerce analytics, LLM enablement, and pragmatic product thinking.
  • • You will be expected to work inclusively with individuals from diverse backgrounds, fostering an environment where both personal and collective dignity are supported.
  • • Showcase your skills in Data Architecture & Systems Thinking, Amazon Data Integration Mastery, Analytics Modeling & Metric Design, LLM-Ready Data Enablement, Reliability, Monitoring & Ownership, and Documentation & Continuity Mindset during the interview process.
  • • This is a remote role, requiring some overlap with the client's time zone. In-person training during the first 90 days is expected.

🎯 Requirements

  • • 4-6 years of experience in data engineering, analytics engineering, or backend data infrastructure within a fast-paced or high-growth environment.
  • • Strong proficiency in building and maintaining scalable data pipelines, including ETL/ELT workflows, data modeling, and performance optimization across modern data stacks.
  • • Hands-on experience with cloud data platforms (e.g., Snowflake, BigQuery, Redshift) and orchestration tools, with a focus on ensuring reliability, observability, and efficient data flow.
  • • Ability to collaborate closely with analytics, product, and business stakeholders, translating data requirements into structured, reliable, and well-documented data solutions.
  • • Deep understanding of Amazon's SP-API, including handling its constraints, ingestion complexities, and modeling e-commerce datasets.
  • • Experience in designing and implementing semantic layers for analytics and enabling deterministic, auditable AI-driven querying.

🏖️ Benefits

  • • Competitive salary and compensation philosophy that considers market trends and individual factors.
  • • Opportunities for career-defining growth and skill expansion within a community of top talent.
  • • Remote work flexibility with some overlap required for client time zone collaboration.
  • • In-person training programs during the initial 90 days to ensure a strong foundation and integration.
  • • Guidance and support from an Operations Manager focused on professional development and delivery excellence.
  • • The chance to work directly with visionary leaders and contribute to innovative, impactful projects.

Skills & Technologies

PostgreSQL
Remote
Degree Required

Ready to Apply?

You will be redirected to an external site to apply.

AI Job Fit Analysis
Pro

See exactly how your profile matches this role — strengths, skill gaps, and what to do about them.

Oceansxyz Inc. logo
Oceansxyz Inc.
Visit Website

About Oceansxyz Inc.

Oceansxyz is a technology company focused on developing innovative solutions for ocean data management and analysis. They provide a platform that integrates diverse oceanographic data sources, enabling researchers, policymakers, and industries to access, visualize, and interpret complex environmental information. Their services aim to enhance understanding of marine ecosystems, support sustainable ocean practices, and facilitate climate change research. By leveraging advanced analytics and cloud-based infrastructure, Oceansxyz empowers users to make informed decisions regarding marine resource management, conservation efforts, and the development of blue economy initiatives. The company operates within the environmental technology and data analytics sectors.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Expired
Dallas, TX
Full-time
Expired May 18, 2026
Python
Azure
Onsite

3 months ago

Expired
Dallas, TX
Full-time
Expired May 12, 2026
Onsite

4 months ago

Expired
Warsaw
Full-time
Expired May 27, 2026
Python
Scala
Azure
+2 more

3 months ago

Expired
Argentina
Full-time
Expired Apr 25, 2026
Senior
Remote

4 months ago