
Job Overview
Location
New York, New York
Job Type
Full-time
Category
Data Science
Date Posted
June 3, 2026
Full Job Description
đź“‹ Description
- • Build and maintain ETL pipelines to extract, transform, and load large-scale datasets from APIs, databases, and third-party platforms for real-time analytics and reporting.
- • Automate data preprocessing workflows including cleaning, normalization, and validation of client account data using rule-based logic and statistical checks to ensure high data quality.
- • Prepare analysis-ready datasets for modeling and reporting by structuring raw data into consistent, reliable formats suitable for downstream machine learning and analytical applications.
- • Collaborate with security and engineering teams to align data pipelines with the needs of smart contract audits, on-chain monitoring, and incident response workflows.
- • Monitor data pipeline performance and reliability, identifying and resolving bottlenecks, data inconsistencies, or failures in real-time data flows.
- • Implement scalable data ingestion solutions capable of handling high-volume, high-velocity blockchain and Web3 data sources.
- • Document data lineage, transformation rules, and validation criteria to ensure transparency and reproducibility across team workflows.
- • Work with cross-functional teams to translate business requirements into technical data specifications for client-facing analytics and internal security intelligence tools.
- • Ensure compliance with data governance standards when processing sensitive blockchain transaction data and client information.
- • Contribute to the development of AI-powered security tools by providing clean, structured, and validated datasets that support model training and inference.
- • Stay current with emerging data engineering practices and blockchain data formats to continuously improve pipeline efficiency and accuracy.
- • Participate in code reviews and peer collaboration to maintain high standards in data pipeline architecture and implementation.
🎯 Requirements
- • Proven experience building and maintaining ETL pipelines for large-scale datasets from APIs, databases, and third-party platforms
- • Strong proficiency in data preprocessing techniques including cleaning, normalization, and validation using rule-based logic and statistical methods
- • Experience working with blockchain or Web3 data sources and understanding of on-chain transaction structures
- • Proficiency in programming languages such as Python or SQL for data manipulation and automation
- • Ability to work with real-time data flows and ensure high reliability and scalability of data systems
- • Experience collaborating with security, engineering, or analytics teams in a fast-paced technical environment
🏖️ Benefits
- • Competitive salary and performance-based bonuses
- • Comprehensive health, dental, and vision insurance
- • Unlimited paid time off and flexible work hours
- • Remote work options with support for global team collaboration
- • Access to cutting-edge Web3 security tools and research resources
- • Professional development stipend for conferences, courses, and certifications
Skills & Technologies
About CertiK, Inc.
CertiK is a blockchain security firm that performs formal verification audits of smart contracts and decentralized protocols. Its offerings include static analysis, penetration testing, on-chain monitoring via the Skynet platform, KYC verification and incident response. Founded in 2018 by Yale and Columbia professors, the company secures DeFi, NFT, layer-1 and bridge projects, identifying vulnerabilities before deployment and providing real-time threat detection after launch.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Highmark Health
3 months ago

FundraiseUp Inc.
3 months ago

Forward Financing, LLC
2 months ago
