Lead Data Engineer
About the Role
You will design, build, and operate the end-to-end data architecture for trading systems. You will ingest and transform onchain events and user activity, serve real-time analytics, implement ETL/ELT pipelines, establish data quality and monitoring, optimize performance and cost, and provide technical leadership and mentorship to engineers.
Requirements
- 7+ years of experience in data engineering, with at least 2 years in a technical leadership role
- Strong proficiency in Python and SQL for building production data pipelines and complex data transformations
- Experience designing, building, and maintaining cloud-based data pipelines using orchestration tools such as Airflow, Dagster, Prefect, or Temporal
- Hands-on experience with cloud data platforms (AWS, GCP, or Azure) and modern data stack tools
- Deep understanding of data warehousing concepts and experience with platforms like Snowflake, BigQuery, or Redshift
- Strong software engineering fundamentals including testing, CI/CD, version control, and writing maintainable, documented code
- Track record of optimizing data systems for performance, reliability, and cost efficiency at scale
- Excellent communication skills and ability to collaborate with cross-functional teams
- Familiarity with DeFi, trading platforms, or financial systems and concepts like liquidity, orderbooks, and market dynamics (highly desirable)
- Experience working with blockchain data and EVM-compatible chains (highly desirable)
- Experience with dbt for large-scale data systems (highly desirable)
- Experience with Dune Analytics for querying and visualizing blockchain data (highly desirable)
- Experience with streaming data and event-driven architectures using tools like Kafka, Kinesis, or Flink (highly desirable)
- Knowledge of GraphQL APIs and how to build data systems that power them (highly desirable)
Responsibilities
- Design and build scalable data pipelines to ingest, process, and transform blockchain data, trading events, user activity, and market signals at high volume and low latency
- Architect and maintain data infrastructure that powers real-time trading analytics, P&L calculations, leaderboards, market cap tracking, and liquidity monitoring
- Own ETL/ELT processes that transform raw onchain data from multiple blockchains into clean, reliable, and performant datasets
- Build and optimize data models and schemas that support operational systems and analytical use cases
- Establish data quality frameworks including monitoring, alerting, testing, and validation to ensure pipeline reliability and data accuracy
- Collaborate with backend engineers to design event schemas, data contracts, and APIs for real-time data flow
- Partner with product and analytics teams to translate data needs into engineering solutions
- Provide technical leadership by mentoring engineers, conducting code reviews, establishing best practices, and driving architectural decisions
- Optimize performance and costs of data infrastructure as trading volumes scale
Benefits
- Remote-First Culture: work from anywhere
- Pre-IPO stock options
- Token compensation
- Robust healthcare including fully covered medical, dental, and vision for employees
- Retirement contributions with up to 4% employer 401(k) match
- Health & wellness memberships (One Medical, Teladoc, Health Advocate)
- Unlimited time off and flexible vacation policies
- Home office reimbursement including internet and cell phone
- Ease of life reimbursement for expenses such as childcare, rides, or meal delivery
- Career development access including mentorship and training
- Inclusive and diverse work environment
