Where Work Meets Adventure
Location: Hybrid (4 days onsite – Downtown Toronto)
Type: Full-Time | Start-up
Join our team as a Data Platform Software Lead Engineer and drive the architecture that fuels our AI and ML pipelines, focusing on large-scale, code-based text datasets. You'll design and build reliable systems for data ingestion, transformation, and delivery—enabling teams to train, iterate, and deploy models with confidence.
Architect and implement scalable data platforms for code/text dataset ingestion, processing, and delivery.
Build web-scale crawling and metadata extraction tools from open-source code repositories.
Develop reliable, distributed pipelines with frameworks like Spark, Kafka, and Airflow/Prefect .
Enable data visualization, sampling, and analytics for re...