Join us as we build new cross-functional teams (Data Engineering, DevOps, Scrum) from the ground up on a greenfield project. Work with modern technologies like GCP (BigQuery, Dataflow, Dataproc), Python, and Spark, and help shape scalable data platforms from day one.
You’ll collaborate with experienced experts, grow your cloud skills, and have the opportunity to explore and learn AI/ML as part of your journey. This is your chance to make an impact while accelerating your career.
WHAT WILL YOU DO?
- Design, build, and maintain scalable data solutions using GCP services such as BigQuery, Dataflow, Cloud Storage, Cloud SQL, Dataproc, and Dataform.
- Develop and optimize ETL/ELT pipelines, including data transformation, integration, and loading workflows.
- Apply advanced SQL and Python (PySpark, Pandas, NumPy) to process, analyze, and transform large datasets.
- Work with distributed data processing frameworks like Spark, leveraging data parallelism and scalable architectures.
- Implement security best practices, including Git-based CI/CD pipelines, role-based access control, encryption, and IAM policies.
- Monitor and improve system performance by identifying bottlenecks and applying optimization strategies.
- Contribute to data modeling and warehousing solutions, including dimensional modeling and schema design (star/snowflake).
- Support the design of scalable, secure, and efficient cloud data architectures.