We are looking for a Mid-level Data Engineer to work closely with the platform architect to build and scale a data platform from the ground up.
In this role:
· The architecture and high-level design are defined by the Lead.
· You are responsible for implementing, refining, and operating data pipelines in detail.
This is a hands-on engineering role, not analytics-only, not dashboard-focused.
Responsibilities
· Implement and maintain data pipelines end-to-end (raw → standardized → curated).
· Write production-grade Python + SQL for ETL/ELT (batch-first, incremental when needed).
· Handle schema changes, edge cases, retries, logging, and pipeline reliability.
· Optimize for correctness, performance, and cost.
· Work closely with the Lead (architecture provided) — you execute the details.
· Strong Python (clean code, data processing, error handling, testing mindset).
· Strong SQL (transformations + validation)
· Solid understanding of pipeline patterns (full vs incremental, idempotency, partitioning/batching).
· Experience with data warehouse / OLAP (e.g., ClickHouse, BigQuery, Snowflake, etc.).
· Comfortable with Linux + cloud work.
· 2 -3 years experiences in Data Engineer
· Basic English
Nice to Have
· GCP experience (GCS/BigQuery) or similar cloud. (google cloud platform)
· Orchestration tools (Prefect / Airflow).
· Parquet + data lake practices