We are seeking a highly skilled Lead Data Engineer to design, build, and manage the data foundation for a large-scale digital transformation program. The role will focus on implementing an Azure-based Integrated Data Platform (Fabric OneLake, Purview, CluedIn MDM, Synapse, and event-driven ingestion with Kafka/webMethods).
The Lead Data Engineer will work closely with the Solution Architect, Data Governance team, and application squads, ensuring that data ingestion, pipelines, and curated layers (Bronze/Silver/Gold) are reliable, secure, and aligned with business requirements. This is a hands-on leadership role, requiring both technical implementation and mentoring of a data engineering team.
Key Responsibilities
- Architecture & DesignDesign and implement data pipelines and workflows for ingestion from digital services, legacy systems, IoT devices, and external partners.
- Define and enforce the medallion architecture (Bronze, Silver, Gold) in Fabric OneLake.
- Establish data partitioning, retention, and performance tuning strategies.
- Integrate pipelines with Purview for lineage and data classification.
- Work with the governance team to align with MDM (CluedIn) golden record models.
- Implementation & DeliveryBuild and maintain data pipelines using Fabric Dataflows, Data Factory, Notebooks, and Synapse.
- Implement streaming ingestion from Kafka and webMethods into Fabric (Bronze).
- Develop ETL/ELT transformations (deduplication, cleansing, standardization).
- Enable curated Gold models for analytics and Power BI dashboards.
- Automate data quality checks (completeness, accuracy, consistency) with alerting.
- Publish semantic models for BI/AI consumption.
- Leadership & CollaborationLead and mentor a team of data engineers and pipeline developers.
- Work with data scientists to ensure features are sourced from trusted data.
- Collaborate with application teams to ensure transactional DBs produce usable events and datasets.
- Support the Service Delivery Manager with progress reporting, risks, and mitigations.
- Operations & OptimizationEnsure pipelines meet freshness SLAs (daily batch, near real-time where required).
- Optimize storage and compute costs in Fabric/OneLake.
- Establish runbooks for pipeline failures, retries, and data reconciliation.
- Onboard pipelines into the Operations Management Hub (OMH) for monitoring.
- Technical Skills RequiredStrong experience with Azure Data Services:
Fabric (Data Pipelines, Lakehouse, Dataflows, Notebooks)
OneLake (Delta tables, medallion layers)
Synapse (SQL, dedicated pools)
Purview (catalog, lineage, sensitivity classification)
CluedIn (MDM golden record integration)
- Event streaming ingestion (Kafka, Event Hubs) and batch ingestion (Data Factory, REST, SFTP).
- Strong SQL skills for data modeling and optimization.
- Familiarity with Power BI datasets & semantic models.
- Data quality frameworks (Great Expectations or similar).
- Experience with DevOps for Data (CI/CD for pipelines, IaC with Terraform/Bicep).
- Security-first mindset: PII masking, encryption, RBAC in Fabric and Purview.
Soft Skills & Leadership Competencies
- Strong leadership and mentoring abilities for junior data engineers.
- Problem-solving mindset for data quality and pipeline reliability issues.
- Excellent communication skills, able to collaborate with architects, data scientists, and business users.
- Analytical thinker, comfortable with complex data integration scenarios.
- Adaptable, able to balance hands-on coding with oversight responsibilities.
Qualifications
- 8+ years in data engineering, with at least 3 years in a lead role.
- Proven experience with Azure-based data platforms.
- Bachelor’s or Master’s in Computer Science, Data Engineering, or related field.
- Certifications preferred:
Microsoft Certified: Azure Data Engineer Associate
Microsoft Certified: Fabric Analytics Engineer Associate
Any MDM or Data Governance certifications (nice to have).
Important Notes
- When applying, please send the following to admin@fivectech.com (Title your email with Lead data engineer applicant) :
Up-to-date resume
Notice period (immediate joiners strongly preferred)
Expected salary
Prior experience delivering Azure-based data platforms at enterprise scale is mandatory.