At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop robust data solutions for clients. They play a crucial role in transforming raw data into actionable insights, enabling informed decision-making and driving business growth.
In data engineering at PwC, you will focus on designing and building data infrastructure and systems to enable efficient data processing and analysis. You will be responsible for developing and implementing data pipelines, data integration, and data transformation solutions.
Key Responsibilities:
- Design and implement scalable and secure data pipelines using Microsoft Fabric components (Data Factory in Fabric, Dataflows Gen2, Eventstreams, and Synapse Data Engineering).
- Build and manage lakehouses and warehouses within OneLake, ensuring performance, scalability, and data governance compliance.
- Collaborate with business and analytics teams to gather data requirements and translate them into reliable datasets for reporting and advanced analytics.
- Develop and optimize notebooks (PySpark) for data transformation and machine learning workflows within Microsoft Fabric.
- Support Power BI semantic models and implement real-time or near-real-time analytics.
- Ensure data quality, lineage, and observability by leveraging Microsoft Purview or Fabric’s native monitoring capabilities.