Senior Data Engineer
CDC Foundation
Job Description
Senior Data Engineer at CDC Foundation
The Senior Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization. This role is aligned to the Workforce Acceleration Initiative (WAI), a federally funded CDC Foundation program aimed at supporting the nation’s public health agencies with technology and data experts.
Working within the Prince George’s County Health Department (PGCHD), the Senior Data Engineer serves as a senior technical contributor supporting the modernization and operation of PGCHD’s enterprise data platform. The work focuses on designing and implementing data engineering solutions, building and maintaining data pipelines, supporting data ingestion and transformation, and enabling analytics and reporting capabilities across public health programs. This role collaborates closely with IT, data analysts, and program stakeholders to ensure data is reliable, accessible, and aligned with public health business needs.
The Senior Data Engineer will be hired by the CDC Foundation and assigned to the Prince George’s County Health Department. This position is eligible for a fully remote work arrangement for U.S.-based candidates.
Responsibilities
Azure Synapse & Medallion Architecture
- Design and implement Azure Synapse Analytics solutions using Spark pools, dedicated and serverless SQL pools.
- Implement and operationalize Medallion Architecture (Bronze, Silver, Gold) layers within Azure Data Lake Storage Gen2 (ADLS).
- Develop reusable, parameter-driven data pipelines leveraging Synapse pipelines and Azure Data Factory–style orchestration.
Data Ingestion, Transformation & Modeling
- Ingest and integrate data from diverse internal and external public health sources (clinical, operational, surveillance, census, and partner data).
- Build Spark-based transformations and SQL-based data models to cleanse, standardize, and enrich data.
- Design and maintain dimensional and analytical data models optimized for reporting, dashboards, and advanced analytics.
Performance, Reliability & Cost Optimization
- Monitor, troubleshoot, and tune Synapse workloads for performance, scalability, and cost efficiency.
- Identify and resolve data pipeline failures, data quality issues, and processing bottlenecks.
- Implement logging, monitoring, and alerting for production-grade data pipelines.
Data Governance, Security & Compliance
- Implement data governance, metadata management, and lineage using Microsoft Purview.
- Support compliance with public health data security, privacy, and regulatory requirements (HIPAA, CDC guidance, and county policies).
- Apply role-based access control (RBAC) and data protection best practices across Azure resources.
Collaboration & Stakeholder Engagement
- Partner with public health program leaders, analysts, epidemiologists, and informatics teams to translate defined business needs into technical solutions.
- Ensure data products are analytics-ready and aligned with reporting, performance management, and decision-support needs.
- Communicate technical concepts, progress, and recommendations clearly to both technical and non-technical stakeholders.
Engineering Standards & Knowledge Transfer
- Apply data engineering best practices including source control, CI/CD, automated testing, documentation, and code reviews.
- Provide technical guidance related to data engineering implementations and best practices.
- Support documentation and knowledge sharing to promote sustainability of data solutions.
- Stay current on Azure data platform capabilities and data engineering trends relevant to public health use cases.
Required Qualifications
- Bachelor’s degree in Computer Science, Information Technology, Data Science, or a related field.
- Minimum 5 years of professional experience in data engineering, analytics engineering, or data warehousing roles.
- Hands-on experience with Microsoft Azure data services, particularly Azure Synapse Analytics and ADLS Gen2.
- Strong proficiency in SQL and at least one programming language such as Python, Scala, or Java.
- Experience developing Spark-based data transformations and scalable ETL/ELT pipelines.
- Solid understanding of data warehousing concepts, dimensional modeling, and analytics-oriented data design.
- Experience implementing engineering best practices such as source control, CI/CD pipelines, automated testing, and peer review.
- Familiarity with agile development methodologies and modern software design patterns.
- Strong analytical, troubleshooting, and problem-solving skills.
- Excellent written and verbal communication skills, with the ability to explain technical concepts to non-technical audiences.
- Experience collaborating with distributed and remote teams.
- Up to 10% domestic travel may be required.
Preferred Qualifications
- Experience working with public health, healthcare, or government data environments.
- Familiarity with Microsoft Purview for data governance and lineage.
- Experience supporting Power BI or other analytics and visualization tools.
- Knowledge of HIPAA, public health data standards, and regulatory compliance.
- Prior experience designing enterprise-scale Azure data platforms using Medallion or Lakehouse architectures.
Job Highlights
- Location: Remote; must be based in the United States. Up to 10% domestic travel may be required.
- Salary Range: $103,500-$143,500 per year, plus benefits. Individual salary offers will be based on experience and qualifications unique to each candidate.
- Position Type: Grant-funded, limited-term opportunity
- Position End Date: June 30, 2027
Key skills/competencies
- Azure Synapse Analytics
- Azure Data Lake Storage Gen2 (ADLS Gen2)
- Spark
- SQL
- Python
- ETL/ELT Pipelines
- Dimensional Modeling
- Data Governance
- HIPAA Compliance
- CI/CD
How to Get Hired at CDC Foundation
- Research CDC Foundation's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume for data engineering: Highlight Azure Synapse, ADLS Gen2, Spark, SQL, and ETL/ELT experience.
- Showcase public health data experience: Emphasize any familiarity with clinical, surveillance, or government data and HIPAA.
- Prepare for technical Azure Synapse questions: Focus on Spark pools, dedicated/serverless SQL pools, and Medallion Architecture.
- Demonstrate communication and collaboration: Be ready to discuss translating technical concepts for non-technical public health stakeholders.