
Data Engineer (Remote)
The Phia Group, LLC · Louisville, KY
- On site
- Full-time
- $120,000 / year
- Louisville, KY
Job highlights
- Build and optimize data pipelines with Azure Data Factory and Snowflake.
- Ensure data quality, reliability, and performance with monitoring.
- Collaborate on data needs for AI/ML initiatives.
- Develop curated datasets for analytics and reporting.
- Utilize advanced SQL and Python for data processing.
About the role
Data Engineer - The Phia Group, LLC
The Phia Group is a service-oriented organization assisting employee health plans nationwide. We provide our clients with innovative cost-cutting solutions and constantly expanding service offerings. We continue to enjoy growth thanks to our most valuable resource – our talented and committed team.
At The Phia Group, whose mission is to provide high quality yet affordable healthcare to American employees and their families, you can look forward to not only unparalleled benefits for yourself but also being immersed in a company that was named one of USA Today’s Top Workplaces for 2026. Meanwhile, from a regional perspective, both The Boston Globe and Louisville Business First also recognized our unwavering commitment to upholding an internal culture of inclusivity, enjoyment, and empathy for our valued employees by listing The Phia Group in their respective lists for the Top Places to Work in 2026.
About the Role
The Data Engineer is responsible for supporting the development, maintenance, and optimization of data pipelines and analytics-ready datasets. You will be collaborating across multiple teams and stakeholders to solve complex problems and support data-driven initiatives.
Essential Duties and Responsibilities
- Build, maintain, and optimize data pipelines utilizing Azure Data Factory, ensuring data is ingested, transformed, and delivered to Snowflake reliably for analytics.
- Implement monitoring, alerts, and testing of data pipeline performance, data quality metrics, and lineage to ensure trustworthy data delivery.
- Troubleshoot data issues and perform root cause analysis to proactively resolve operational issues.
- Document data structures, processes, architectural decisions, and best practices for knowledge sharing.
- Develop, maintain, and optimize Snowflake objects (schemas, tables, views) and SQL transformations to produce curated, analytics-ready datasets.
- Collaborate with analysts, stakeholders, and product owners to translate business needs into data requirements and stable technical implementations.
- Enable data for AI/ML use cases by preparing feature-rich datasets, supporting feature engineering, and ensuring data consistency for model training and inference.
- Support deployment and operationalization of machine learning models by integrating pipelines with ML workflows (e.g., batch/real-time scoring).
- Continually improve ongoing reporting and analytics, automating or simplifying self-service or manual processes.
- Implement version control practices for all data engineering code and documentation.
Experience and Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, Information Technology, or a related field; or equivalent experience.
- 5+ years of experience in data engineering or business intelligence roles working with ETL, data modeling, data architecture, and developing pipelines and applications for analytics (e.g., BI, reporting, machine learning, deep learning).
- Solid programming skills in advanced SQL, Python, or other programming languages for data processing and automation.
Experience Supporting or Working with AI/ML Workflows, Including
- Data preparation and feature engineering for machine learning models.
- Integration of data pipelines with ML frameworks (e.g., scikit-learn, TensorFlow, PyTorch, or similar).
- Understanding of model lifecycle concepts (training, validation, deployment, monitoring).
- Expertise working with Snowflake for data warehousing, including experience with schema design, performance tuning, and optimization.
- Proficiency with Git, Azure DevOps, and collaborative development best practices.
- Experience designing, developing, and deploying end-to-end pipelines using Azure Data Factory.
Working Conditions / Physical Demands
Sitting at workstation for prolong periods of time. Extensive computer work. Workstation may be exposed to overhead fluorescent lighting and air conditioning. Fast paced work environment. Operates office equipment including personal computer, copiers, and fax machines.
This job description is not intended to be and should not be construed as an all-inclusive list of all the responsibilities, skills or working conditions associated with the position. While it is intended to accurately reflect the position activities and requirements, the company reserves the right to modify, add or remove duties and assign other duties as necessary.
External and internal applicants, as well as position incumbents who become disabled as defined under the Americans with Disabilities Act, must be able to perform the essential job functions (as listed here) either unaided or with the assistance of a reasonable accommodation to be determined by management on a case by case basis.
Key skills/competency
- Data Engineer
- Azure Data Factory
- Snowflake
- SQL
- Python
- ETL
- Data Modeling
- Data Architecture
- AI/ML
- DevOps
Skills & topics
- Data Engineer
- Azure Data Factory
- Snowflake
- SQL
- Python
- ETL
- Data Modeling
- Data Architecture
- AI/ML
- Cloud Data Warehouse
- Data Pipelines
- Business Intelligence
- Data Quality
- Remote
- The Phia Group
How to get hired
- Customize your resume: Highlight your experience with Azure Data Factory, Snowflake, SQL, and Python, tailoring it to the Data Engineer role at The Phia Group.
- Showcase AI/ML experience: Emphasize your involvement in data preparation, feature engineering, and ML model integration as detailed in the job description.
- Quantify achievements: Use numbers to demonstrate the impact of your data pipeline optimization and data quality improvements.
- Prepare for technical questions: Be ready to discuss your experience with data warehousing, ETL processes, and cloud data platforms like Azure.
- Research company culture: Understand The Phia Group's commitment to innovation, employee well-being, and their recognition as a Top Workplace.
Technical preparation
Behavioral questions
Frequently asked questions
- What are the key technical skills required for a Data Engineer at The Phia Group?
- The Data Engineer role at The Phia Group requires strong programming skills in advanced SQL and Python, proficiency with Azure Data Factory for pipeline development, and expertise with Snowflake for data warehousing. Experience with data modeling, ETL processes, and supporting AI/ML workflows is also crucial.
- How does The Phia Group support employee growth and development for Data Engineers?
- The Phia Group emphasizes a commitment to its talented team and offers unparalleled benefits. While specific development programs aren't detailed, the company's recognition as a Top Workplace suggests a supportive environment for professional growth within data engineering and related analytics fields.
- What is the typical career progression for a Data Engineer at The Phia Group?
- While specific career paths are not outlined, a Data Engineer at The Phia Group can expect to advance through deepening expertise in data pipeline optimization, AI/ML enablement, and complex data warehousing solutions. Opportunities may arise to lead data initiatives or mentor junior team members.
- Can I work remotely as a Data Engineer for The Phia Group?
- Yes, the Data Engineer position at The Phia Group is listed as a remote role, offering flexibility for candidates to work from their preferred location.
- What is the company culture like at The Phia Group for a Data Engineer?
- The Phia Group fosters a culture of inclusivity, enjoyment, and empathy, and has been recognized as a Top Workplace by several publications. They are dedicated to providing innovative solutions and value their team members as their most important resource.
- What are the main responsibilities of a Data Engineer at The Phia Group?
- The primary responsibilities include building, maintaining, and optimizing data pipelines using Azure Data Factory and Snowflake, implementing monitoring and alerting for data quality, troubleshooting data issues, and developing analytics-ready datasets to support business initiatives and AI/ML use cases.
- What kind of AI/ML experience is The Phia Group looking for in a Data Engineer?
- The Phia Group seeks Data Engineers experienced in data preparation and feature engineering for machine learning models, integrating data pipelines with ML frameworks, and understanding the model lifecycle. This includes ensuring data consistency for model training and inference.