Google Cloud Data Fusion Engineer
Thrive IT Systems
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
Google Cloud Data Fusion Engineer
Thrive IT Systems is seeking a skilled Google Cloud Data Fusion Engineer with 5+ years of experience to join our team on a contract basis. This is a remote position.
Role Summary
As a Google Cloud Data Fusion Engineer, you will be responsible for designing, developing, and optimizing data integration pipelines on Google Cloud. You will build scalable ETL/ELT workflows, integrate diverse data sources, and enable a high-quality data platform and analytics in a cloud environment. You will work closely with data architects, analysts, and application teams to support enterprise-wide data initiatives.
Key Responsibilities
- Design and develop ETL/ELT data pipelines in Google Cloud Data Fusion.
- Build reusable pipeline templates and orchestration patterns.
- Support both batch and real-time streaming pipelines.
- Integrate data from on-premises, third-party, and cloud sources into GCP.
- Monitor pipeline performance and troubleshoot failures.
- Manage schema evolution, data quality checks, and validation logic.
- Implement data quality frameworks and automated testing.
- Configure and optimize Apache Spark jobs via Data Fusion.
- Apply data governance standards and best practices.
- Enable auditing and lineage capture using Data Fusion metadata.
- Implement scheduling, monitoring, and alerting.
- Integrate with workflow tools like Cloud Composer.
- Work with analysts to understand data requirements.
Required Skills
- 5+ years in data engineering or ETL development.
- 2+ years of Hands-on experience with Google Cloud Data Fusion.
- Strong SQL and relational database expertise.
- Experience with Apache Spark (SQL, dataframes, performance tuning).
- Working knowledge of BigQuery, Cloud Storage.
- Data modelling and metadata management.
- Version control (Git), CI/CD pipelines for data workflows.
Preferred Qualifications
- GCP certifications (e.g., Professional Data Engineer).
- IBM Datastage ETL tool knowledge.
Key skills/competency
- Google Cloud Data Fusion
- Data Engineering
- ETL/ELT Development
- SQL
- Apache Spark
- BigQuery
- Cloud Storage
- Data Modeling
- Metadata Management
- CI/CD
How to Get Hired at Thrive IT Systems
- Tailor your resume: Highlight your Google Cloud Data Fusion experience and SQL skills.
- Showcase your portfolio: Demonstrate past data pipeline projects with quantifiable results.
- Prepare for technical questions: Be ready to discuss ETL/ELT concepts and Spark optimization.
- Understand GCP: Familiarize yourself with BigQuery and Cloud Storage best practices.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background