
Google cloud data fusion consultant
Thrive IT Systems · Hyderabad, Telangana, India
- Hybrid
- Part-time
- $120,000 / year
- Hyderabad, Telangana, India
Job highlights
- Design and develop data pipelines using Google Cloud Data Fusion.
- Build and support batch and real-time data streams.
- Integrate diverse data sources into Google Cloud Platform.
- Monitor, troubleshoot, and optimize pipeline performance.
- Implement data quality, governance, and automated testing.
About the role
Google Cloud Data Fusion Engineer
Thrive IT Systems is seeking a skilled Google Cloud Data Fusion Engineer with 5+ years of experience to join our team on a contract basis. This is a remote position.
Role Summary
As a Google Cloud Data Fusion Engineer, you will be responsible for designing, developing, and optimizing data integration pipelines on Google Cloud. You will build scalable ETL/ELT workflows, integrate diverse data sources, and enable a high-quality data platform and analytics in a cloud environment. You will work closely with data architects, analysts, and application teams to support enterprise-wide data initiatives.
Key Responsibilities
- Design and develop ETL/ELT data pipelines in Google Cloud Data Fusion.
- Build reusable pipeline templates and orchestration patterns.
- Support both batch and real-time streaming pipelines.
- Integrate data from on-premises, third-party, and cloud sources into GCP.
- Monitor pipeline performance and troubleshoot failures.
- Manage schema evolution, data quality checks, and validation logic.
- Implement data quality frameworks and automated testing.
- Configure and optimize Apache Spark jobs via Data Fusion.
- Apply data governance standards and best practices.
- Enable auditing and lineage capture using Data Fusion metadata.
- Implement scheduling, monitoring, and alerting.
- Integrate with workflow tools like Cloud Composer.
- Work with analysts to understand data requirements.
Required Skills
- 5+ years in data engineering or ETL development.
- 2+ years of Hands-on experience with Google Cloud Data Fusion.
- Strong SQL and relational database expertise.
- Experience with Apache Spark (SQL, dataframes, performance tuning).
- Working knowledge of BigQuery, Cloud Storage.
- Data modelling and metadata management.
- Version control (Git), CI/CD pipelines for data workflows.
Preferred Qualifications
- GCP certifications (e.g., Professional Data Engineer).
- IBM Datastage ETL tool knowledge.
Key skills/competency
- Google Cloud Data Fusion
- Data Engineering
- ETL/ELT Development
- SQL
- Apache Spark
- BigQuery
- Cloud Storage
- Data Modeling
- Metadata Management
- CI/CD
Skills & topics
- Google Cloud Data Fusion
- Data Engineering
- ETL
- ELT
- Data Pipelines
- SQL
- Apache Spark
- BigQuery
- Cloud Storage
- GCP
- Remote
- Contract
How to get hired
- Tailor your resume: Highlight your Google Cloud Data Fusion experience and SQL skills.
- Showcase your portfolio: Demonstrate past data pipeline projects with quantifiable results.
- Prepare for technical questions: Be ready to discuss ETL/ELT concepts and Spark optimization.
- Understand GCP: Familiarize yourself with BigQuery and Cloud Storage best practices.
Technical preparation
Master Google Cloud Data Fusion capabilities.,Practice SQL queries and data modeling.,Understand Spark concepts and tuning.,Review BigQuery and Cloud Storage basics.
Behavioral questions
Describe a complex data pipeline challenge.,How do you ensure data quality?,How do you collaborate with analysts?,Tell me about optimizing Spark performance.
Frequently asked questions
- What are the key responsibilities for a Google Cloud Data Fusion Engineer at Thrive IT Systems?
- The key responsibilities include designing and developing ETL/ELT data pipelines in Google Cloud Data Fusion, supporting batch and real-time streams, integrating diverse data sources into GCP, monitoring pipeline performance, managing data quality, and implementing data governance standards.
- What level of experience is required for the Google Cloud Data Fusion Engineer role?
- We require 5+ years in data engineering or ETL development, with at least 2 years of hands-on experience specifically with Google Cloud Data Fusion.
- Is this a remote position for the Google Cloud Data Fusion Engineer?
- Yes, this Google Cloud Data Fusion Engineer position is fully remote.
- What are the essential technical skills for this Google Cloud Data Fusion Engineer role?
- Essential technical skills include strong SQL, relational database expertise, experience with Apache Spark, working knowledge of BigQuery and Cloud Storage, data modeling, metadata management, and version control (Git) with CI/CD pipelines.
- Are there any preferred qualifications for the Google Cloud Data Fusion Engineer position?
- Preferred qualifications include GCP certifications, such as the Professional Data Engineer certification, and knowledge of the IBM Datastage ETL tool.
- How can I best prepare my application for the Google Cloud Data Fusion Engineer role at Thrive IT Systems?
- To best prepare your application for the Google Cloud Data Fusion Engineer role, ensure your resume clearly outlines your experience with Google Cloud Data Fusion, your SQL proficiency, and any relevant GCP certifications. Highlight projects where you've built and optimized data pipelines.
- What kind of data sources will I be integrating as a Google Cloud Data Fusion Engineer?
- As a Google Cloud Data Fusion Engineer, you will integrate data from various sources including on-premises systems, third-party applications, and other cloud-based services into the Google Cloud Platform.
- Will I be working with real-time data streams in this role?
- Yes, this Google Cloud Data Fusion Engineer role involves supporting both batch and real-time streaming pipelines.