PitchMeAI
Calix

Staff Data Engineer – Cloud Data Platform

Calix · Bengaluru, Karnataka, India

  • On site
  • Full-time
  • $150,000 / year
  • Bengaluru, Karnataka, India

Job highlights

  • Design and implement cloud data platforms.
  • Develop scalable data ingestion and transformation pipelines.
  • Provide technical leadership in data engineering.
  • Collaborate with product owners on requirements.
  • Troubleshoot and resolve complex data issues.

About the role

Staff Data Engineer Cloud Data Platform

Calix is a leader in transforming service providers to offer enhanced subscriber experiences for Smart Homes and Businesses. We monetize networks through Role-based Cloud Services, Telemetry, Analytics, Automation, and Software-Driven Adaptive Networks.

About the Role

As a member of a high-performing global team, you will be a Calix Cloud Data Engineer, significantly contributing to architecture design, implementation, and technical leadership in data ingestion, extraction, transformation, and analytics.

Responsibilities and Duties

  • Collaborate with Cloud product owners to understand and analyze product requirements, providing valuable feedback.
  • Develop conceptual, logical, and physical data models, along with metadata solutions.
  • Design and manage data deliverables including data models, diagrams, data flows, and data dictionary documentation.
  • Determine database structural requirements by analyzing client operations, applications, and data from existing systems.
  • Provide technical leadership in software design, ensuring service stability, reliability, scalability, and security.
  • Guide technical discussions within the engineering group and make informed technical recommendations.
  • Conduct design and code reviews with peer engineers.
  • Oversee the testing architecture for large-scale data ingestion and transformations.
  • Act as a customer-facing engineer, debugging and resolving field issues.

Qualifications

  • 10-12 years of software engineering experience delivering quality products.
  • 10+ years of development experience in Data modeling, master data management, and building ETL/data pipelines.
  • Cloud Platforms: Proficiency in Google Cloud Platform (GCP) services (BigQuery, Dataflow, Dataproc, PubSub/Kafka, Cloud Storage) and AWS services (Redshift, Glue, Kinesis, S3).
  • Data Pipelines: Proven experience designing, building, and maintaining scalable data pipelines across GCP and AWS.
  • Big Data Technologies: Knowledge of big data processing frameworks like Apache Spark, Flink, and Beam, particularly with Dataproc, EKS, and AWS EMR.
  • Data Transformation: Proficient in using dbt/Dataform for data transformation and modeling within the data warehouse.
  • Programming Languages: Strong SQL skills and proficiency in Python, Java, or Scala.
  • File Formats: Proficient with open file formats like Apache Hudi and Apache Iceberg.
  • Data Visualization: Experience with BI tools (Google Data Studio, Looker, ThoughtSpot) and BigQuery BI Engine.
  • Containerization: Understanding of Docker and Kubernetes for data application deployment.
  • Data Governance: Knowledge of data catalog tools (DataHub, Collibra, Alation) for metadata management, data quality, lineage, and tagging is a plus.
  • Problem Solving: Strong analytical and troubleshooting skills for complex data scenarios.
  • Collaboration: Ability to work effectively in a team and with cross-functional teams.
  • Communication: Proficient in explaining complex technical concepts to stakeholders.
  • Data Governance & Security: Knowledge of data governance, security best practices, and compliance in GCP and AWS.
  • Education: Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Certifications: Relevant certifications (e.g., Google Cloud Professional Data Engineer, AWS Certified Data Analytics – Specialty) are a plus.
  • Travel: This role may require travel for meetings and events.

Key skills/competency

  • Data Engineering
  • Cloud Data Platforms (GCP, AWS)
  • Data Modeling
  • ETL/Data Pipelines
  • Big Data Technologies (Spark, Flink, Beam)
  • SQL
  • Python
  • dbt/Dataform
  • Data Governance
  • Problem Solving

Skills & topics

  • Data Engineer
  • Cloud Data Platform
  • GCP
  • AWS
  • BigQuery
  • Redshift
  • Data Modeling
  • ETL
  • Data Pipelines
  • Spark
  • Python
  • SQL
  • dbt
  • Data Governance
  • Software Engineering
  • Bangalore
  • India

How to get hired

  • Tailor your resume: Highlight experience with GCP/AWS data services, data modeling, ETL, and big data technologies.
  • Showcase cloud expertise: Emphasize your proficiency in services like BigQuery, Redshift, Spark, and Python for data engineering.
  • Demonstrate leadership: Detail your experience in guiding technical discussions, design reviews, and code reviews.
  • Prepare for technical questions: Be ready to discuss data pipeline design, data governance, and troubleshooting complex data scenarios.
  • Understand Calix's mission: Research how Calix is transforming service providers with cloud data solutions.

Technical preparation

Master GCP and AWS data services deeply.,Build complex ETL pipelines end-to-end.,Practice SQL and Python for data manipulation.,Understand Spark, Flink, and Beam frameworks.

Behavioral questions

Describe a complex data problem you solved.,How do you handle conflicting product requirements?,Explain your technical leadership approach.,How do you ensure data quality and governance?

Frequently asked questions

What are the key cloud platforms for the Staff Data Engineer role at Calix?
The Staff Data Engineer role at Calix requires proficiency in both Google Cloud Platform (GCP) services like BigQuery, Dataflow, and Dataproc, and AWS services such as Redshift, Glue, and Kinesis.
What experience is needed in data pipelines for this Staff Data Engineer position?
This role demands proven experience in designing, building, and maintaining scalable data pipelines across both GCP and AWS environments.
How important are big data technologies for the Staff Data Engineer at Calix?
Knowledge of big data processing frameworks such as Apache Spark, Flink, and Beam, especially in conjunction with GCP and AWS big data services, is crucial for this position.
What programming languages are essential for the Staff Data Engineer role?
Strong SQL skills are a must, along with proficiency in at least one programming language like Python, Java, or Scala.
Does the Staff Data Engineer role involve data governance responsibilities at Calix?
Yes, knowledge of data governance principles and tools like DataHub, Collibra, or Alation for metadata management, data quality, and lineage is highly beneficial for this role.
What level of experience is required for the Staff Data Engineer position?
The role requires 10-12 years of software engineering experience, with at least 10 years focused on data modeling, master data management, and building ETL/data pipelines.
Is this a remote or on-site position for the Staff Data Engineer in Bangalore?
This position is based in Bangalore, India, and follows a flexible hybrid work model, requiring approximately 20 days in the Bangalore office per quarter.
What is the expected educational background for the Staff Data Engineer role?
A Bachelor’s degree in Computer Science, Information Technology, or a related field is required for this Staff Data Engineer position.