PitchMeAI
Saaf Finance

Senior Data Engineer

Saaf Finance · India

This listing has closed — view similar roles below.

  • Hybrid
  • Full-time
  • $150,000 / year
  • India

Job highlights

  • Build AI-powered data infrastructure for mortgage lending.
  • Develop and maintain scalable ETL/ELT data pipelines.
  • Design and optimize data warehouses and data models.
  • Integrate data from various internal and external sources.
  • Ensure data quality, security, and compliance.

About the role

About This Role

Saaf AI is building the future of mortgage lending by combining cutting-edge AI with robust data infrastructure. As part of a top-10 private lender processing billions in loan volume, backed by leading asset managers and funds, we are growing fast — and data and AI are at the center of everything we build.

We don’t just experiment with AI — we integrate it deeply into how we operate. Our systems rely on scalable data pipelines, structured data models, and real-time workflows that power underwriting, document processing, and borrower interactions. AI is embedded across these layers, from data extraction and validation to intelligent automation.

If you’re excited about building high-quality data systems in an AI-native environment — where data pipelines, automation, and intelligent workflows come together — you’ll fit right in.

Key Responsibilities

Data Pipeline Development

  • Design, implement, and maintain ETL/ELT pipelines for structured and unstructured datasets from internal and external sources.
  • Leverage AI-assisted development tools to accelerate pipeline authoring, generate transformation logic, and automate boilerplate code.

Data Warehousing & Modeling

  • Build and optimize data warehouses and marts (Snowflake, BigQuery, or similar) for analytics, reporting, and product use cases.
  • Design, implement, and maintain conceptual, logical, and physical data models to ensure scalable, consistent, and high-quality datasets for downstream analytics and applications.

Integration & Ingestion

  • Ingest data from APIs, SaaS platforms (CRM, financial data APIs), and internal systems into the core data platform.
  • Build and maintain reliable connectors and ingestion frameworks that handle schema evolution, rate limits, and error recovery.

Data Quality & Governance

  • Implement validation, schema management, and robust documentation to ensure data accuracy and compliance.
  • Use AI tools to support data profiling, anomaly detection, and automated documentation of data lineage and transformations.

AI-Integrated Data Engineering

  • Use AI-assisted tools (code generation, intelligent autocomplete, automated testing) as a regular part of your data engineering workflow.
  • Evaluate and integrate emerging AI tools and practices into the team's data development process.
  • Build and support agentic workflows and multi-step automated processes that act on data in real time, including AI-powered data validation and enrichment.
  • Apply AI-assisted analysis to debugging pipeline failures, optimizing query performance, and identifying data quality issues.

Performance & Reliability

  • Monitor and fine-tune pipeline and warehouse performance for scalability and cost efficiency.
  • Set up logging, monitoring, and alerting for data jobs to ensure reliability and fast incident response.

Security & Compliance

  • Apply data security and privacy controls aligned with financial regulatory requirements, ensuring full traceability of every transformation.
  • Foster a security-first mindset across all data operations.

Analytics Enablement

  • Provide clean, consistent datasets for analysts, product managers, and operational teams to support fast, data-driven decisions.
  • Collaborate closely with product managers, data scientists, and full stack engineers to align data models with business needs.

Qualifications

Required

  • 5+ years in a data engineering or similar backend data-focused role.
  • Strong SQL and Python development skills for data transformation and automation.
  • Experience with modern ETL/ELT frameworks such as dbt.
  • Proficiency with cloud platforms (AWS preferred) and serverless data services.
  • Strong experience with data warehouse technologies (Snowflake preferred).
  • Skilled in API integrations and ingestion from third-party systems.
  • Proficient in data modeling (Kimball/Star schema, Data Vault).
  • Demonstrated, regular use of AI-powered development tools (e.g., Cursor, GitHub Copilot, Claude Code, or similar) to accelerate data pipeline development, debugging, or documentation.
  • Proven track record of delivering production-grade data pipelines at scale.
  • Experience implementing CI/CD practices for data workflows.
  • Experience collaborating closely with product managers, data scientists, and full stack engineers.
  • Startup mindset: hands-on, resourceful, and comfortable operating in a fast-paced environment.

Preferred

  • Experience building agentic workflows and orchestrating multi-step automated processes that act on data in real time.
  • Familiarity with data engineering patterns and infrastructure required for AI-powered tools and automation platforms.
  • Experience working with financial datasets and APIs in a high-compliance environment.
  • Understanding of data privacy regulations such as GDPR and CCPA.
  • Experience with prompt engineering for code generation, data transformation logic, or building AI-powered data workflows.

Benefits

  • Competitive salary
  • Unlimited PTO
  • Remote-first with flexible hours
  • Upto $2,000/year professional development budget
  • Home office setup stipend

Key skills/competency

  • Senior Data Engineer
  • Data Pipelines
  • ETL/ELT
  • Data Warehousing
  • Data Modeling
  • Cloud Platforms
  • SQL
  • Python
  • AI-Integrated Data Engineering
  • Data Quality

Skills & topics

  • Senior Data Engineer
  • Data Engineering
  • ETL
  • ELT
  • Data Warehousing
  • Data Modeling
  • SQL
  • Python
  • AWS
  • Snowflake
  • AI
  • Machine Learning
  • Cloud Computing
  • BigQuery
  • dbt
  • API Integration
  • Data Quality
  • Data Governance
  • Remote
  • Finance

How to get hired

  • Tailor your resume: Highlight your 5+ years of data engineering experience, SQL/Python skills, cloud proficiency (AWS), and AI tool usage.
  • Showcase AI integration: Emphasize experience with AI-assisted development tools and building agentic workflows.
  • Demonstrate data modeling expertise: Detail your work with Snowflake, dbt, Kimball/Star schema, and Data Vault modeling.
  • Highlight startup experience: Mention your adaptability, resourcefulness, and comfort in fast-paced environments.
  • Prepare for technical and behavioral questions: Be ready to discuss pipeline development, data quality, and collaborative problem-solving.

Technical preparation

Master SQL and Python for data manipulation.,Practice building ETL/ELT pipelines with dbt.,Familiarize with Snowflake and cloud platforms (AWS).,Experiment with AI coding assistants like Copilot.

Behavioral questions

Describe a complex data pipeline you built.,How do you ensure data quality and accuracy?,Share an experience using AI tools in development.,How do you collaborate with cross-functional teams?

Frequently asked questions

What makes the Senior Data Engineer role at Saaf Finance unique?
The Senior Data Engineer role at Saaf Finance is unique because it places AI at the core of data infrastructure. You'll be instrumental in building AI-native systems for mortgage lending, integrating AI tools directly into data pipelines, warehousing, and quality assurance processes.
What specific AI tools are mentioned for the Senior Data Engineer role?
The job description mentions the demonstrated, regular use of AI-powered development tools such as Cursor, GitHub Copilot, Claude Code, or similar. It also highlights building agentic workflows and prompt engineering for AI-powered data workflows.
What are the primary data warehousing and modeling technologies used at Saaf Finance?
Saaf Finance utilizes modern data warehousing technologies like Snowflake and BigQuery. Proficiency in data modeling techniques such as Kimball/Star schema and Data Vault is also required.
What is the work arrangement for the Senior Data Engineer position?
The Senior Data Engineer position at Saaf Finance is a remote-first role with flexible hours, offering a high degree of autonomy and work-life balance.
What are the key benefits offered to a Senior Data Engineer at Saaf Finance?
Benefits include a competitive salary, unlimited PTO, a remote-first work environment with flexible hours, a professional development budget of up to $2,000 per year, and a home office setup stipend.
How does Saaf Finance emphasize data quality and governance for this role?
Saaf Finance emphasizes data quality and governance through implementing validation, schema management, and robust documentation. AI tools are leveraged for data profiling, anomaly detection, and automated lineage documentation.
What level of experience is required for the Senior Data Engineer position?
The role requires a minimum of 5 years in a data engineering or similar backend data-focused role. Proven experience delivering production-grade data pipelines at scale is also essential.
What are the preferred qualifications for this Senior Data Engineer role?
Preferred qualifications include experience with agentic workflows, familiarity with AI infrastructure, working with financial datasets in high-compliance environments, and understanding data privacy regulations like GDPR and CCPA.