11 hours ago

Data Science Engineer

DevSavant

Hybrid
Full Time
$145,000
Hybrid

Job Overview

Job TitleData Science Engineer
Job TypeFull Time
Offered Salary$145,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About DevSavant

At DevSavant, we are a trusted technology partner specializing in Software Development, Data Engineering, AI/Machine Learning, Cloud Solutions, Automation Testing, and UI/UX Design. We deliver innovative, high-quality solutions with a focus on excellence and results. Our people are at the heart of everything we do, fostering a culture of growth and well-being. Join us and thrive in a supportive, success-driven environment.

We're looking for a talented Data Science Engineer with expert Python skills and experience in processing large amounts of data to join our client's team. You'll be a key player in designing, building, and making our main data pipelines and ML systems (that power our advanced analytics and machine learning models) able to handle more. You'll work closely with data scientists and engineers to create strong, efficient, and scalable systems. If you love solving complex technical problems, building production-ready data systems, and want to make a big impact on a data-driven company, this job is for you!

Antenna, our client, is a a remote-first company, and we are looking for candidates who can work during US business hours. You will report to the Data Science Lead.

What You'll Do

  • Design, develop, test, and maintain strong and scalable data pipelines using Python and tools for large-scale data processing (like Spark, Dask, or similar on GCP).
  • Design and take ownership of key parts of our ML systems, making sure they are reliable, efficient, and can grow.
  • Set up and manage MLOps practices, including automatic updates for machine learning models (CI/CD), model monitoring, and automated launch plans.
  • Improve and manage data processing jobs on cloud platforms (GCP: Dataproc, BigQuery, Cloud Run, Cloud Build).
  • Work with data scientists to get machine learning models ready for production and connect them to our data systems.
  • Write detailed documents for the system designs, code, and systems you create and manage.
  • Fix complex technical problems in data systems that run on many computers and in ML pipelines.

Who You Are

  • You have 3-5+ years of work experience in software engineering, with a strong focus on data engineering, ML engineering, or building applications that use a lot of data.
  • You are an expert in Python, with a strong understanding of object-oriented design, software system design, and experience building high-quality, testable code for production.
  • You have strong, hands-on experience with tools for handling large amounts of data like Apache Spark (PySpark), Dask, or similar.
  • You have solid experience with cloud platforms (GCP is highly preferred). This includes putting services live, managing them, making them handle more users (e.g., Docker, Cloud Run, GKE), and working with large data systems (e.g., Dataproc, BigQuery).
  • You have strong SQL skills and experience working with large, complex datasets.
  • You have a deep understanding of machine learning ideas, the full process of creating a model, and MLOps principles.
  • You are an excellent problem-solver, good at fixing complex issues in systems that run on many computers, and making them perform better and handle more data.
  • You explain complex technical ideas and system design decisions clearly and effectively in English.
  • Advanced English proficiency (B2-C1); Excellent communication, teamwork, and consulting skills.
  • You are passionate about building strong, scalable systems and are eager to guide and work with a team.
  • You care deeply about code quality, system reliability, and writing good documentation.

Bonus Points

  • Experience in or passion for the Subscription Economy, especially in media and entertainment.
  • Deep knowledge of specific GCP services like Dataproc, Dataflow, Cloud Composer, Vertex AI, or Kubernetes Engine.
  • Experience building and maintaining Python code (libraries) used by many, or contributions to open-source projects.
  • Advanced knowledge of MLOps tools and ways to manage workflows (e.g. Cloudbuild, CloudRun).

Tech Stack

  • Languages: Python (expert), SQL (strong)
  • Large-Scale Data Processing: Apache Spark/PySpark (or similar like Dask)
  • Cloud Platform: Google Cloud (Dataproc, BigQuery, Cloud Storage, Cloud Run, Cloud Build, GKE - strong experience expected)
  • Version Control: Git (expert)
  • MLOps & Orchestration: Familiar with tools like Airflow, Kubeflow, Vertex AI Pipelines
  • Containerization: Docker, Kubernetes
  • Data Analysis Libraries: Pandas, NumPy (very good with these)
  • Machine Learning: scikit-learn, TensorFlow/PyTorch (understand how to get them to production)
  • AI Tools: Claude, Gemini, OpenAI offerings

Key skills/competency

  • Data Pipelines
  • MLOps
  • Scalable Systems
  • Python Development
  • Google Cloud Platform (GCP)
  • Apache Spark / PySpark
  • Machine Learning Engineering
  • SQL
  • System Design
  • Cloud Computing

Tags:

Data Science Engineer
Machine Learning Engineer
Data Engineer
Python Developer
data pipelines
ML systems
MLOps
scalable systems
cloud computing
data processing
model deployment
system design
problem solving
code quality
Python
SQL
Apache Spark
PySpark
Google Cloud Platform
Dataproc
BigQuery
Docker
Git
Kubernetes
TensorFlow

Share Job:

How to Get Hired at DevSavant

  • Research DevSavant's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor. Focus on their commitment to excellence and growth, as a technology partner.
  • Tailor your resume strategically: Highlight your 3-5+ years in data engineering, ML engineering, or data-intensive software engineering. Emphasize expert Python skills, large-scale data processing (Spark/Dask), and strong GCP experience (Dataproc, BigQuery).
  • Showcase impactful projects: Provide specific examples where you designed, built, and scaled data pipelines or ML systems, detailing your role, the technologies used, and the measurable outcomes.
  • Prepare for technical depth: Be ready for interviews on advanced Python (OOP, system design), SQL, large-scale data processing concepts, GCP services, MLOps principles, and problem-solving complex distributed systems.
  • Demonstrate excellent communication: Practice clearly articulating technical concepts and system design decisions in English, as collaboration and consulting skills are crucial for this client-facing role.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background