Job Overview
Job Title: Senior Data Scientist NLP
Job Type: Full Time
Offered Salary: $200,000
Location: Hybrid

Job Description
About Datavant
Datavant is the data collaboration platform trusted for healthcare. Guided by our mission to make the world’s health data secure, accessible and actionable, we provide critical data solutions for organizations across the healthcare ecosystem - including providers, health plans, researchers, and life sciences companies. From fulfilling a single patient’s request for their medical records to powering the AI revolution in healthcare, Datavanters are building the future of how data is connected and used to improve health. By joining Datavant today, you’re stepping onto a driven and highly collaborative team that is passionate about creating transformative change in healthcare.
What We’re Looking For
We are looking for a motivated Data Scientist to help Datavant revolutionize the healthcare industry with AI. This is a critical role where the right candidate will have the ability to work on a wide range of problems in the healthcare industry with an unparalleled amount of data. You’ll join a team focused on deep medical document understanding, extracting meaning, intent, and structure from unstructured medical and administrative records. Our mission is to build intelligent systems that can reliably interpret complex, messy, and high-stakes healthcare documentation at scale.
Role
This role is a unique blend of applied machine learning, NLP, and product thinking. You’ll collaborate closely with cross-functional teams to:
- Design and develop models to extract entities, detect intents, and understand document structure
- Tackle challenges like long-context reasoning, layout-aware NLP, and ambiguous inputs
- Evaluate model performance where ground truth is partial, uncertain, or evolving
- Shape the roadmap and success metrics for replacing legacy document processing systems with smarter, scalable solutions
What You Will Do
- Play a key role in the success of our products by developing models for document understanding tasks.
- Perform error analysis, data cleaning, and other related tasks to improve models.
- Collaborate with your team by making recommendations for the development roadmap of a capability.
- Work with other data scientists and engineers to optimize machine learning models and insert them into end-to-end pipelines.
- Understand product use-cases and define key performance metrics for models according to business requirements.
- Set up systems for long-term improvement of models and data quality (e.g., active learning and continuous learning systems).
What You Need To Succeed
- 6+ years of experience with data science and machine learning in an industry setting, particularly in designing and building NLP models.
- Expertise with Python
- Experience with the latest developments in language models (transformers, LLMs, etc.)
- Proficiency with standard data analysis toolkits such as SQL, NumPy, and Pandas
- Proficiency with deep learning frameworks like PyTorch (preferred) or TensorFlow
- Industry experience shepherding ML/AI projects from ideation to delivery
- Demonstrated ability to influence company KPIs with AI
- Demonstrated ability to navigate ambiguity
What Helps You Stand Out
- Experience with document layout analysis (using vision or multi-modal approaches).
- Experience with Spark/PySpark
- Experience with Databricks
- Experience in the healthcare industry
After 3 Months, You Will…
Have a strong grasp of technologies upon which our platform is built. Be fully integrated into ongoing model development efforts with your team.
After 1 Year, You Will…
Be independent in reading literature and doing research to develop models for new and existing products. Have ownership over models internally, communicating with product managers, customer success managers, and engineers to make the model and the encompassing product succeed. Be a subject matter expert on Datavant’s models and a source from which other teams can seek information and recommendations.
Key skills/competency
- Natural Language Processing (NLP)
- Data Science
- Machine Learning
- Python
- Deep Learning
- Transformers
- Large Language Models (LLMs)
- PyTorch
- SQL
- Data Analysis
How to Get Hired at Datavant
- Tailor your resume: Highlight your 6+ years of NLP and ML experience, Python expertise, and familiarity with transformers and LLMs. Emphasize industry experience shepherding AI projects.
- Showcase impact: Quantify your achievements and demonstrate how you’ve influenced company KPIs with AI in previous roles.
- Highlight relevant skills: Emphasize proficiency in Python, SQL, Pandas, NumPy, and deep learning frameworks like PyTorch or TensorFlow. Mention any healthcare industry or Databricks experience.
- Prepare for technical interviews: Be ready to discuss your approach to NLP challenges, model development, error analysis, and data quality systems.
- Understand Datavant's mission: Articulate your passion for improving healthcare through data and AI.