17 hours ago

Vectorization & Embeddings Engineer

Jobs via Dice

Hybrid
Full Time
$180,000
Hybrid

Job Overview

Job TitleVectorization & Embeddings Engineer
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$180,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

Job Description: Vectorization & Embeddings Engineer

Sidmans is seeking a skilled Vectorization & Embeddings Engineer with 4+ years of experience to join their team. This role focuses on the critical area of developing and optimizing robust pipelines for converting various data types into high-quality embeddings, a cornerstone for advanced AI and search functionalities.

Key Responsibilities

  • Lead the design and development of scalable vectorization pipelines to convert text, PDFs, and multimodal data into high-quality embeddings using models like BGE, Ada, or Cohere.
  • Manage and optimize vector databases such as Pinecone, Weaviate, Milvus, and Ravecter, including building efficient indexing strategies (e.g., HNSW, RiSKANN).
  • Develop advanced chunking and parsing techniques to maintain context and improve retrieval accuracy within complex datasets.
  • Build and fine-tune hybrid search workflows, combining the power of dense vector search with sparse keyword methods like BM25.
  • Create comprehensive evaluation frameworks and gold-standard datasets to meticulously measure retrieval performance, focusing on metrics like Hit Rate, MRR, and Faithfulness.
  • Implement sophisticated re-ranking pipelines utilizing cross-encoders to refine search results before they are delivered to Large Language Models (LLMs).
  • Collaborate closely with engineering and product teams to ensure the development and deployment of scalable, high-performance retrieval systems.

Key skills/competency

  • Vectorization
  • Embeddings
  • Pipeline Development
  • Vector Databases (Pinecone, Weaviate)
  • Indexing Strategies (HNSW, RiSKANN)
  • Chunking & Parsing
  • Hybrid Search
  • BM25
  • Cross-encoders
  • LLMs Integration

Tags:

Vectorization & Embeddings Engineer
Vectorization
Embeddings
Pipeline Development
Vector Databases
Indexing
Retrieval
Hybrid Search
Evaluation
Re-ranking
LLMs Integration
Pinecone
Weaviate
Milvus
BGE
Ada
Cohere
HNSW
RiSKANN
BM25
Cross-encoders

Share Job:

How to Get Hired at Jobs via Dice

  • Research Sidmans' culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
  • Tailor your resume: Highlight experience in vectorization, embeddings, and relevant databases (Pinecone, Weaviate, Milvus) to match the Vectorization & Embeddings Engineer role.
  • Showcase technical expertise: Prepare to discuss scalable pipeline development, advanced retrieval techniques, and work with models like BGE, Ada, or Cohere.
  • Master behavioral questions: Practice articulating your problem-solving skills, collaboration style, and how you ensure high-performance systems.
  • Connect with the team: Network with current or former Sidmans employees on LinkedIn to gain insights and demonstrate genuine interest.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background