3 days ago

Staff Software Engineer AI

FourKites, Inc.

On Site
Full Time
$220,000
Greater Chennai Area

Job Overview

Job Title: Staff Software Engineer AI
Job Type: Full Time
Category: Commerce
Experience: 5 Years
Degree: Master
Offered Salary: $220,000
Location: Greater Chennai Area

Job Description

About the Role: Staff Software Engineer AI

At FourKites, we have the opportunity to tackle complex challenges with real-world impacts. Whether it’s medical supplies from Cardinal Health or groceries for Walmart, the FourKites platform helps customers operate global supply chains that are efficient, agile, and sustainable. Join a team of curious problem solvers that celebrates differences, leads with empathy, and values inclusivity.

We are seeking an experienced Staff Software Engineer AI to join our AI and Data Platform team, where you'll play a pivotal role in building and scaling our next-generation AI workforce platform. You'll work on cutting-edge agent-based systems that are transforming supply chain operations for Fortune 500 companies, delivering real business value through intelligent automation.

What you'll be doing

Technical Leadership
  • Design and implement production-scale AI agent systems and orchestration frameworks (LangGraph, LangChain, similar architectures)
  • Lead architecture for multi-agent systems handling complex business workflows
  • Optimize deployment strategies using both LLMs and SLMs based on use case requirements
  • Build natural language-configurable business process automation frameworks
  • Implement multi-modal AI systems for document understanding (tables, charts, layouts)
AI/ML Implementation & Optimization
  • Deploy and optimize LLMs/SLMs in production with fine-tuning techniques (LoRA, QLoRA, DPO)
  • Implement quantization strategies (INT8, INT4) and model distillation for edge deployment
  • Build evaluation frameworks including LLM-as-judge systems and regression testing
  • Design streaming architectures for real-time LLM responses (SSE, WebSockets)
  • Create semantic caching and embedding-based retrieval systems
  • Develop GraphRAG and long-context handling strategies (100k+ tokens)
System Architecture & Engineering
  • Design scalable microservices with comprehensive observability (LangSmith, Arize, custom telemetry)
  • Build secure multi-tenant systems with prompt injection prevention and output validation
  • Implement cost optimization through intelligent model routing and fallback strategies
  • Develop document processing pipelines with OCR and layout understanding
  • Create event-driven architectures for real-time shipment tracking and exception handling
Data & Infrastructure
  • Build data pipelines for training data curation, synthetic generation, and PII masking
  • Implement RLHF/RLAIF feedback loops for continuous improvement
  • Design experiment tracking and model registry systems (MLflow, DVC)
  • Optimize inference costs through batch processing and spot instance utilization
  • Establish model governance, audit trails, and compliance frameworks

About the team

The AI team at FourKites builds the intelligence layer behind our next-generation AI workforce, powering autonomous agents used by Fortune 500 supply chains every day. We operate at the intersection of production-grade AI, distributed systems, and real-world logistics, focusing on scalability, reliability, and measurable business impact. The team values deep technical ownership, rapid experimentation, and thoughtful optimization—balancing model performance, cost, and latency while building systems that operate at massive scale.

Who you are

Technical Skills
  • 8+ years of software engineering experience and 3+ years working on production AI/ML systems
  • Expertise in Python, PyTorch/JAX, and AI frameworks (LangChain, Transformers, PEFT)
  • Experience with LLMs (GPT-4, Claude, Gemini) and SLMs (Phi, Llama, Mistral)
  • Hands-on experience with:
    • Fine-tuning techniques (LoRA, QLoRA, DPO, RLHF)
    • Model optimization (quantization, distillation, pruning)
    • Vector databases and RAG architectures
    • Streaming systems and real-time processing
    • Security measures (prompt injection prevention, jailbreak detection)
  • Strong background in distributed systems, Kubernetes, and cloud platforms
Domain Knowledge (nice to have)
  • Experience with document intelligence and multi-modal AI systems
  • Understanding of supply chain operations, EDI/API integrations
  • Knowledge of token economics and consumption-based pricing models
  • Familiarity with enterprise compliance requirements (GDPR, CCPA, SOC2)
Professional Skills
  • Track record of delivering complex projects with measurable business impact
  • Experience with technical sales support, POCs, and customer success
  • Strong communication for technical and non-technical audiences
  • Data-driven decision making for model selection and cost optimization

Preferred Qualifications

  • Supply chain, logistics, or transportation management experience
  • Experience with OCR pipelines and document extraction at scale
  • Knowledge of GraphRAG and knowledge graph integration
  • Contributions to open-source AI projects (Hugging Face, Ollama)
  • Experience reducing inference costs by 50%+ through optimization
  • Familiarity with MoE architectures and constitutional AI approaches
  • Background in building usage-based billing and margin optimization
  • Experience with specialized tools (vLLM, TGI, Triton, ONNX, TensorRT)

What You'll Work On

  • Building specialized AI agents solving supply chain problems
  • Fine-tuning domain-specific models for supply chain terminology
  • Implementing hybrid architectures combining cloud LLMs with edge SLMs
  • Creating secure document intelligence systems for Fortune 500 clients
  • Developing real-time exception handling for shipment tracking
  • Building observability and evaluation frameworks for agent performance
  • Designing fallback strategies and multi-provider redundancy

Technical Environment

  • Models: GPT-4, Claude, Gemini, Llama 3, Mistral, Phi-3, custom fine-tuned models
  • Fine-tuning: LoRA/QLoRA, PEFT, DeepSpeed, bitsandbytes, Axolotl
  • Infrastructure: Kubernetes, AWS SageMaker/Bedrock, GPU clusters, edge devices
  • Frameworks: LangChain, LangGraph, vLLM, FastAPI, Transformers
  • Observability: LangSmith, Weights & Biases, custom telemetry
  • Data: PostgreSQL, Redis, Vector DBs, Kafka, feature stores

Impact & Growth

You'll directly contribute to AI initiatives generating millions in revenue while shaping systems processing millions of transactions daily. Lead technical decisions affecting 25+ engineers while mentoring the next generation of AI engineers. Be at the forefront of production AI optimization, balancing performance, cost, and latency for enterprise customers.

We know that job postings can be intimidating, and research shows that while men apply to jobs when they meet an average of 60% of the criteria, women and other marginalized folks tend to only apply when they check every box. We encourage you to apply if you think you may be a fit and give us both a chance to find out!

Who we are

FourKites®, the leader in AI-driven supply chain transformation for global enterprises and pioneer of advanced real-time visibility, turns supply chain data into automated action. FourKites’ Intelligent Control Tower™ breaks down enterprise silos by creating a real-time digital twin of orders, shipments, inventory and assets. This comprehensive view, combined with AI-powered digital workers, enables companies to prevent disruptions, automate routine tasks, and optimize performance across their supply chain. FourKites processes over 3.2 million supply chain events daily — from purchase orders to final delivery — helping 1,600+ global brands prevent disruptions, make faster decisions and move from reactive tracking to proactive supply chain orchestration.

FourKites provides competitive compensation with stock options, outstanding benefits and a collaborative culture for all employees around the globe. To help you be your best, we have 5 global recharge days, in addition to generous PTO and standard holidays. Parental leave for all parents, an annual wellness stipend and volunteer days also provide you with time and resources for self-care and to care for others. Throughout the year, FourKites sets aside time during the workday to learn and celebrate diversity. We're always listening for new ways to support everyone in and out of the office.

For India
  • Medical benefits start on first day of employment
  • 36 PTO days (Sick, Casual and Earned), 5 recharge days, 2 volunteer days
  • Home office setups and technology reimbursement
  • Lifestyle & Family benefits
  • Mental Wellness support and guidance
  • Ongoing learning & development opportunities (professional development program, Toastmasters club, etc.)

FourKites is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Key skills/competency

  • AI Agent Systems
  • LLM/SLM Optimization
  • Production AI/ML
  • Distributed Systems
  • Python & PyTorch
  • LangChain/LangGraph
  • Kubernetes & AWS
  • Data Pipelines
  • Microservices
  • Supply Chain AI

Tags:

Staff Software Engineer AI
AI Engineering
Machine Learning
Deep Learning
Software Development
Distributed Systems
Supply Chain AI
LLM
SLM
MLOps
AI Agent Systems
LLM Optimization
Microservices
Data Pipelines
Technical Leadership
Architecture Design
Production AI
Real-time Systems
Observability
Model Governance
Python
PyTorch
JAX
LangChain
Transformers
Kubernetes
AWS
GPT-4
Llama
PostgreSQL
Kafka
MLflow
Vector DB
Redis

How to Get Hired at FourKites, Inc.

  • Research FourKites' culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
  • Tailor your resume: Highlight expertise in AI agent systems, LLMs, distributed systems, and supply chain relevance.
  • Showcase technical projects: Demonstrate practical experience with LangChain, PyTorch, Kubernetes, and model optimization.
  • Prepare for AI/ML specific questions: Be ready to discuss architecture, fine-tuning, and real-world AI implementation challenges.
  • Emphasize problem-solving: Illustrate how you tackle complex challenges with data-driven decision-making and business impact.
