Additional Important Note For Applicants
- Currently, only immediate joiners (who have already completed their notice period) or candidates serving a notice period of up to 30 days will be considered for this opportunity.
- Candidates with longer notice periods may not be considered at this stage due to urgent project requirements.
Important Note for Applicants
Kindly read the job description carefully before applying. Please apply only if your experience, technical skills, and notice period align with the mandatory requirements mentioned above. Profiles that do not meet the core criteria may face rejection during the screening process, which can lead to unnecessary time and effort from both sides. We appreciate your understanding and cooperation.
Job Title: Senior AI Engineer / Player-CoachExperience:
4.5+ Years
Location:
Pune (Viman Nagar)
Shift Timings
11:00 AM – 8:00 PM
Job Overview
We are looking for an experienced Senior AI Engineer / Player-Coach to lead the development of next-generation Generative AI solutions focused on intelligent search, content automation, and AI-driven business workflows.
This is a highly hands-on leadership role requiring strong expertise in LLM application development, Retrieval-Augmented Generation (RAG) systems, MLOps, and cloud-native AI infrastructure. The ideal candidate should be capable of architecting scalable AI systems while mentoring a small engineering team and collaborating closely with product and business stakeholders.
Key Responsibilities
-
Design, develop, and deploy advanced Retrieval-Augmented Generation (RAG) systems for intelligent search and discovery platforms
-
Build and optimize generative AI solutions for content creation, summarization, metadata enrichment, and workflow automation
-
Evaluate, implement, and manage Gen AI infrastructure using managed LLM services or self-hosted GPU-based environments
-
Define and implement best practices for:
-
Repository structure
-
CI/CD pipelines
-
Prompt management
-
Model versioning
-
Implement observability and monitoring for AI systems using tools such as OpenTelemetry and Prometheus
-
Monitor and optimize AI performance metrics including latency, cost, accuracy, hallucination detection, toxicity monitoring, and data drift
-
Lead and mentor a team of AI/ML engineers
-
Collaborate with product, engineering, and leadership teams to deliver scalable AI-driven solutions
-
Drive experimentation, rapid prototyping, and continuous improvement across AI initiatives
Required Skills & ExperienceCore Experience
- 5+ years of hands-on Python development experience
- Strong experience building and productionizing LLM-powered applications
- Experience leading technical teams or mentoring engineers
Must-Have Technical SkillsGenerative AI & LLM Frameworks
Hands-on Experience With
- DSPy
- LangChain
- LlamaIndex
- Hugging Face Transformers
RAG Systems & Vector Databases
Strong Expertise In
-
Retrieval-Augmented Generation (RAG)
-
Prompt Engineering
-
Chunking strategies
-
Embedding pipelines
-
Vector databases such as:
-
Pinecone
-
Weaviate
-
Milvus
Cloud & Infrastructure
Strong Cloud Experience With
- Azure OpenAI Service or AWS Bedrock
- Azure or AWS infrastructure services
- Kubernetes orchestration (AKS/EKS)
- Serverless services
- Cloud storage solutions
MLOps & DevOps
Experience With
- Docker
- CI/CD pipelines
- GitHub Actions
- Argo Workflows
- Model registries
- AI deployment best practices
Leadership & Communication
- Proven ability to lead small, highly technical teams
- Strong stakeholder communication and collaboration skills
- Ability to translate business requirements into scalable AI solutions
Good-to-Have Skills
-
Experience with agentic AI workflows:
-
AutoGen
-
CrewAI
-
Familiarity with multi-modal AI models (text, image, etc.)
-
Experience with advanced fine-tuning techniques:
-
LoRA
-
QLoRA
-
Strong SQL skills
-
Experience with ClickHouse
-
Exposure to inference cost optimization techniques
Skills: weaviate,mlops,kubernetes orchestration,dspy,devops,llamaindex,github actions,ci/cd pipelines,langchain,prompt engineering,docker,model registries,azure openai service,hugging face transformers,rag systems,chunking strategies,milvus,leadership,pinecone,embedding pipelines,argo workflows,aws infrastructure services