Join Our Community
Data Scientist – Senior
Role Overview
We're looking for a Senior GenAI \& Data Scientist who can own and drive end\-to\-end AI development independently. This is not a hand\-holding role — you'll architect, build, evaluate, and deploy production GenAI systems with full ownership. You'll work directly with the GenAI lead and product teams to solve hard problems in healthcare AI, multilingual NLP, and agentic automation.
Key Responsibilities
Design and build production\-grade GenAI applications — LLM pipelines, RAG systems, multi\-agent architectures, and AI\-powered APIs.
Develop and maintain agentic workflows with tool\-calling, function routing, session management, and streaming — using frameworks such as LangChain, LlamaIndex, or Google GenAI SDK.
Build and optimize RAG pipelines with vector databases (Qdrant, FAISS, Pinecone, or Weaviate), embedding models, and hybrid retrieval strategies.
Engineer multilingual NLP systems supporting language detection, translation caching, and LLM\-native language enforcement across regional Indian languages.
Develop and deploy ML/AI models via FastAPI backends — async, WebSocket\-enabled, and containerized for GCP/GKE.
Collaborate with the platform team to design scalable AI infrastructure on GCP: Cloud Run, GKE, Pub/Sub, Cloud Scheduler, GCS, BigQuery.
Write and maintain a comprehensive test suite for AI pipelines — unit, integration, and load tests with in\-memory mocking of LLMs and DBs.
Conduct model evaluation, prompt experimentation, and iterative improvement with structured metrics.
Translate clinical and business requirements into robust data\-driven solutions — working closely with product owners and domain experts.
Contribute to technical documentation, architecture decisions, and internal knowledge sharing.
Mentor junior engineers and uphold engineering quality across the GenAI stack.
Required Qualifications
Bachelor's or Master's degree in Computer Science, Data Science, Mathematics, or a related field.
4–6 years of hands\-on experience in Data Science, ML, or AI engineering — with at least 2 years in production GenAI systems.
Strong proficiency in Python — async programming, type hints, modular architecture.
Solid SQL skills for data analysis, pipeline validation, and query optimization.
Hands\-on experience with LLMs (Gemini, GPT\-4, Claude, Mistral, or equivalent) and prompt engineering at scale.
Practical experience building RAG systems with vector stores and embedding pipelines.
Experience developing AI agents — tool definition, chained execution, session handling, and failure recovery.
Working knowledge of ML frameworks — Scikit\-learn, TensorFlow, or PyTorch — for classical and deep learning use cases.
Experience with cloud platforms, preferably GCP (Cloud Run, GKE, GCS, Pub/Sub, BigQuery).
Familiarity with FastAPI or equivalent for building production AI backends.
Comfortable working in a fast\-moving startup environment with high ownership and low hand\-holding.
Good to Have
Experience with multilingual NLP — Indian languages (Hindi, Bengali, Odia, Kannada, Tamil) is a strong plus.
Exposure to clinical AI — SOAP notes generation, medical entity extraction, or healthcare data pipelines.
Knowledge of Speech AI — STT/TTS pipelines (Google Chirp, Azure Speech, or similar).
Familiarity with Computer Vision — CNNs, YOLO, OCR, or Vision Transformers.
Experience with WebSocket streaming in AI applications.
Prior work with async event\-driven architectures (Pub/Sub, Kafka, or equivalent).
Contributions to or publications in AI/ML communities.
Location
Bhubaneswar or Bangalore (Preferred)
Joining Timeline
immediate to 30 days
Pay: Up to ₹700,000\.00 per year
Work Location: In person
Azure Devsec Ops Engineer
Persistent Systems · Hyderabad, Telangana, India
Azure DevSecOps Engineer with Python
Persistent Systems · Hyderabad, Telangana, India
Senior AI Engineer – Generative AI & Azure
Innova ESI · Bengaluru, Karnataka, India