Rishi Tiwari / python-llm.workspace

README.md

import reliability, performance, correctness

Rishi Tiwari

Senior Software Engineer II | LLM Systems Engineer

Bangalore, India

Senior software engineer building production backend and AI systems with a focus on reliability, latency, and measurable delivery.

selected_work.py

Selected Work

Pulse (AI Observability and Evaluation Platform)

Unified release-quality platform for AI agents covering observability, simulation, and offline evaluation.

  • What: One system for inline telemetry, simulation, and post-hoc evaluations across 20K+ daily conversations
  • How: Temporal-orchestrated batch runs with mix-and-match model routing across OpenAI and Gemini
  • Impact: Reduced recurring post-hoc evaluation spend by ~50%; platform capacity supports 1M+ evals/day (cost-governed in production)
PythonTemporalOpenAI Batch APIGoogle Gemini Batch APIAzureGoogle Gen AI SDK

Destination Content Store (MakeMyTrip)

Data generation and retrieval pipeline that powers destination-related queries on Myra.

  • What: Automated content generation for destinations and attractions
  • How: Indexed embeddings in Qdrant and modeled entity relationships in a knowledge graph
  • Impact: Moved destination answers from generic LLM memory to DCS-powered retrieval with 30% higher groundedness and broader entity coverage (4-5 -> 10+)
LLMsPythonQdrantKnowledge GraphData Pipelines

Myra (MakeMyTrip)

First LLM-driven use case launched at MMT: a virtual flight booking assistant for chat and voice journeys in English and Hindi.

  • What: Shipped core assistant flow from search through payment
  • How: Productionized custom LangChain modules for low-latency execution
  • Impact: Piloted to 1% of users while reducing production response time by ~90% (25-30 s -> 2-3 s)
LLMsLangChainPythonPrompt Orchestration

Data Migration and Marketing ETL Pipelines

Data platform workstream focused on large-scale OLTP -> Delta Lake (OLAP) migration and downstream activation pipelines.

  • What: Migrated critical OLTP analytics sources (including payments) to Delta Lake for OLAP consumption
  • How: Built config-driven Spark pipelines tuned for multi-TB/day processing with minimal source-system impact
  • Impact: Stabilized MMT analytics backbone while reducing setup time from 2 days to 3-4 hours; full dump jobs completed in <1 hour
SparkDeltaPythonFeature StoreAWS

Curie (Energy NL-to-SQL Platform)

Natural-language analytics workflow for enterprise energy management operations.

  • What: Delivered NL-to-SQL access for enterprise operations teams
  • How: Built conversational querying and alerting widgets into the web app
  • Impact: Increased supported query types by ~80% in fewer release cycles
NLPNL-to-SQLWeb ApplicationData UX
Publication

philosophy.md

Engineering Philosophy

Reliability

  • Design for graceful degradation under partial failures.
  • Use clear ownership boundaries and observable runbooks.

Performance

  • Treat latency budgets as product requirements.
  • Profile bottlenecks first, then optimize with measurable deltas.

LLM Correctness

  • Constrain prompts with task-scoped context and guardrails.
  • Validate outputs with deterministic checks before user impact.

experience.py

Experience

Senior Software Engineer II

Oct 2021 - Present

MakeMyTrip · Bangalore

  • Led development of Pulse, the internal observability and evaluation platform for AI agents processing 20K+ conversations daily.
  • Defined inline and post-hoc quality metrics, including Relevance, Context Retention, and Frustration.

Systems Engineer

Aug 2019 - Oct 2021

Tata Consultancy Services · Gurugram

  • Built Curie, an NL-to-SQL web application for enterprise energy management analytics used by internal operations users.
  • Implemented a conversational interface that reduced dependency on ad-hoc analyst SQL for operations teams.

skills.py

Skills

Backend Systems

PythonJavaFlaskDjangoSQLAlchemyAPI DesignDistributed Services

LLM / AI

LangChainPrompt OptimizationLLM Application DevelopmentNL-to-SQLEvaluation LoopsGroundingAutogenLlamaIndexDSPyGoogle ADKLiteLLM

Data

SparkAirflowETLReverse ETLMySQLAmazon AthenaAWS Glue Catalog

Infrastructure

AWSAzureDockerGitObservabilityCI/CD

contact.py

Contact

Reach out via LinkedInLinkedIn