Technical Lead
2021 – Present Bengaluru, India
Overview
Leading cloud-native and AI system initiatives at IBM, focusing on resilient distributed platforms, automation, and scalable solutions. Driving innovations that enable real-time, high-throughput operations and building foundations for AI-driven decision support systems.
Key Achievements
Event-Driven Platform Architecture Architected a mission-critical publish-subscribe platform handling millions of events daily, ensuring reliability, scalability, and minimal latency.
High-Throughput Retry Engine Designed a retry engine capable of processing millions of events per day with minimal operational overhead and automated error handling.
Observability & Monitoring Built global ELK-based observability dashboards providing real-time insights into system health, enabling proactive incident detection and faster remediation.
AI-Powered Remediation Agent Developed a RAG-based remediation agent using LangChain & WatsonX to assist in automated problem resolution and anomaly detection.
Kubernetes & GitOps Transformation Led the cloud-native infrastructure transformation, implementing GitOps pipelines and container orchestration with Kubernetes to enhance deployment agility.
Open-Source Contribution & SDK Development Contributed to Go Kafka client libraries, improving the reliability and performance of messaging systems across multiple teams and projects.
Technology Stack
Kafka • Kubernetes • Go • Python • LangChain • IBM Cloud • WatsonX • Elastic Stack