Technical Lead

2021 – Present Bengaluru, India

Overview

Leading cloud-native and AI system initiatives at IBM, focusing on resilient distributed platforms, automation, and scalable solutions. Driving innovations that enable real-time, high-throughput operations and building foundations for AI-driven decision support systems.

Key Achievements

  • Event-Driven Platform Architecture Architected a mission-critical publish-subscribe platform handling millions of events daily, ensuring reliability, scalability, and minimal latency.

  • High-Throughput Retry Engine Designed a retry engine capable of processing millions of events per day with minimal operational overhead and automated error handling.

  • Observability & Monitoring Built global ELK-based observability dashboards providing real-time insights into system health, enabling proactive incident detection and faster remediation.

  • AI-Powered Remediation Agent Developed a RAG-based remediation agent using LangChain & WatsonX to assist in automated problem resolution and anomaly detection.

  • Kubernetes & GitOps Transformation Led the cloud-native infrastructure transformation, implementing GitOps pipelines and container orchestration with Kubernetes to enhance deployment agility.

  • Open-Source Contribution & SDK Development Contributed to Go Kafka client libraries, improving the reliability and performance of messaging systems across multiple teams and projects.

Technology Stack

Kafka • Kubernetes • Go • Python • LangChain • IBM Cloud • WatsonX • Elastic Stack