Principal Engineer · MTS

Madhur Goel

Systems engineer building latency-critical infrastructure and production ML pipelines from scratch.

Kernel-Bypass HFT Edge ML Inference End-to-End MLOps

Featured Work

Where the hard problems lived.

SelfAware Machines Founding Engineer
2022 – 2025

Build an HFT platform for a 100+ trader desk where every microsecond of OS scheduling jitter was a liability.

  • Bypassed the Linux kernel entirely with a user-space network driver to eliminate OS scheduling jitter from the critical path.
  • Replaced OS-level IPC with lock-free shared memory queues. Cache-line alignment of all hot data structures to minimize L1/L2 cache misses was the most time-intensive optimization across the project.
  • High-speed lock-free logger writing to shared memory, with a dedicated thread handling disk synchronization, keeping the trading path clear of I/O blocking.
  • Cleared NSE black-box certification and algo-approval. Built and maintained as a solo engineer throughout.
2μs tick-to-order at p99.2
200k packets/sec/core, zero loss
NSE black-box certified
C / C++ Kernel bypass User-space networking Lock-free data structures Cache-line optimization Linux internals
webrtc.ai Founding Engineer & CTO
2020 – 2022

Run real-time AI inference for 40 students simultaneously inside a browser, on a teacher's home network, without degrading audio or video.

  • Architected a 6-region AWS platform with state in European data centers. Students connect to regional servers; teachers connect to the Indian region, minimizing round-trip latency on both ends.
  • Separated video, audio, and data into three independent streams. Video degrades under low bandwidth while audio and data continue uninterrupted at 400kbps student-side.
  • Deployed MobileNet V3 via TensorFlow.js, throttled to 1fps to share the browser's compute budget with concurrent audio decode, video decode, and real-time whiteboard.
  • Distracted students surfaced automatically to the teacher's feed, reducing monitoring load across all 40 sessions.
99.9% SLA on voice and whiteboard
1fps ML inference, in-browser
6 AWS regions, geo-routed
MobileNet V3 TensorFlow.js Edge inference WebRTC AWS multi-region Distributed systems
Tangent.ai Founding Engineer & CTO
2017 – 2020

Build an ML pipeline flexible enough to train and deploy across entirely different domains without rebuilding from scratch each time.

  • Built the model-independent CI/CD training and deployment pipeline before any production use case, enabling rapid cross-domain experimentation.
  • First deployment: automated visual defect detection for a bottle manufacturing plant. Second: real-time image personalization for consumer brands using custom StyleGAN architectures.
  • End-to-end data pipeline covering collection, PII removal, and tagging across public and proprietary sources. Data preparation was consistently the most time-intensive phase of each new deployment.
StyleGAN custom-trained architectures
2+ domains from one pipeline
Seed Alchemist Accelerator Cohort XX
StyleGAN PyTorch MLOps CI/CD pipelines Data pipeline PII removal
Current Work SAM42 · Nextgen
2025 – Present

Production AI systems and intelligent infrastructure: agentic pipelines, hybrid RAG, and statistically-driven payment routing.

  • Hybrid RAG pipeline combining dense vector search with knowledge graph traversal for joint retrieval over text and image corpora.
  • Agentic portfolio risk analysis system across equity, equity derivatives, and currency derivatives, replacing manual analysis workflows.
  • Payment orchestration engine handling 1,500+ transactions/minute on Indian UPI rails. Gateway routing uses a parameterized 2σ deviation threshold, outperforming regression and decision tree baselines in live testing.
  • Building DSPy-based modular AI agents with built-in prompt optimization, portable across models and inference backends by design.
LangChain LangGraph DSPy RAG Knowledge graphs Cloudflare Workers Supabase / PostgreSQL

Experience

18 years across systems, ML, and markets.

2025 – Now
SAM42.com Principal Engineer (Independent)

LLM systems, agentic pipelines, and RAG infrastructure.

2025 – Now
Nextgen Techno Ventures Principal Engineer (Fractional)

Payment orchestration engine on Indian UPI rails, 1,500+ TPS.

2022 – 2025
SelfAware Machines Founding Engineer

Kernel-bypass HFT platform, NSE-certified, 2μs tick-to-order at p99.2.

2020 – 2022
webrtc.ai Founding Engineer & CTO

6-region video conferencing platform with in-browser AI inference.

2017 – 2020
Tangent.ai Founding Engineer & CTO

Real-time image personalization with custom StyleGAN. Alchemist Accelerator Cohort XX.

2015 – 2017
Freelance Deep Learning Consultant

Video anomaly detection, NLP, and Netflix server load prediction.

2010 – 2015
Prop. Algo Trading Desk Quant Strategist & Tech Lead

HFT strategies and low-latency trading platform for NSE.

2008 – 2010
iXiGO.com Product Associate

Early team member at India's leading travel search platform.

Technical Skills

Tools and domains.

Low-Latency Systems

Kernel bypass User-space networking Lock-free data structures Cache-line optimization Real-time IPC High-speed logging NSE connectivity Linux internals C C++

ML / MLOps

PyTorch TensorFlow TensorFlow.js MobileNet V3 StyleGAN LangChain LangGraph DSPy openai-agents-sdk Multi-GPU training Edge inference Model CI/CD

Distributed Infrastructure

AWS multi-region Cloudflare Workers WebRTC Supabase / PostgreSQL MongoDB UPI payment rails Serverless

Get in Touch

Let's work on something interesting.

Open to Principal Engineer, Staff Engineer, and MTS roles at companies working on hard systems problems. Also available for select consulting engagements.