Surya Teja Reddy Dwarampudi

SDE2 · SCALER AI LABS · Bengaluru, IN · available · 2026

Currently: building RL environments, authoring SWE-Bench tasks, wiring up data pipelines, shipping internal AI tools, shipping voice agents, tuning sub-ms search.

About

Engineer. Builder. Curious about how things scale.

I work on AI infrastructure, realtime systems, and the internal tools companies live inside. Started as an intern at Scaler in 2023; full-time soon after. Now at Scaler AI Labs — founding team, building RL environments, eval tasks (SWE-Bench, Terminal-Bench), data pipelines, and the AI tooling the research team runs on.

I care about the unglamorous parts: the call that answers in 800ms, the search that returns under a millisecond, the CRM your team doesn't dread opening. If a product feels effortless, someone did the hard work under the hood. That's the part I enjoy.

Outside work: a first-principles thinker, occasional hackathon winner, and Bengaluru-based optimist about the next decade of software.

50+ eng — org I build with
$10M+ — revenue generated
5.4 cr+ — annual costs cut
9.8/10 — cgpa · top 1%

Experience

Software Engineer II · Scaler AI Labs (current)

Oct 2025 to present · Bengaluru

RL environments for computer-use agents — high-fidelity GUI simulation so frontier models can learn to operate a desktop. Single-agent and multi-agent.
SWE-Bench & Terminal-Bench tasks: authoring and curating evals that grade agents on real software-engineering work.
Data pipelines turning messy real-world traces into training-grade data.
Internal AI productivity tools: the stack the research team uses day-to-day.
Partnering with 3+ global AI research labs to benchmark model performance. Mentoring junior engineers across workstreams.

Software Engineer I · Scaler by InterviewBit

Jul 2024 to Jan 2026 · Bengaluru

Real-time AI Voice Agent (IVR): low-latency calling platform on Gemini + Deepgram (ASR) + ElevenLabs (TTS) handling 15+ concurrent sales calls at <800ms latency.
Sales AI Simulator: internal training platform where AI agents role-play as prospects, so reps can practice pitches before real calls.
Automated Call Intelligence: compliance pipeline auditing 10K+ calls/month across sales and support. LLMs flag compliance issues and surface coaching insights.
CRM Cost Optimization: architected in-house CRM replacing LeadSquared. Cut annual costs by 90% (₹5.4Cr+) with 100% adoption in 4 months.
High-Scale Search: advanced filtering engine on 3M+ leads serving 100K+ queries/mo at sub-ms latency (ElasticSearch + tuned indexing).
Zero-Conflict RBAC across 12+ modules · scalable monorepo (Turbo + Next.js) · no-code workflow builder on React Flow · double-digit concurrent A/B tests.

Software Engineer Intern · Scaler by InterviewBit

Mar 2023 to Jul 2024 · Bengaluru

Needle-Mover Award: reduced mentor no-shows from 11% → 3% via a custom scheduling feature on Ruby on Rails + React.
Self-serve admin tools cut weekly support tickets by 60% (30→12), saving 100+ eng-hours/yr.
Optimized the Article page with lazy-loading + server-side pagination, serving 500K+ MAU with improved core web vitals.

Full-Stack Developer Intern · SCSET · Bennett University

Sep 2022 to Jan 2023 · Greater Noida

Built interactive admin dashboards in Django to visualize inter-school activity data.

Projects

01 · Real-time voice agent for sales calls

flagship · 2025

A low-latency calling platform that picks up, holds a real conversation, and books meetings. Built on Gemini + Deepgram + ElevenLabs, tuned for 15+ concurrent calls without sounding like an IVR.

Stack: Gemini, Deepgram, ElevenLabs, Node.js, Redis, WebRTC

Role: Lead engineer · architecture, latency, orchestration

<800 ms — avg latency
15+ — concurrent calls
24/7 — uptime

02 · In-house CRM that replaced a ₹5.4 Cr vendor

scale · 2024

Architected the in-house CRM that retired LeadSquared in 4 months. Zero-conflict RBAC across 12+ modules, sub-millisecond filtering on 3M+ leads, a no-code workflow builder, and double-digit concurrent A/B tests from day one.

Stack: Next.js, TurboRepo, ElasticSearch, PostgreSQL, React Flow, Kafka

Role: Frontend architecture · search infra · RBAC · workflow engine

90% — cost reduction
3M+ — leads indexed
100% — team adoption

03 · Call audit pipeline at 10K calls/month

compliance · 2024

Built the automated call-intelligence layer. Every sales and support call gets transcribed, classified, scored against a compliance policy, and fed back as a coaching nudge. LLMs catch the issues a manual QA pass tends to miss.

Stack: Python, FastAPI, OpenAI, AWS MSK, Postgres, OpenSearch

Role: Pipeline architecture · LLM prompting · review tooling

10K+ — calls/month
94% — precision flagging
3 d — review-to-prod loop

04 · Sales AI simulator: agents that role-play as leads

training · 2025

An internal training arena where AI agents role-play as buyers across personas, objections, and stalling tactics. Reps practice live, get scored, and walk into the real call already warmed up.

Stack: Gemini, TypeScript, Next.js, Postgres, pgvector

Role: End-to-end · prompts, eval harness, frontend

6 wks — rep ramp-up cut
40+ — persona library
5x — practice volume

05 · Founding work at Scaler AI Labs (current)

current · founding · 2025–

RL environments (single + multi-agent) for computer-use agents, SWE-Bench and Terminal-Bench eval tasks, data pipelines, and the internal AI tools the research team runs on. Founding team — started Oct '25 inside Scaler, workspace spun out Apr '26.

Stack: Python, TypeScript, Playwright, Docker, RL · PPO/GRPO

Role: IC SDE2 · mentor to junior eng across workstreams

50+ — engineers in org
3 — research partners
7 — workstreams

Now — Scaler AI Labs · founding team

Splitting the week across RL environments for computer-use agents, authoring SWE-Bench and Terminal-Bench eval tasks, the data pipelines that feed them, and the internal AI tools the research team relies on. Partnering with 3 global AI research labs — the bar keeps moving, which is the fun part.

reading: DeepMind's SIMA paper, again. This time for the eval harness.
learning: PPO tuning + process supervision for browser-use tasks.
listening to: A lot of Khruangbin while the GPUs spin.

Skills

Generative AI: Gemini, OpenAI, Anthropic, Computer-use agents, RL (PPO/GRPO), LangChain, Deepgram (ASR), ElevenLabs (TTS)
Languages: TypeScript, Python, JavaScript, Ruby, C++, SQL
Frameworks: Next.js, React, Node.js, FastAPI, Ruby on Rails, GraphQL
Libraries: Redux / RTK Query, React Flow, Prisma, Sentry, TurboRepo
Infrastructure: AWS, ECS / EC2 / RDS, OpenSearch / Elastic, MSK · Kafka, Docker, Jenkins, Redis
Patterns I love: Event-driven, RBAC / Zero-conflict, A/B + feature flags, Workflow engines
Learning next: Rust, native CUDA, iOS / Swift
Currently learning: Process supervision, Eval design, WebGPU