Rohit Raju, AI Engineer, researcher & creator

01 · Selected Work

Projects I'm proud of.

A tight selection across LLMs, statistical modeling, and AI for education. Each one shipped, with paper, code, or video.

01 LangGraph · Gemini · Root Cause Analysis

Sherlogs, log-only RCA for microservices

An agentic root cause analysis system that reads raw logs from a failing microservice stack and pinpoints which service caused the incident. Drain3 templating compresses ~85k log lines into ~3k patterns, loaded into DuckDB, then a LangGraph agent with bounded investigation tools follows the error chain to the origin. 92% top-1 accuracy on the RCAEval RE3 benchmark across 90 incidents and 3 systems.

GitHub
02 Snowflake · NL-to-SQL · Agentic

Census Query Agent

A chat-based agent that converts natural language questions about US demographics into validated SQL, runs them against ACS 5-year census data on Snowflake, and returns readable answers with follow-up memory. Self-correcting SQL pipeline (up to 5 retries) and multi-layer guardrails for safety.

Live app GitHub
03 99P Labs / Honda Capstone · GPT-4o · 2025

DataFlow Architect, Data Science as a Service

Capstone with 99P Labs (Honda's R&D innovation arm). An AI-driven system using GPT-4o for automated dataset profiling, EDA generation, and ML use-case synthesis, cutting manual workflow design time by 60%. Prompt-driven, dockerized reporting with latency-cost benchmarking across LLMs and on-demand PDF export.

GitHub
04 UC Berkeley AI Hackathon · Finalist · 2024

TaleTutor, classroom PDFs as narrative learning

A GPT + LangChain RAG system that turns classroom PDFs into narrative-driven, zero-shot responses, so learning feels like a story, not a search. Finalist at the UC Berkeley AI Hackathon.

Devpost GitHub
05 EdTech · Evaluation

Auto Semantic & Syntactic Grader

An evaluator that grades student code on both semantic correctness and syntactic structure, cutting TA grading time while keeping feedback meaningful.

GitHub Walkthrough

02 · Writing

Peer-reviewed publications.

Work at the intersection of language models, error correction, and collaborative learning.

IEEE I2CT2023

A System for Enhancing Accuracy of Noisy Text Using Deep Network Language Models

BART & MarianMT applied to OCR outputs, 35% reduction in word-error rate.
Read paper ↗
IEEE I2CT2024

Comparative Study on Synthetic and Natural Error Analysis with BART & MarianMT

Identifies a 26% break-even point where synthetic errors begin to mislead evaluation.
Read paper ↗
JIKM (World Scientific)2024

Grammatical vs. Spelling Error Correction: Responsiveness of Transformer Language Models

BART outperforms MarianMT on spelling correction by 24.6%; behavior diverges on grammar.
Read paper ↗
EDM2025

Improving the Generalizability of Models of Collaborative Discourse

Cross-dataset generalization for classifiers of student talk in collaborative learning.
Read paper ↗
ISLS2025 · Accepted

Facilitating Productive Uncertainty in Small-Group Jigsaw Activities

How AI-driven feedback can foster productive uncertainty in small-group learning.
Accepted

03 · About

Why I do this.

I'm an AI Engineer and ML Researcher focused on agentic LLM systems. Lately that means real merchant workflows: support, analytics automation, and campaign optimization. The hard part is reliability and scale, not a slick demo.

At Per Diem (YC W21), I designed and deployed a production-grade multi-agent platform on Amazon Bedrock, built on agent orchestration, RAG, and NL-to-SQL, now used by 500+ retail stores. It helped drive 30% weekly order growth and up to 70% retention across stores running on it.

Through the NSF AI Institute for Student-AI Teaming at CU Boulder, I work on NLP and collaborative learning systems. Five AI papers total: two with the institute, plus three first-author papers from undergrad spanning transformer-based models, interpretability, and applied AI in education.

What I'm actually chasing is AI for education. Not chatbot tutors, but personalized narrative, learning that adapts to the student in front of it and eventually lets them rewrite their own story. That's the long bet.

Experience

Jul 2025 - Apr 2026

AI Engineer

Per Diem (YC W21) · New York, USA

Architected a production-grade multi-agent AI system on Amazon Bedrock with serverless infrastructure, safety boundaries, and secure deployment pipelines, now live across 450+ retail stores, driving 30% weekly order growth and up to 70% retention through AI-powered campaign optimization.

Built the agent orchestration layer (Bedrock Agents + few-shot prompting) to route natural-language queries across Support, Analytics, and Marketing agents. Implemented a RAG-powered support workflow on Pinecone + Bedrock Knowledge Bases. Designed an NL-to-SQL analytics pipeline via tool calling (schema retrieval + read-only execution). Shipped an autonomous Marketing AI agent that recommends and deploys margin-aware, time-optimized campaigns.
Oct 2023 - Apr 2025

Machine Learning Research Assistant

NSF AI Institute for Student-AI Teaming · Institute of Cognitive Science, CU Boulder

Built classification models that detect collaborative-discourse patterns in student conversations, enabling real-time feedback in K-12 classrooms. Trained and evaluated 6 Mistral-based variants (Few-shot, LoRA, Mistral+SVM) across four diverse datasets spanning human-coded and ASR-transcribed data, 14% average AUROC improvement in cross-domain generalization over a RoBERTa baseline.

Integrated LIME-based interpretability into evaluation pipelines to analyze token-level feature contributions. Contributed to an AI moderation agent for K-12 classrooms using zero-shot prompting on AWS LLaMA 70B with safety guardrails.
Nov 2022 - Aug 2023

PLC Simulations Systems Designer (Intern)

BOSCH Research

Designed and implemented the SimuBridge backend in C# to emulate PLCs and feed clean data into DeviceBridge. Integrated with the OPC server, accelerating DeviceBridge stress testing by 50%.

Education MS Data Science, University of Colorado Boulder, GPA 4.0/4.0 (2023-2025) B.Tech Computer Science, Amrita School of Computing, GPA 8.68/10 (2019-2023)

04 · Kind Words

What people say.

"Rohit served as a teaching assistant for my Neural Networks and Deep Learning course, preparing a hands-on TensorFlow session, managing research volunteering for a large student group, and mentoring two project teams. He's confident, self-disciplined, and a committed leader with strong time management and communication skills."

Dr. Peeta Basa Pati Professor, Amrita University Bangalore

"Henri Frederic Amiel could have had Rohit in mind when he said, 'Doing easily what others find difficult is talent; doing with talent what is impossible is genius.' Amrita School of Engineering feels blessed to nurture talented students like him."

Rashmi Verma Alumni Relations Office, Amrita University Bangalore

05 · YouTube

Off the clock, as FROST tube.

Outside of work I run a vlog channel about everyday life, travel, and the odd small story. Different craft from the day job, but the same itch to make things.

10,194 Subscribers

1.7M+ Total views

108 Videos shipped

Visit the channel

06 · Contact

Let's build something.

Research collaborations, AI engineering, or just a conversation about where this is all going. My inbox is open.

rohit.raju@colorado.edu

Curious by default. Shipping by habit.

Sherlogs, log-only RCA for microservices

Census Query Agent

DataFlow Architect, Data Science as a Service

TaleTutor, classroom PDFs as narrative learning

Auto Semantic & Syntactic Grader

A System for Enhancing Accuracy of Noisy Text Using Deep Network Language Models

Comparative Study on Synthetic and Natural Error Analysis with BART & MarianMT

Grammatical vs. Spelling Error Correction: Responsiveness of Transformer Language Models

Improving the Generalizability of Models of Collaborative Discourse

Facilitating Productive Uncertainty in Small-Group Jigsaw Activities

Experience

AI Engineer

Machine Learning Research Assistant

PLC Simulations Systems Designer (Intern)

Let's build something.

Curious by default.
Shipping by habit.