Shubhankit Singh

Founder · Researcher · SF Bay Area

These days I work mostly on deep RL and distributed AI infrastructure.

I like math, machine learning, and computer architecture / science. I love working at the intersection of these subjects and real-world domains like Deep RL, Quant Finance, Cognitive Neuroscience, and Sci-ML.

If you have an idea and want to collaborate, drop me an email at shubhankitsingh@researchcommons.ai.

On the personal side, I have a pillow cushion named Niko, who also seems to bark.

Currently: Lanturn (Co-Founder, CTO) · Research Commons (Founder)
Previously: Microsoft · Flipkart · Kreaitor · Invsto · Piramal
Research: MSML 2025 · 3 papers targeting NeurIPS 2026 · ASD-Bench · Lumbar MRI
Education: MSc Financial Engineering, WorldQuant University · B.E. Computer Science, BITS Pilani
Credentials: CFA Level 1 · WorldQuant Gold · GRE 327/340 (Quant: full marks)

Based: SF Bay Area
Email: shubhankitsingh@researchcommons.ai
GitHub: github.com/shoobiedoo
LinkedIn: linkedin.com/in/shubh101
HuggingFace: huggingface.co/rescommons
X: x.com/shoobiedoo313

02 Experience

7 roles · current first

Sep 2025 – Present · Remote

Co-Founder & CTO —

Lanturn

Building a Behavioural Data Platform with a granular capture layer baked into native desktop applications, automated RL environment generation, synthetic data generation, and end-to-end post-training signal pipelines covering both the observation and action layers.

Capture modalities span browser, desktop, CAD / 3-D, and domain-specific application data.
Generating complete RLHF and long-horizon RL environments with verifiers from expert demonstrations — targeting labs and enterprises.
Received acqui-hire and licensing offers from two of the largest data vendors in the space.

Feb 2025 – Present · Bengaluru

Founder —

Research Commons

Building an end-to-end research suite — a unified control plane (Tensile) for distributed training, fine-tuning (SFT, DPO, ORPO, GRPO++), inference, and agentic workflows. The core framework powers both internal research and external client engagements.

Bootstrapped with $200K of personal capital; grew to six-figure revenue through client partnerships.
Client partnerships: Finance firms, major neo-clouds, and datacenters — helping them train and fine-tune agents with custom SFT pipelines, reward modelling, evaluation harnesses, and inference optimization on the Tensile platform (e.g. Aion).
Research: Deliberately broad. Broke down and reimplemented SOTA AI techniques from scratch, spanning distributed systems, post-training, inference, and agents.
Shipped speculative decoding models (EAGLE3, Arctic speculators for Llama 3.2) and 7 open datasets (44K+ rows, 40+ likes) on HuggingFace.
Built a SOTA programmatic PDF parser benchmarked against AI and commercial pipelines — 10–20× faster at zero API cost on born-digital formats.
Scaled Research Commons + MathCommons pages to a combined 20K followers.

Jul 2022 – Jan 2025 · Bangalore

Tech Lead —

Microsoft

Led Bluetooth HAL work under Silicon Graphics & Media; founding engineer of the Sigma Bluetooth India team.
Owned engineering for Office Android, Office iOS, and Universal Print integration into M365 mobile.
Built Networking RAGs to enhance developer productivity; instructor at Microsoft AI School, teaching LLM theory and tooling.
Authored a proof-of-concept for running Bluetooth Android applications on Windows via a Linux kernel, aimed at a potential user base in the billions across gaming, hearing aids, and adjacent verticals.

Jun 2024 – Jan 2025 · Remote · Part-time

Head of Engineering —

Kreaitor (Web3 startup)

Trained diffusion models and built social-media AI agents.
Re-architected the platform: more robust, more secure, AI-ready.
Drove the B2C → B2B pivot; aligned the product to a steady five-figure MRR.

Jun 2023 – Dec 2024 · Pennsylvania · Part-time

CTO & Board Member · Invsto

Built an end-to-end retail trading platform — high-throughput Strategy Engine, Order Management System, data infra, and frontend for a quant-sciences firm focused on LFT algorithms.

Performance: Numba / JIT compilation and Cython-based optimizations on the critical path; multi-threaded execution for concurrent strategy evaluation and order routing.
Integrations: Worked with dYdX, Polygon, and Rootstock for exchange connectivity, market-data feeds, and on-chain settlement.
Shipped Release 1.0; continues to serve on the board.

Aug 2020 – Jun 2022 · Bangalore

Software Engineer —

Flipkart (India's biggest startup to date)

Built multiple seller-side microservices using state machines, Kubernetes, and Elasticsearch. Scrum master for the team.
Expiry Workflow: Activiti (BPMN) workflow making non-expirable invoices expirable — fakes ↓ 10%, seller NPS ↑ 20%.
Actioning Service: auto-delist and relist on brand or vertical regulation changes — 40 hrs/week of ops bandwidth saved.
Kubernetes Stateful Onboarding: proof-of-concept for the Elasticsearch Operator on Kubernetes.

Jan 2020 – Jun 2020 · Mumbai

Data Science Intern —

Piramal Financial Services

Banking Chatbot: deployed IBM Watson on a Node.js server for NLP and configurable dialogue flow — 24×7 customer service.
Aadhaar Masking (OCR): masking of the first eight digits per new government norms using Tesseract OCR and OpenCV.

03 Papers

6 entries

2025 · Naples, Italy

A Noise Taxonomy for Bayesian Neural ODEs —

MSML 2025 · Poster

When Does Posterior Calibration Survive on Lotka-Volterra Dynamics? Proposes a noise taxonomy (Gaussian, heavy-tailed Student-t, sparse impulses, regime-switching heteroscedastic) for Bayesian Neural ODEs on ecological systems. Shows 90% CI coverage above ~83% on both states, with regime-switching yielding localised overconfidence despite aggregate calibration.
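As a rough illustration of the calibration check described above, the sketch below computes empirical interval coverage in aggregate and per window. It is a minimal sketch, not the paper's code: the function names, the window size, and the toy numbers are all illustrative assumptions.

```python
def empirical_coverage(lower, upper, observed):
    """Fraction of observations that fall inside their predicted interval."""
    inside = sum(lo <= y <= hi for lo, hi, y in zip(lower, upper, observed))
    return inside / len(observed)


def windowed_coverage(lower, upper, observed, window):
    """Coverage over consecutive, non-overlapping windows of the trajectory."""
    return [
        empirical_coverage(
            lower[start:start + window],
            upper[start:start + window],
            observed[start:start + window],
        )
        for start in range(0, len(observed), window)
    ]


# Toy data (hypothetical): ten time steps, every interval is [0, 1].
lower = [0.0] * 10
upper = [1.0] * 10
# The second half contains two observations that escape the interval.
observed = [0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 1.5, 0.5, 1.5, 0.5]

aggregate = empirical_coverage(lower, upper, observed)
per_window = windowed_coverage(lower, upper, observed, window=5)
```

Comparing `aggregate` with `per_window` is one simple way to see how a trajectory can look reasonably calibrated overall while one stretch of it is overconfident.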

PDF Poster Code venue

MSML · 2025

2026 · Research Commons · arXiv

ASD-Bench: Four-Axis Benchmark for Autism Spectrum Disorder — Medical AI

Comprehensive benchmark on 4,068 AQ-10 records across three age cohorts, 17 model configurations (classical ML, MLP, deep tabular transformers, TabPFN v2) evaluated on four axes: predictive performance, calibration, interpretability, and adversarial robustness. Introduces the Heuristic Aggregate Penalty metric.

arXiv PDF Code

ASD · 2025

2025 · Research Commons

Keypoint-Guided Lumbar Spine Severity Classification — Medical Imaging

Two-phase deep learning pipeline on RSNA MRI data: Phase I EfficientNet-B4 keypoints on sagittal T2/STIR and T1; Phase II ResNet50d on keypoint-guided patches with focal loss. Achieves ~87.7% SCS accuracy and ~80–82% NFN accuracy for automated spinal condition screening.

PDF Code

Lumbar · 2025

2026 · NeurIPS E&D

Document Parsing Benchmark & SOTA Programmatic PDF Parser — RAG Infrastructure

Built a SOTA programmatic PDF parser and benchmarked it against AI and commercial pipelines across PDF, DOCX, PPTX, HTML, and LaTeX. Demonstrates that schema-driven extraction can saturate on born-digital formats, with rankings inverting by format and element type. Targeting NeurIPS 2026 Empirical & Data track.

PDF Code

NeurIPS · 2026

2026 · NeurIPS E&D

OmniChunk: Cross-Format RAG Chunking Benchmark — RAG Infrastructure

Intrinsic chunking benchmark isolating chunking from QA confounds: 19,845 files, 30 format–scenario combinations, and 16 strategies (LangChain, LlamaIndex, Chonkie). Finds that winners are format- and scenario-specific, with no single strategy dominating. Proposes routing guidelines for enterprise RAG pipelines.

PDF Code

NeurIPS · 2026

2026 · NeurIPS E&D

FormatFlux: Programmatic vs AI/Commercial Parsing & Chunking — RAG Infrastructure

Combined parsing + chunking benchmark with five hypotheses. Demonstrates that the SOTA programmatic parser is 10–20× faster at zero API cost vs commercial pipelines on born-digital formats. Shows parser×chunker interaction effects, and that format-conditioned routing outperforms universal pipelines.

PDF Code

NeurIPS · 2026

04 Projects

7 entries

2025 – Present · Lanturn

Lanturn Capture Platform —

Behavioural data capture & post-training signal platform

Desktop Capture Agent (Python): Local HTTP server (aiohttp), OS-level input & screenshot capture (pynput, mss), network monitor, SQLite persistence, pywebview dashboard, pause/resume, batch REST import to cloud. Packaged via PyInstaller (DMG / EXE).
Plugin Monorepo (v2.7): Unified event schemas; plugins for Chrome/Edge/Firefox (MV3 extension, React 19), VS Code (lanturn-code-capture), Excel (xlwings + Office.js), CAD (AutoCAD/PyRx, Siemens NX, SolidWorks C# add-in).
Cloud Backend: FastAPI + SQLModel + PostgreSQL microservices — API gateway, browser & desktop ingestion, event & screenshot writers, workflow workers (LLM-powered). Redis, GCP Pub/Sub, GCS, Qdrant; Alembic migrations, OpenTelemetry, Sentry.
Dashboard (Next.js 15 / React 19): Operator UI for sessions, mined workflows, training datasets, eval rubrics, data evaluations, campaigns, org management. TanStack Query, Tailwind 4, React Aria, Recharts.
Data & Eval: Short-horizon and long-horizon training dataset pipelines; golden-reference, pairwise, and score-based eval rubrics; automated RL environment generation from expert demonstrations.
Stack — Python, TypeScript, Go, C#, React 19, Next.js 15, FastAPI, PostgreSQL, Redis, GCP Pub/Sub, GCS, Qdrant, PyInstaller, esbuild.

2025 – Present · Research Commons

Tensile Suite —

End-to-end ML infrastructure platform for distributed training, inference & evaluation

Control Plane: FastAPI + Temporal + PostgreSQL service managing multi-cluster job orchestration, real-time WebSocket streaming (LISTEN/NOTIFY), and multi-cloud provisioning (GKE, Nebius).
Cluster Agents (Go): Custom Kubernetes operator (TrainingJob CRD → Volcano gang scheduling), cluster-watcher (informer → Postgres NOTIFY), RBAC controller (team-scoped isolation).
Tensile-Train: SFT / DPO / ORPO / RL (PPO, GRPO++) pipelines with async checkpointing, Nydus image acceleration, preemption recovery, and heartbeat liveness.
Tensile-Infer: Multi-engine serving (vLLM, SGLang, TensorRT) with KEDA autoscaling, speculative decoding, disaggregated prefill/decode.
Tensile-Agents: Financial, browser/computer-use, and API-MCP agents; modular enterprise RAG with RAGBench evaluation.
Stack — Python, Go, TypeScript, React 19, PyTorch, DeepSpeed, FSDP, Volcano, Helm, PostgreSQL, Temporal.

2025 – Present · Research Commons

MathCommons —

EdTech product

Mathematics education platform under Research Commons with full product, GTM, design, and community strategy.
Contributes to the combined 20K-follower community footprint.

2025 · Research Commons

Enrichment Metrics Pipeline — E2E data infrastructure

End-to-end pipeline: Azkaban → Python → FastAPI → GCS → BigQuery external tables → Grafana.
Cost-aware migration from Postgres; partitioned JSON contract with on-call playbook.

2024 – 2025 · Research Commons

ML Systems from Scratch — github.com/Research-Commons

cpptensor: C++ tensor operations library.
cppgrad: C++ autograd engine with reverse-mode AD.
cppnet: C++ neural network library built on top.
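For readers unfamiliar with reverse-mode AD, the idea behind an autograd engine can be sketched in a few lines. This is an illustrative Python sketch only, not the cppgrad API: the `Value` class and its methods are hypothetical names, and only addition and multiplication are supported.

```python
class Value:
    """Scalar node in a computation graph that records how to backpropagate."""

    def __init__(self, data, parents=()):
        self.data = float(data)
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None  # leaf nodes have nothing to propagate

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))

        def backward():
            # d(a + b)/da = 1 and d(a + b)/db = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))

        def backward():
            # d(a * b)/da = b and d(a * b)/db = a
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = backward
        return out

    def backward(self):
        """Topologically sort the graph, then apply the chain rule output-to-inputs."""
        order, seen = [], set()

        def visit(node):
            if node not in seen:
                seen.add(node)
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0  # seed: derivative of the output with respect to itself
        for node in reversed(order):
            node._backward()


x, y = Value(2.0), Value(3.0)
z = x * y + x  # analytically: dz/dx = y + 1, dz/dy = x
z.backward()
```

After `z.backward()`, `x.grad` and `y.grad` hold the partial derivatives of `z`. Gradients are accumulated with `+=` so that a value used more than once in the expression receives the sum of its contributions.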

Aug 2022 – Oct 2022 · Baruch College

Options Pricer —

Baruch College · PDE / Monte-Carlo solver in C++

PDE solver for exact European put / call pricing under Black-Scholes; perpetual American options; exact solutions for the Greeks.
Monte-Carlo simulations approximating put / call prices and their sensitivity to expiry and simulation count.
Compared exact solutions against advanced finite-difference and finite-element methods.
Stack — C++, Boost, Black-Scholes, Monte Carlo, FDM, FEM, stochastic calculus.
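The comparison between exact and simulated prices can be illustrated with a short sketch. The project itself is in C++; the Python below is an assumption-laden illustration, with hypothetical function names, a non-dividend-paying underlying, and a plain (no variance reduction) Monte-Carlo estimator.

```python
import math
import random


def norm_cdf(x: float) -> float:
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def black_scholes(spot, strike, rate, vol, expiry, kind="call"):
    """Closed-form Black-Scholes price of a European option (no dividends)."""
    sqrt_t = math.sqrt(expiry)
    d1 = (math.log(spot / strike) + (rate + 0.5 * vol ** 2) * expiry) / (vol * sqrt_t)
    d2 = d1 - vol * sqrt_t
    discount = math.exp(-rate * expiry)
    if kind == "call":
        return spot * norm_cdf(d1) - strike * discount * norm_cdf(d2)
    return strike * discount * norm_cdf(-d2) - spot * norm_cdf(-d1)


def monte_carlo(spot, strike, rate, vol, expiry, kind="call", n_paths=100_000, seed=0):
    """Simulate terminal prices under risk-neutral GBM and discount the mean payoff."""
    rng = random.Random(seed)
    drift = (rate - 0.5 * vol ** 2) * expiry
    diffusion = vol * math.sqrt(expiry)
    total = 0.0
    for _ in range(n_paths):
        terminal = spot * math.exp(drift + diffusion * rng.gauss(0.0, 1.0))
        payoff = terminal - strike if kind == "call" else strike - terminal
        total += max(payoff, 0.0)
    return math.exp(-rate * expiry) * total / n_paths


exact_call = black_scholes(100.0, 100.0, 0.05, 0.2, 1.0, "call")
approx_call = monte_carlo(100.0, 100.0, 0.05, 0.2, 1.0, "call", n_paths=50_000)
```

For these inputs the closed form gives a call price of roughly 10.45. The Monte-Carlo estimate approaches it as `n_paths` grows, with an error that shrinks on the order of one over the square root of the path count.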

repo writeup

Jan 2019 – Aug 2019 · BITS Pilani

MATLAB Compiler —

BITS Pilani · C++ static analyzer

MATLAB compiler in C++ with a static code analyzer built on unordered maps, equivalence relations, and various hashing techniques.
Stack — C++, Boost, dynamic and structural equivalence, compiler construction.

repo

05 Education

2 entries

Oct 2022 – Jan 2025 · Virtual

M.Sc. Financial Engineering —

WorldQuant University

Merit-based admission offered by WorldQuant, a quantitative hedge fund. 100% tuition-free.
Coursework: Financial Markets, Econometrics, Derivative Pricing, Stochastic Modelling, ML in Finance, Portfolio & Wealth Management.

Aug 2016 – Aug 2020 · Hyderabad

B.E. Computer Science —

BITS Pilani

Graduated with First Division.

06 Certifications

7 entries

Mar 2024

UC Berkeley MFE — pre-program: Mathematics, Statistics, Python.

Pass

Sep 2023

WorldQuant Gold Certification: market-neutral long-short alpha construction.

Pass

Dec 2023

Akuna Capital — Options 101.

Pass

Feb 2023

CFA Level 1 — Chartered Financial Analyst.

Pass

Oct 2022

Baruch College — C++ Primer for Financial Engineering.

88%

Jun 2022

Wharton — Fundamentals of Quantitative Modelling.

83%

May 2022

QuantInsti — Algorithmic Trading; CPD-accredited, UK.

98%

07 Recognition

8 entries

2024

Flow Traders · E-house Day Trading: 1st place; passed the math test for a trading role.

India

2022

Alfalgo · CTO: architected a stealth algo-trading platform; $100K pre-seed offer at 5% from First Cheque.

India

2022

Walmart · Instant Karma Award — for handling complex brand-regulation features and mentoring junior engineers at Flipkart.

India

2021

GRE & TOEFL — 327/340 (full marks in Quantitative); 115/120 on TOEFL.

India

2021

Walmart · Best Team Award — for business excellence saving USD 5M / annum.

India

2020

National University of Singapore — selected for the Global Academic Internship Programme.

India

2020

Codechef — global rank 322 in Lunchtime; qualified for Google Code Jam.

India

2017

Government of India · Certificate of Appreciation — with the state's Additional Director General of Police on dispatch-time reduction for police vehicles.

India

08 Volunteering

3 entries

2018

UP 100 · Police Emergency Management System, developed with senior police and cyber-security officials: PRV dispatch-time reduction, IP tracker against DDoS, HRMS with Gantt-chart on-call rosters.

India

2022

Child Rights and You — designed written and spoken English curriculum for underprivileged children.

India

2022

Microsoft · Giving Champion — raised approximately ₹3,00,000 for the organisation.

India