Shubhankit Singh

Shubhankit Singh

Founder · Researcher · SF Bay Area

These days I work the most on deep-RL and distributed AI infrastructure.

I like math, machine learning and computer architecture / science. I love working on the intersection of these subjects along with real-world domains like Deep RL, Quant Finance, Cognitive Neuroscience, and Sci-ML.

If you have an idea and want to collaborate, drop an email at shubhankitsingh@researchcommons.ai.

Personally — have a pillow cushion named Niko, who also seems to bark.

Currently
Lanturn (Co-Founder, CTO) · Research Commons (Founder)
Previously
Microsoft · Flipkart · Kreaitor · Invsto · Piramal
Research
MSML 2025 · 3 papers targeting NeurIPS 2026 · ASD-Bench · Lumbar MRI
Education
MSc Financial Engineering, Worldquant · B.E. Computer Science, BITS Pilani
Credentials
CFA Level 1 · Worldquant Gold · GRE 327/340 (Quant: full marks)
Based
SF Bay Area
Email
shubhankitsingh@researchcommons.ai
GitHub
github.com/shoobiedoo
LinkedIn
linkedin.com/in/shubh101
HuggingFace
huggingface.co/rescommons
X
x.com/shoobiedoo313

02Experience

7 roles · current first
Sep 2025 — PresentRemote
Co-Founder & CTO Lanturn

Building a Behavioural Data Platform with a granular capture layer baked into native desktop applications, automated RL environment generation, synthetic data generation, and end-to-end post-training signal pipelines focused on both the observation and action layer.

  • Capture modalities span browser, desktop, CAD / 3-D, and domain-specific application data.
  • Generating complete RLHF and long-horizon RL environments with verifiers from expert demonstrations — targeting labs and enterprises.
  • Received acqui-hire and licensing offers from 2 of the biggest data vendors in the space.
Feb 2025 — PresentBengaluru
Founder Research Commons

Building an end-to-end research suite — a unified control plane (Tensile) for distributed training, fine-tuning (SFT, DPO, ORPO, GRPO++), inference, and agentic workflows. The core framework powers both internal research and external client engagements.

  • Bootstrapped with $200K of personal capital; grew to six-figure revenue through client partnerships.
  • Client partnerships: Finance firms, major neo-clouds, and datacenters — helping them train and fine-tune agents with custom SFT pipelines, reward modelling, evaluation harnesses, and inference optimization on the Tensile platform (e.g. Aion).
  • Research: Extremely horizontal — broke down and reverse-engineered all SOTA AI techniques from scratch, spanning distributed systems, post-training, inference, and agents.
  • Shipped speculative decoding models (EAGLE3, Arctic speculators for Llama 3.2) and 7 open datasets (44K+ rows, 40+ likes) on HuggingFace.
  • Built a SOTA programmatic PDF parser benchmarked against AI and commercial pipelines — 10–20× faster at zero API cost on born-digital formats.
  • Scaled Research Commons + MathCommons pages to a combined 20K followers.
Jul 2022 — Jan 2025Bangalore
Tech Lead Microsoft
  • Led Bluetooth HAL work under Silicon Graphics & Media; founding engineer of the Sigma Bluetooth India team.
  • Owned engineering for Office Android, Office iOS, and Universal Print integration into M365 mobile.
  • Built Networking RAGs to enhance developer productivity; instructor at Microsoft AI School, teaching LLM theory and tooling.
  • Authored a proof-of-concept for billions of users running Bluetooth Android applications over Windows via a Linux kernel — covering gaming, hearing aids, and adjacent verticals.
Jun 2024 — Jan 2025Remote · Part-time
Head of Engineering Kreaitor (Web3 startup)
  • Trained diffusion models and built social-media AI agents.
  • Re-architected the platform: more robust, more secure, AI-ready.
  • Drove the B2C → B2B pivot; aligned the product to a steady five-figure MRR.
Jun 2023 — Dec 2024Pennsylvania · Part-time
CTO & Board Member Invsto

Built an end-to-end retail trading platform — high-throughput Strategy Engine, Order Management System, data infra, and frontend for a quant-sciences firm focused on LFT algorithms.

  • Performance: Numba / JIT compilation and Cython-based optimizations on the critical path; multi-threaded execution for concurrent strategy evaluation and order routing.
  • Integrations: Worked with dYdX, Polygon, and Rootstock for exchange connectivity, market-data feeds, and on-chain settlement.
  • Shipped Release 1.0; continues to serve on the board.
Aug 2020 — Jun 2022Bangalore
Software Engineer Flipkart (India's biggest startup to date)
  • Multiple microservices on the seller side — state machines, Kubernetes, ElasticSearch. SCRUM master for the team.
  • Expiry Workflow: Activiti (BPMN) workflow making non-expirable invoices expirable — fakes ↓ 10%, seller NPS ↑ 20%.
  • Actioning Service: auto-delist and relist on brand or vertical regulation changes — 40 hrs/week of ops bandwidth saved.
  • Kubernetes Stateful Onboarding: proof-of-concept for the ElasticSearch Operator on Kubernetes.
Jan 2020 — Jun 2020Mumbai
Data Science Intern Piramal Financial Services
  • Banking Chatbot: deployed IBM Watson on a Node.js server for NLP and configurable dialogue flow — 24×7 customer service.
  • Aadhar Masking (OCR): first-eight-digit masking per new government norms using Tesseract OCR and OpenCV.

03Papers

6 entries
2025Naples, Italy
A Noise Taxonomy for Bayesian Neural ODEs MSML 2025 · Poster

When Does Posterior Calibration Survive on Lotka-Volterra Dynamics? Proposes a noise taxonomy (Gaussian, heavy-tailed Student-t, sparse impulses, regime-switching heteroscedastic) for Bayesian Neural ODEs on ecological systems. Shows 90% CI coverage above ~83% on both states, with regime-switching yielding localised overconfidence despite aggregate calibration.

MSML · 2025
2026Research Commons · arXiv
ASD-Bench: Four-Axis Benchmark for Autism Spectrum Disorder — Medical AI

Comprehensive benchmark on 4,068 AQ-10 records across three age cohorts, 17 model configurations (classical ML, MLP, deep tabular transformers, TabPFN v2) evaluated on four axes: predictive performance, calibration, interpretability, and adversarial robustness. Introduces the Heuristic Aggregate Penalty metric.

ASD · 2025
2025Research Commons
Keypoint-Guided Lumbar Spine Severity Classification — Medical Imaging

Two-phase deep learning pipeline on RSNA MRI data: Phase I EfficientNet-B4 keypoints on sagittal T2/STIR and T1; Phase II ResNet50d on keypoint-guided patches with focal loss. Achieves ~87.7% SCS accuracy and ~80–82% NFN accuracy for automated spinal condition screening.

Lumbar · 2025
2026NeurIPS E&D
Document Parsing Benchmark & SOTA Programmatic PDF Parser — RAG Infrastructure

Built a SOTA programmatic PDF parser and benchmarked it against AI and commercial pipelines across PDF, DOCX, PPTX, HTML, and LaTeX. Demonstrates that schema-driven extraction can saturate on born-digital formats, with rankings inverting by format and element type. Targeting NeurIPS 2026 Empirical & Data track.

NeurIPS · 2026
2026NeurIPS E&D
OmniChunk: Cross-Format RAG Chunking Benchmark — RAG Infrastructure

Intrinsic chunking benchmark isolating chunking from QA confounds: ~19,845 files, 30 format–scenario combinations, and 16 strategies (LangChain, LlamaIndex, Chonkie). Finds that winners are format- and scenario-specific — no single strategy dominates. Proposes routing guidelines for enterprise RAG pipelines.

NeurIPS · 2026
2026NeurIPS E&D
FormatFlux: Programmatic vs AI/Commercial Parsing & Chunking — RAG Infrastructure

Combined parsing + chunking benchmark with five hypotheses. Demonstrates the SOTA programmatic parser is 10–20× faster at zero API cost vs commercial pipelines on born-digital formats. Shows parser×chunker interaction effects and format-conditioned routing outperforms universal pipelines.

NeurIPS · 2026

04Projects

7 entries
2025 — PresentLanturn
Lanturn Capture Platform Behavioural data capture & post-training signal platform
  • Desktop Capture Agent (Python): Local HTTP server (aiohttp), OS-level input & screenshot capture (pynput, mss), network monitor, SQLite persistence, pywebview dashboard, pause/resume, batch REST import to cloud. Packaged via PyInstaller (DMG / EXE).
  • Plugin Monorepo (v2.7): Unified event schemas; plugins for Chrome/Edge/Firefox (MV3 extension, React 19), VS Code (lanturn-code-capture), Excel (xlwings + Office.js), CAD (AutoCAD/PyRx, Siemens NX, SolidWorks C# add-in).
  • Cloud Backend: FastAPI + SQLModel + PostgreSQL microservices — API gateway, browser & desktop ingestion, event & screenshot writers, workflow workers (LLM-powered). Redis, GCP Pub/Sub, GCS, Qdrant; Alembic migrations, OpenTelemetry, Sentry.
  • Dashboard (Next.js 15 / React 19): Operator UI for sessions, mined workflows, training datasets, eval rubrics, data evaluations, campaigns, org management. TanStack Query, Tailwind 4, React Aria, Recharts.
  • Data & Eval: Short-horizon and long-horizon training dataset pipelines; golden-reference, pairwise, and score-based eval rubrics; automated RL environment generation from expert demonstrations.
  • Stack — Python, TypeScript, Go, C#, React 19, Next.js 15, FastAPI, PostgreSQL, Redis, GCP Pub/Sub, GCS, Qdrant, PyInstaller, esbuild.
2025 — PresentResearch Commons
Tensile Suite End-to-end ML infrastructure platform for distributed training, inference & evaluation
  • Control Plane: FastAPI + Temporal + PostgreSQL service managing multi-cluster job orchestration, real-time WebSocket streaming (LISTEN/NOTIFY), and multi-cloud provisioning (GKE, Nebius).
  • Cluster Agents (Go): Custom Kubernetes operator (TrainingJob CRD → Volcano gang scheduling), cluster-watcher (informer → Postgres NOTIFY), RBAC controller (team-scoped isolation).
  • Tensile-Train: SFT / DPO / ORPO / RL (PPO, GRPO++) pipelines with async checkpointing, Nydus image acceleration, preemption recovery, and heartbeat liveness.
  • Tensile-Infer: Multi-engine serving (vLLM, SGLang, TensorRT) with KEDA autoscaling, speculative decoding, disaggregated prefill/decode.
  • Tensile-Agents: Financial, browser/computer-use, and API-MCP agents; modular enterprise RAG with RAGBench evaluation.
  • Stack — Python, Go, TypeScript, React 19, PyTorch, DeepSpeed, FSDP, Volcano, Helm, PostgreSQL, Temporal.
2025 — PresentResearch Commons
MathCommons EdTech product
  • Mathematics education platform under Research Commons with full product, GTM, design, and community strategy.
  • Part of the combined 20K followers community footprint.
2025Research Commons
Enrichment Metrics Pipeline — E2E data infrastructure
  • End-to-end pipeline: Azkaban → Python → FastAPI → GCS → BigQuery external tables → Grafana.
  • Cost-aware migration from Postgres; partitioned JSON contract with on-call playbook.
2024 — 2025Research Commons
ML Systems from Scratch github.com/Research-Commons
  • cpptensor: C++ tensor operations library.
  • cppgrad: C++ autograd engine with reverse-mode AD.
  • cppnet: C++ neural network library built on top.
Aug 2022 — Oct 2022Baruch College
Options Pricer Baruch College · PDE / Monte-Carlo solver in C++
  • PDE solver for exact European put / call pricing under Black-Scholes; perpetual American options; exact solutions for the Greeks.
  • Monte-Carlo simulations approximating put / call prices and their reactions to expiry and simulation count.
  • Compared exact solutions against advanced finite-difference and finite-element methods.
  • Stack — C++, Boost, Black-Scholes, Monte Carlo, FDM, FEM, stochastic calculus.
Jan 2019 — Aug 2019BITS Pilani
MATLAB Compiler BITS Pilani · C++ static analyzer
  • MATLAB compiler in C++ with a static code analyzer built on unordered maps, equivalence relations, and various hashing techniques.
  • Stack — C++, Boost, dynamic and structural equivalence, compiler construction.

05Education

2 entries
Oct 2022 — Jan 2025Virtual
M.Sc. Financial Engineering Worldquant University
  • Merit-based admission offered by Worldquant, a quantitative hedge fund. 100% tuition-free.
  • Coursework: Financial Markets, Econometrics, Derivative Pricing, Stochastic Modelling, ML in Finance, Portfolio & Wealth Management.
Aug 2016 — Aug 2020Hyderabad
B.E. Computer Science BITS Pilani
  • Graduated with First Division.

06Certifications

7 entries
Mar 2024
UC Berkeley MFE — pre-program: Mathematics, Statistics, Python.
Pass
Sep 2023
Worldquant Gold Certification — market-neutral long-short alpha construction.
Pass
Dec 2023
Akuna Capital — Options 101.
Pass
Feb 2023
CFA Level 1 — Chartered Financial Analyst.
Pass
Oct 2022
Baruch College — C++ Primer for Financial Engineering.
88%
Jun 2022
Wharton — Fundamentals of Quantitative Modelling.
83%
May 2022
QuantInsti — Algorithmic Trading; CPD-accredited, UK.
98%

07Recognition

8 entries
2024
Flow Traders · E-house Day Trading1st place; qualified the math test for a trading role.
India
2022
Alfalgo · CTO — architected a stealth algo-trading platform; $100K pre-seed offer at 5% from First Cheque.
India
2022
Walmart · Instant Karma Award — for handling complex brand-regulation features and mentoring junior engineers at Flipkart.
India
2021
GRE & TOEFL327/340 (full marks in Quantitative); 115/120 on TOEFL.
India
2021
Walmart · Best Team Award — for business excellence saving USD 5M / annum.
India
2020
National University of Singapore — selected for the Global Academic Internship Programme.
India
2020
Codechef — global rank 322 in Lunchtime; qualified for Google Code Jam.
India
2017
Government of India · Certificate of Appreciation — with the state's Additional Director General of Police on dispatch-time reduction for police vehicles.
India

08Volunteering

3 entries
2018
UP 100 · Police Emergency Management System — with senior police and cyber-security officials: PRV dispatch-time reduction, IP tracker against DDoS, HRMS with Gantt-chart on-call rosters.
India
2022
Child Rights and You — designed written and spoken English curriculum for underprivileged children.
India
2022
Microsoft · Giving Champion — raised approximately ₹3,00,000 for the organisation.
India