const steven = {
role: "AI Engineer · CTO @ INTU",
focus: ["LLM agents", "agentic issue-fix pipelines", "RAG",
"multimodal doc + image ingestion", "evals", "observability"],
stack: ["React", "Node", "Python", "Rust", "Postgres", "GraphQL"],
web3: ["MPC", "DKG", "EVM", "Solana"],
shipping: "production agent systems",
};I build agent systems that survive contact with production — tool-using LLMs wired through MCP, grounded by RAG, gated by eval harnesses, and instrumented end-to-end so failures are observable instead of mysterious. Before AI I spent four years deep in Web3, leading an MPC wallet-infrastructure team across cryptography, smart contracts, and Rust.
AI / ML
Languages & Core
Platforms
Web3
🔐 INTU — Web3 onboarding via MPCCTO · Lead Engineer Open-source NPM package orchestrating distributed key generation (DKG) and multi-party computation, removing seed phrases from the onboarding flow. Cross-chain transaction flows across EVM networks, bridged to Solana — sending a Solana tx authorized by an EVM signature. Self-hosted The Graph indexers for chains without hosted support.
|
Selected Project · 2026 LLM agent layered onto an open-source EHR that reads patient charts and relays clinical context on demand. Lab-report ingestion pipeline produces summaries with source-page citations, so clinicians can verify any agent-surfaced claim — a RAG pattern tuned for high-stakes clinical use.
|
🍽️ GURUPass / Pass Rewards — Restaurant AI AgentsLead AI & Blockchain Engineer Tool-using LLM agents handling order intake and menu Q&A, wired through MCP with structured-output validation. Curated eval set + offline regression harness catches failures before deploy; production traces drive failure-mode analysis. Personalization layer surfaces targeted coupons from purchase history.
|
Autonomous coding agent An agent that triages open GitHub issues, reproduces the bug, drafts a fix, and opens a PR — closing the loop from issue to reviewable change. Proof: medplum/medplum#9293 — an upstream OSS fix landed fully agentically (working branch).
|
Backtest harness and execution research for systematic futures strategies — applying the same eval + observability discipline I use on AI agents to strategy selection, slippage modeling, and live risk.
Python · Quant · Backtesting · WIP


