I'm Prit Patel — a CSE student at CHARUSAT University and an independent AI product builder who ships ambitious, full-stack, AI-native systems from idea to deployment.
I specialize in building LLM applications, RAG pipelines, multi-agent systems, and real-time full-stack platforms that don't just demo well — they scale. My work sits at the intersection of product thinking, modern engineering, and generative AI — and I move fast.
Currently obsessed with: agentic AI workflows, autonomous research systems, real-time financial simulations powered by LLMs, and AI-native developer tooling that replaces incumbent SaaS.
Open To: AI/ML Internships · Full-Stack Collaborations · Hackathons · Open Source · Research Partnerships · Speaking Opportunities
| Domain | Proficiency | Details |
|---|---|---|
| Large Language Models | ★★★★★ | OpenAI GPT-4o, Claude (Anthropic), Gemini 1.5 Pro, Groq LLaMA 3.3, NVIDIA NIM |
| RAG Pipelines | ★★★★★ | Vector stores (Qdrant, Supabase), semantic chunking, hybrid retrieval, reranking |
| Agentic AI / Multi-Agent | ★★★★★ | LangGraph, LangChain, autonomous research agents, tool-use orchestration |
| AI-Native APIs | ★★★★☆ | Developer-facing search APIs, TTS engines, LLM-optimized result processing |
| On-Device / Edge AI | ★★★★☆ | ONNX Runtime, WebGPU inference, TensorFlow.js, MediaPipe browser-side models |
| Deep Learning | ★★★☆☆ | Transfer learning (EfficientNetB4), image classification, custom architectures |
| Prompt Engineering | ★★★★★ | System prompts, chain-of-thought, structured outputs, multi-turn reasoning |
Kyren — AI Active Learning OS
Full courses. Real structure. Generated in under 60 seconds from any topic. Kyren is an AI-powered learning platform that transforms any subject into a complete, structured, multi-module course with lessons, quizzes, and progress tracking — entirely generated by LLM pipelines, not curated by hand.
| Attribute | Details |
|---|---|
| Stack | Next.js · Gemini 1.5 Pro · LangChain · Supabase · PostgreSQL · Tailwind CSS |
| Scale | Full multi-module course generation in under 60 seconds end-to-end |
| Performance | Sub-60s full course generation · Real-time streaming to client |
| Security | Supabase Auth · Row-level security · JWT-based session management |
| Impact | Democratizes structured learning for any topic, language, or skill level |
| Repository | github.com/pritpatel2412/kyren |
The platform uses a multi-stage LLM pipeline: topic decomposition → curriculum generation → per-module content synthesis → assessment generation. Each pipeline stage is independently streamed to the UI, giving users instant feedback and a perceived sub-10s start time even for complex topics.
SearchMind API — AI-Native Web Search Platform (Tavily Alternative)
A production-grade, developer-first web search API built for LangGraph, LangChain, RAG pipelines, and autonomous AI agents. SearchMind delivers clean, structured, LLM-optimized search results via a simple REST API and pip-installable Python SDK — designed as a direct Tavily alternative with superior content extraction and native agentic tooling.
| Attribute | Details |
|---|---|
| Stack | FastAPI · PostgreSQL · Redis · Celery · Brave Search · Claude API · React · Docker |
| Scale | Tiered plans from 1K to 100K+ requests/month · Redis-backed rate limiting per API key |
| Performance | Sub-second cached responses · Async parallel fetch + extract pipeline · Playwright fallback for JS pages |
| Security | SHA-256 hashed API keys · Per-key rate limits · Redis sliding window · CORS middleware |
| Impact | Drop-in Tavily replacement · Native LangChain BaseTool + LangGraph StructuredTool wrappers · pip installable SDK |
| Repository | github.com/pritpatel2412/searchmind |
SearchMind's core pipeline runs: Brave Search fetch → domain filtering → safety screening → parallel content extraction (trafilatura + readability fallback) → credibility-weighted relevance ranking → optional Claude-powered answer synthesis. The Python SDK ships with native LangGraph and LangChain tool wrappers, making it a one-line drop-in for any agentic AI stack.
ReadAloud SaaS — Privacy-First Multilingual TTS Platform
The only on-device TTS platform combining enterprise-grade speech quality, 31-language support, expressive speech controls, and GDPR-compliant zero-data-retention — accessible via a SaaS dashboard, REST API, and Chrome extension. Powered by Supertonic's ONNX inference engine running entirely in-browser via WebGPU.
| Attribute | Details |
|---|---|
| Stack | Next.js · ONNX Runtime · WebGPU · Stripe · FastAPI · Flutter · PostgreSQL · Firebase Auth |
| Scale | 31-language support · 10K chars/request · Batch endpoint up to 50 files · OpenAI TTS API-compatible |
| Performance | Sub-3s audio generation for 500-char input · 100% on-device inference · zero server-side text transmission |
| Security | Zero cloud text exposure · GDPR-compliant · WCAG 2.1 AA · SAML 2.0 SSO for enterprise |
| Impact | Accessibility, e-learning, content creation, and developer RAG/voice-agent use cases across 31 languages |
| Repository | github.com/pritpatel2412/readaloud |
ReadAloud solves the core privacy failure of every cloud TTS platform: user text is never transmitted. All inference runs in-browser via ONNX Runtime + WebGPU acceleration. The platform ships an OpenAI-compatible /v1/audio/speech endpoint, making it a zero-code migration for any existing TTS integration. An expression tag system (<laugh>, <breath>, <emphasis>, <pause>) delivers natural prosody that cloud alternatives cannot match on-device.
ARIA v2.0 — Autonomous Multi-Agent Research Platform
Multi-agent web research with browser automation, visual delta analysis, and synthesized intelligence. ARIA dispatches autonomous sub-agents to explore the web in parallel, compares visual states across time, and produces structured research reports via Groq LLaMA 3.3.
| Attribute | Details |
|---|---|
| Stack | FastAPI · LangChain · Playwright · Groq LLaMA 3.3 · Python · WebSockets |
| Scale | Parallel multi-agent orchestration · Concurrent browser sessions · Real-time streaming output |
| Performance | Sub-agent parallelism reduces research time by 60%+ vs sequential approaches |
| Security | Sandboxed browser contexts · No persistent session leakage between agent runs |
| Impact | Autonomous research synthesis that outperforms single-query LLM approaches |
| Repository | github.com/pritpatel2412/ARIA |
ARIA's architecture decomposes a research query into sub-queries, dispatches independent Playwright browser agents for each, runs visual diff analysis across page snapshots, and synthesizes all findings through a structured LLM reasoning chain. The result is a report that cites live web sources rather than training-data knowledge.
StockMind — Real-Time Multi-Agent Financial Market Simulation
A real-time financial market simulation where AI agents trade, reason, and compete using LLMs and RAG. Thousands of autonomous agent decisions execute per tick, with live market state streamed to connected clients via WebSocket.
| Attribute | Details |
|---|---|
| Stack | FastAPI · WebSockets · Groq · NVIDIA NIM · Qdrant · Python · React |
| Scale | Thousands of concurrent agent decisions per simulation tick |
| Performance | Real-time WebSocket streaming · Sub-100ms tick latency · Vector-augmented agent reasoning |
| Security | Isolated agent state · Deterministic tick engine · No shared mutable agent context |
| Impact | Demonstrates emergent market behavior from pure LLM-driven agent incentives |
| Repository | github.com/pritpatel2412/StockMind |
Each simulated agent in StockMind holds a portfolio, receives a market state context window, queries a Qdrant vector store for historical analogs, and reasons through a Groq-hosted LLaMA model to determine its next action. The aggregate of thousands of such micro-decisions produces emergent price discovery, volatility clustering, and boom-bust cycles observable in real-time on the dashboard.
CodeGuard — AI-Powered PR Security Review Platform
Autonomous PR risk scoring and vulnerability detection before every merge. CodeGuard integrates with GitHub Webhooks, analyzes every pull request for security vulnerabilities via GPT-4o, and surfaces risk scores and fix suggestions directly in the PR timeline using real-time Socket.io updates.
| Attribute | Details |
|---|---|
| Stack | Next.js · React 18 · TypeScript · Node.js · Express · PostgreSQL · GPT-4o · GitHub API · Socket.io · XYFlow |
| Scale | Per-PR analysis on webhook trigger · Cross-file taint analysis graph · Policy-as-Prompt via .codeguard.yml |
| Performance | Real-time streaming risk scores via Socket.io · Sub-5s analysis for most PRs |
| Security | GitHub OAuth · Zod-validated policy schemas · Scoped webhook secrets per repository |
| Impact | Catches vulnerabilities at review time, not post-merge — eliminates a class of production security incidents |
| Repository | github.com/pritpatel2412/CodeGuard |
CodeGuard's Policy-as-Prompt system allows teams to define custom security rules in .codeguard.yml, which are dynamically injected into the GPT-4o analysis prompt at review time. The Cross-File Taint Analysis engine builds a data flow graph across all changed files using XYFlow, identifying vulnerabilities that span module boundaries and cannot be caught by per-file linters.
RedForge — Autonomous Red-Team Security Orchestration Platform
AI-driven red-team orchestration platform that automates adversarial security testing workflows. RedForge models attacker reasoning chains, generates targeted exploit scenarios, and orchestrates autonomous scanning and reporting pipelines.
| Attribute | Details |
|---|---|
| Stack | Python · FastAPI · LangChain · React · PostgreSQL |
| Scale | Multi-target concurrent orchestration · Modular attack chain composition |
| Performance | Autonomous end-to-end workflow execution without human-in-the-loop |
| Security | Scope-bounded execution · Audit logging · Safe mode for non-destructive recon |
| Impact | Reduces red-team engagement cost and time by automating reconnaissance and scenario generation |
| Repository | github.com/pritpatel2412/RedForge |
KemLang — Gujarati-Inspired Programming Language
A complete toy programming language built from scratch, rooted in Gujarati syntax. KemLang ships with a full interpreter, CLI, React-based web playground, and a Python/FastAPI backend — demonstrating end-to-end language tooling implementation as a culturally grounded systems project.
| Attribute | Details |
|---|---|
| Stack | Python · FastAPI · React · TypeScript · Tailwind CSS |
| Scale | Full interpreter pipeline: lexer → parser → AST → evaluator |
| Performance | In-browser execution via playground API · Sub-100ms eval for standard programs |
| Security | Sandboxed execution context · No host filesystem access |
| Impact | Demonstrates language design, interpreter construction, and culturally grounded CS education |
| Repository | github.com/pritpatel2412/kemlang |
AI Developer Intern · StayChat AI June 2026 – Present
StayChat AI is a hospitality tech startup that automates hotel bookings and guest communication via WhatsApp AI agents. As AI Developer Intern, I build and maintain the agentic AI pipelines that power real-time guest interactions across hotel properties.
- Designing and implementing WhatsApp-native conversational agents using LLM orchestration
- Building agentic booking workflows with tool use, state management, and fallback handling
- Integrating hotel PMS APIs with AI agent pipelines for real-time availability and reservation management
- Developing prompt engineering frameworks for hospitality-specific conversation flows
LangChain LLM Agents WhatsApp API FastAPI Python Prompt Engineering Agentic AI
| Recognition | Details |
|---|---|
| GPA 9.71 / 10 | B.Tech CSE, CHARUSAT University (2023–2027) — top of cohort academic performance |
| LeetCode 400+ | 400+ problems solved across DSA, Dynamic Programming, Graphs, and Trees |
| GfG 160 | Completed GeeksforGeeks 160-Day Problem Solving streak without interruption |
| Pull Shark | GitHub Pull Shark Achievement — consistent open-source contribution record |
| 51 Public Repos | Active public portfolio spanning full-stack, AI systems, security tooling, and language design |
| < 60s Course Gen | Kyren generates complete multi-module AI courses in under 60 seconds end-to-end |
| 30 FPS AI Inference | ProctorX runs browser-side proctoring inference at 30 fps using MediaPipe — no server required |
| Zero-Transmission TTS | ReadAloud achieves enterprise-grade TTS across 31 languages with zero text leaving the user's browser |
| Multi-Agent Simulation | StockMind processes thousands of concurrent LLM-driven agent decisions per market tick in real-time |
Amazon Web Services
IBM via Coursera
Amazon via Coursera
Forage
Let's Upgrade
HackerRank
GeeksforGeeks
learning:
- Advanced LangGraph state machine patterns for production agentic systems
- On-device ML inference optimization with ONNX Runtime + WebGPU
- Distributed systems design for high-throughput AI API platforms
building:
- SearchMind API — AI-native web search infrastructure for LangGraph / RAG developers
- ReadAloud SaaS — privacy-first on-device multilingual TTS platform
- StayChat AI — WhatsApp agentic booking and guest communication pipelines
exploring:
- AI-native developer tooling that replaces incumbent SaaS categories
- Multi-agent coordination protocols and emergent behavior in LLM agent swarms
- WebGPU compute shaders for real-time in-browser AI inference
open_to:
- AI / ML internships and full-time roles
- AI product collaborations and co-founding opportunities
- Open source contributions to LangChain, LangGraph, and agentic frameworks
- Hackathons, research partnerships, and speaking engagementsI'm always open to a conversation — whether it's about a project, an opportunity, or a problem worth solving.
