pritpatel2412/README.md

Who Am I?

I'm Prit Patel — a CSE student at CHARUSAT University and an independent AI product builder who ships ambitious, full-stack, AI-native systems from idea to deployment.

I specialize in building LLM applications, RAG pipelines, multi-agent systems, and real-time full-stack platforms that don't just demo well — they scale. My work sits at the intersection of product thinking, modern engineering, and generative AI — and I move fast.

Currently obsessed with: agentic AI workflows, autonomous research systems, real-time financial simulations powered by LLMs, and AI-native developer tooling that replaces incumbent SaaS.

Open To: AI/ML Internships  ·  Full-Stack Collaborations  ·  Hackathons  ·  Open Source  ·  Research Partnerships  ·  Speaking Opportunities


AI / ML Expertise

| Domain | Proficiency | Details |
| --- | --- | --- |
| Large Language Models | ★★★★★ | OpenAI GPT-4o, Claude (Anthropic), Gemini 1.5 Pro, Groq LLaMA 3.3, NVIDIA NIM |
| RAG Pipelines | ★★★★★ | Vector stores (Qdrant, Supabase), semantic chunking, hybrid retrieval, reranking |
| Agentic AI / Multi-Agent | ★★★★★ | LangGraph, LangChain, autonomous research agents, tool-use orchestration |
| AI-Native APIs | ★★★★☆ | Developer-facing search APIs, TTS engines, LLM-optimized result processing |
| On-Device / Edge AI | ★★★★☆ | ONNX Runtime, WebGPU inference, TensorFlow.js, MediaPipe browser-side models |
| Deep Learning | ★★★☆☆ | Transfer learning (EfficientNetB4), image classification, custom architectures |
| Prompt Engineering | ★★★★★ | System prompts, chain-of-thought, structured outputs, multi-turn reasoning |

Featured Projects

Kyren — AI Active Learning OS

Full courses. Real structure. Generated in under 60 seconds from any topic. Kyren is an AI-powered learning platform that transforms any subject into a complete, structured, multi-module course with lessons, quizzes, and progress tracking — entirely generated by LLM pipelines, not curated by hand.

| Attribute | Details |
| --- | --- |
| Stack | Next.js · Gemini 1.5 Pro · LangChain · Supabase · PostgreSQL · Tailwind CSS |
| Scale | Full multi-module course generation in under 60 seconds end-to-end |
| Performance | Sub-60s full course generation · Real-time streaming to client |
| Security | Supabase Auth · Row-level security · JWT-based session management |
| Impact | Democratizes structured learning for any topic, language, or skill level |
| Repository | github.com/pritpatel2412/kyren |

The platform uses a multi-stage LLM pipeline: topic decomposition → curriculum generation → per-module content synthesis → assessment generation. Each pipeline stage is independently streamed to the UI, giving users instant feedback and a perceived sub-10s start time even for complex topics.
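The staged flow described above can be sketched as a Python generator that yields one event per finished stage, so a client can render partial results as they arrive. This is a minimal illustration, not Kyren's actual code: `fake_llm`, `generate_course`, and the event shape are all hypothetical names, and the decomposition step is a placeholder for a real model call.

```python
from typing import Callable, Iterator


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a model call; returns a tagged echo."""
    return f"[generated: {prompt}]"


def generate_course(
    topic: str,
    llm: Callable[[str], str] = fake_llm,
    module_count: int = 2,
) -> Iterator[dict]:
    """Yield one event per completed stage instead of one final payload."""
    # Stage 1: topic decomposition (placeholder split, no model involved).
    subtopics = [f"{topic} part {i + 1}" for i in range(module_count)]
    yield {"stage": "decomposition", "data": subtopics}

    # Stage 2: curriculum generation, one outline per subtopic.
    curriculum = [llm(f"outline for {s}") for s in subtopics]
    yield {"stage": "curriculum", "data": curriculum}

    # Stage 3: per-module content synthesis.
    for index, outline in enumerate(curriculum):
        yield {"stage": "content", "module": index, "data": llm(f"lesson from {outline}")}

    # Stage 4: assessment generation for each module.
    for index, outline in enumerate(curriculum):
        yield {"stage": "assessment", "module": index, "data": llm(f"quiz for {outline}")}


events = list(generate_course("Linear Algebra"))
for event in events:
    print(event["stage"], event.get("module", ""))
```

Because the first event is produced before any lesson text exists, a UI consuming this generator could show the course outline almost immediately, which is one plausible way to get a fast perceived start time.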


SearchMind API — AI-Native Web Search Platform (Tavily Alternative)

A production-grade, developer-first web search API built for LangGraph, LangChain, RAG pipelines, and autonomous AI agents. SearchMind delivers clean, structured, LLM-optimized search results via a simple REST API and pip-installable Python SDK — designed as a direct Tavily alternative with superior content extraction and native agentic tooling.

| Attribute | Details |
| --- | --- |
| Stack | FastAPI · PostgreSQL · Redis · Celery · Brave Search · Claude API · React · Docker |
| Scale | Tiered plans from 1K to 100K+ requests/month · Redis-backed rate limiting per API key |
| Performance | Sub-second cached responses · Async parallel fetch + extract pipeline · Playwright fallback for JS pages |
| Security | SHA-256 hashed API keys · Per-key rate limits · Redis sliding window · CORS middleware |
| Impact | Drop-in Tavily replacement · Native LangChain BaseTool + LangGraph StructuredTool wrappers · pip installable SDK |
| Repository | github.com/pritpatel2412/searchmind |

SearchMind's core pipeline runs: Brave Search fetch → domain filtering → safety screening → parallel content extraction (trafilatura + readability fallback) → credibility-weighted relevance ranking → optional Claude-powered answer synthesis. The Python SDK ships with native LangGraph and LangChain tool wrappers, making it a one-line drop-in for any agentic AI stack.
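A rough sketch of the fetch, filter, parallel-extract, and rank stages follows, using only the standard library. Everything here is assumed for illustration: the `Hit` type, the `CREDIBILITY` weights, the blocked-domain set, and the stubbed `search` and `extract` functions are not SearchMind's real API, and the safety-screening and answer-synthesis stages are left out.

```python
import asyncio
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class Hit:
    url: str
    relevance: float  # hypothetical provider score in the range 0..1
    text: str = ""


# Hypothetical credibility weights; unknown domains get a neutral 0.5.
CREDIBILITY = {"docs.python.org": 1.0, "example.com": 0.4}
BLOCKED_DOMAINS = {"spam.example"}


async def search(query: str) -> list[Hit]:
    """Stub for the upstream search call (the README names Brave Search)."""
    return [
        Hit("https://example.com/a", 0.9),
        Hit("https://docs.python.org/3/library/asyncio.html", 0.7),
        Hit("https://spam.example/x", 0.99),
    ]


async def extract(hit: Hit) -> Hit:
    """Stub extractor; a real one would download and clean the page."""
    await asyncio.sleep(0)  # yield control, as real network I/O would
    hit.text = f"content of {hit.url}"
    return hit


def score(hit: Hit) -> float:
    """Weight the provider's relevance by the domain's credibility."""
    domain = urlparse(hit.url).netloc
    return hit.relevance * CREDIBILITY.get(domain, 0.5)


async def run_pipeline(query: str) -> list[Hit]:
    hits = await search(query)
    allowed = [h for h in hits if urlparse(h.url).netloc not in BLOCKED_DOMAINS]
    # Extract all remaining pages concurrently rather than one by one.
    extracted = await asyncio.gather(*(extract(h) for h in allowed))
    return sorted(extracted, key=score, reverse=True)


results = asyncio.run(run_pipeline("asyncio gather"))
for hit in results:
    print(round(score(hit), 2), hit.url)
```

In this toy data the lower-relevance but higher-credibility page ranks first, which shows why weighting by source credibility can reorder raw search results.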


ReadAloud SaaS — Privacy-First Multilingual TTS Platform

The only on-device TTS platform combining enterprise-grade speech quality, 31-language support, expressive speech controls, and GDPR-compliant zero-data-retention — accessible via a SaaS dashboard, REST API, and Chrome extension. Powered by Supertonic's ONNX inference engine running entirely in-browser via WebGPU.

| Attribute | Details |
| --- | --- |
| Stack | Next.js · ONNX Runtime · WebGPU · Stripe · FastAPI · Flutter · PostgreSQL · Firebase Auth |
| Scale | 31-language support · 10K chars/request · Batch endpoint up to 50 files · OpenAI TTS API-compatible |
| Performance | Sub-3s audio generation for 500-char input · 100% on-device inference · zero server-side text transmission |
| Security | Zero cloud text exposure · GDPR-compliant · WCAG 2.1 AA · SAML 2.0 SSO for enterprise |
| Impact | Accessibility, e-learning, content creation, and developer RAG/voice-agent use cases across 31 languages |
| Repository | github.com/pritpatel2412/readaloud |

ReadAloud addresses the core privacy weakness of cloud TTS platforms by never transmitting user text: all inference runs in-browser via ONNX Runtime with WebGPU acceleration. The platform ships an OpenAI-compatible /v1/audio/speech endpoint, making it a zero-code migration for existing OpenAI-style TTS integrations. An expression tag system (<laugh>, <breath>, <emphasis>, <pause>) adds natural prosody while keeping inference on-device, which cloud alternatives cannot offer.
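One way to handle the expression tags listed above is to split the input into plain-text and tag tokens before synthesis. The sketch below assumes the tags are standalone markers rather than paired open/close elements, which the README does not specify; `tokenize` and the token tuple shape are hypothetical.

```python
import re

# Tag names taken from the README; treating them as standalone markers is an assumption.
TAGS = ("laugh", "breath", "emphasis", "pause")
TAG_PATTERN = re.compile(r"<(" + "|".join(TAGS) + r")>")


def tokenize(text: str) -> list[tuple[str, str]]:
    """Split text into ("text", chunk) and ("tag", name) tokens, in order."""
    tokens: list[tuple[str, str]] = []
    position = 0
    for match in TAG_PATTERN.finditer(text):
        chunk = text[position:match.start()].strip()
        if chunk:
            tokens.append(("text", chunk))
        tokens.append(("tag", match.group(1)))
        position = match.end()
    tail = text[position:].strip()
    if tail:
        tokens.append(("text", tail))
    return tokens


tokens = tokenize("Hello there <pause> that was funny <laugh>")
print(tokens)
```

Unrecognized angle-bracket sequences such as `<b>` are left inside the text chunks, so only the four known tags affect prosody in this sketch.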


ARIA v2.0 — Autonomous Multi-Agent Research Platform

Multi-agent web research with browser automation, visual delta analysis, and synthesized intelligence. ARIA dispatches autonomous sub-agents to explore the web in parallel, compares visual states across time, and produces structured research reports via Groq LLaMA 3.3.

| Attribute | Details |
| --- | --- |
| Stack | FastAPI · LangChain · Playwright · Groq LLaMA 3.3 · Python · WebSockets |
| Scale | Parallel multi-agent orchestration · Concurrent browser sessions · Real-time streaming output |
| Performance | Sub-agent parallelism reduces research time by 60%+ vs sequential approaches |
| Security | Sandboxed browser contexts · No persistent session leakage between agent runs |
| Impact | Autonomous research synthesis that outperforms single-query LLM approaches |
| Repository | github.com/pritpatel2412/ARIA |

ARIA's architecture decomposes a research query into sub-queries, dispatches independent Playwright browser agents for each, runs visual diff analysis across page snapshots, and synthesizes all findings through a structured LLM reasoning chain. The result is a report that cites live web sources rather than training-data knowledge.
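The decompose, dispatch, and synthesize flow can be illustrated with `asyncio` and a semaphore that caps how many agents run at once. This is a sketch under assumptions: `decompose`, `run_agent`, `research`, and `MAX_CONCURRENT_AGENTS` are hypothetical names, the agents are stubs rather than Playwright sessions, and the visual diff step is omitted.

```python
import asyncio

MAX_CONCURRENT_AGENTS = 2  # hypothetical cap on simultaneous browser sessions


def decompose(query: str) -> list[str]:
    """Placeholder decomposition; a real system would ask an LLM for sub-queries."""
    aspects = ("background", "recent changes", "open questions")
    return [f"{query}: {aspect}" for aspect in aspects]


async def run_agent(sub_query: str, limiter: asyncio.Semaphore) -> dict:
    """Stub agent; a real one would drive an isolated browser context."""
    async with limiter:
        await asyncio.sleep(0.01)  # stands in for page loads
        return {"sub_query": sub_query, "finding": f"notes on {sub_query}"}


async def research(query: str) -> dict:
    limiter = asyncio.Semaphore(MAX_CONCURRENT_AGENTS)
    sub_queries = decompose(query)
    # gather() returns results in the same order as the sub-queries.
    findings = await asyncio.gather(*(run_agent(q, limiter) for q in sub_queries))
    # Synthesis is reduced to concatenation; the README describes an LLM reasoning chain.
    summary = "; ".join(item["finding"] for item in findings)
    return {"query": query, "findings": list(findings), "summary": summary}


report = asyncio.run(research("WebGPU adoption"))
print(report["summary"])
```

Capping concurrency with a semaphore is a common choice when each agent holds an expensive resource such as a browser session; whether ARIA does this is not stated in the README.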


StockMind — Real-Time Multi-Agent Financial Market Simulation

A real-time financial market simulation where AI agents trade, reason, and compete using LLMs and RAG. Thousands of autonomous agent decisions execute per tick, with live market state streamed to connected clients via WebSocket.

| Attribute | Details |
| --- | --- |
| Stack | FastAPI · WebSockets · Groq · NVIDIA NIM · Qdrant · Python · React |
| Scale | Thousands of concurrent agent decisions per simulation tick |
| Performance | Real-time WebSocket streaming · Sub-100ms tick latency · Vector-augmented agent reasoning |
| Security | Isolated agent state · Deterministic tick engine · No shared mutable agent context |
| Impact | Demonstrates emergent market behavior from pure LLM-driven agent incentives |
| Repository | github.com/pritpatel2412/StockMind |

Each simulated agent in StockMind holds a portfolio, receives a market state context window, queries a Qdrant vector store for historical analogs, and reasons through a Groq-hosted LLaMA model to determine its next action. The aggregate of thousands of such micro-decisions produces emergent price discovery, volatility clustering, and boom-bust cycles observable in real-time on the dashboard.
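The per-agent loop described above (hold a portfolio, look up a historical analog, decide an action) can be sketched as follows. The in-memory `HISTORY` list with cosine similarity stands in for a vector store query, and a simple rule stands in for the LLM call; `Agent`, `tick`, and the two-element state vectors are hypothetical and chosen only for illustration.

```python
import math
from dataclasses import dataclass


def cosine(a: tuple, b: tuple) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Hypothetical history: (market-state vector, what the price did next).
HISTORY = [((1.0, 0.0), "rose"), ((0.0, 1.0), "fell")]


def nearest_analog(state: tuple) -> str:
    """In-memory stand-in for a nearest-neighbor query against a vector store."""
    return max(HISTORY, key=lambda item: cosine(item[0], state))[1]


@dataclass
class Agent:
    name: str
    cash: float
    shares: int = 0

    def decide(self, state: tuple) -> str:
        # Rule-based placeholder for the LLM reasoning step.
        outcome = nearest_analog(state)
        if outcome == "rose" and self.cash > 0:
            return "buy"
        if outcome == "fell" and self.shares > 0:
            return "sell"
        return "hold"


def tick(agents: list, state: tuple, price: float) -> dict:
    """All agents decide on the same snapshot, then orders apply in list order."""
    decisions = {agent.name: agent.decide(state) for agent in agents}
    for agent in agents:
        action = decisions[agent.name]
        if action == "buy" and agent.cash >= price:
            agent.cash -= price
            agent.shares += 1
        elif action == "sell":
            agent.cash += price
            agent.shares -= 1
    return decisions


agents = [Agent("a", cash=100.0), Agent("b", cash=0.0, shares=2)]
first = tick(agents, (0.9, 0.1), price=10.0)
second = tick(agents, (0.1, 0.9), price=10.0)
print(first, second)
```

Separating the decision phase from the apply phase means no agent sees another agent's action within the same tick, which is one simple way to keep a tick deterministic.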


CodeGuard — AI-Powered PR Security Review Platform

Autonomous PR risk scoring and vulnerability detection before every merge. CodeGuard integrates with GitHub Webhooks, analyzes every pull request for security vulnerabilities via GPT-4o, and surfaces risk scores and fix suggestions directly in the PR timeline using real-time Socket.io updates.

| Attribute | Details |
| --- | --- |
| Stack | Next.js · React 18 · TypeScript · Node.js · Express · PostgreSQL · GPT-4o · GitHub API · Socket.io · XYFlow |
| Scale | Per-PR analysis on webhook trigger · Cross-file taint analysis graph · Policy-as-Prompt via .codeguard.yml |
| Performance | Real-time streaming risk scores via Socket.io · Sub-5s analysis for most PRs |
| Security | GitHub OAuth · Zod-validated policy schemas · Scoped webhook secrets per repository |
| Impact | Catches vulnerabilities at review time, not post-merge; eliminates a class of production security incidents |
| Repository | github.com/pritpatel2412/CodeGuard |

CodeGuard's Policy-as-Prompt system allows teams to define custom security rules in .codeguard.yml, which are dynamically injected into the GPT-4o analysis prompt at review time. The Cross-File Taint Analysis engine builds a data flow graph across all changed files using XYFlow, identifying vulnerabilities that span module boundaries and cannot be caught by per-file linters.
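The Policy-as-Prompt idea can be sketched as two steps: validate the parsed policy, then render its rules into the review prompt. CodeGuard itself is a TypeScript project; Python is used here only to keep the examples in one language. The policy shape, field names, severity levels, and prompt wording below are all assumptions, since the README does not document the .codeguard.yml schema.

```python
ALLOWED_SEVERITIES = {"low", "medium", "high"}

# Hypothetical result of parsing a .codeguard.yml file; the real schema is unknown.
POLICY = {
    "rules": [
        {"id": "no-raw-sql", "severity": "high",
         "description": "Flag SQL built by string concatenation."},
        {"id": "no-secrets", "severity": "medium",
         "description": "Flag hard-coded credentials or tokens."},
    ]
}


def validate_policy(policy: dict) -> list:
    """Reject malformed rules before they reach the prompt."""
    rules = policy.get("rules")
    if not isinstance(rules, list) or not rules:
        raise ValueError("policy must define a non-empty 'rules' list")
    for rule in rules:
        missing = {"id", "severity", "description"} - rule.keys()
        if missing:
            raise ValueError(f"rule is missing fields: {sorted(missing)}")
        if rule["severity"] not in ALLOWED_SEVERITIES:
            raise ValueError(f"unknown severity: {rule['severity']}")
    return rules


def build_review_prompt(policy: dict, diff: str) -> str:
    """Render validated team rules into the prompt sent with the PR diff."""
    rules = validate_policy(policy)
    rule_lines = "\n".join(
        f"- [{rule['severity']}] {rule['id']}: {rule['description']}" for rule in rules
    )
    return (
        "You are reviewing a pull request for security issues.\n"
        "Apply these team rules in addition to general checks:\n"
        f"{rule_lines}\n\n"
        "Diff to review:\n"
        f"{diff}"
    )


prompt = build_review_prompt(POLICY, "+ query = 'SELECT * FROM users WHERE id=' + user_id")
print(prompt)
```

Validating before rendering matters because policy text ends up inside the model prompt; a strict schema limits what a repository's config file can inject.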


RedForge — Autonomous Red-Team Security Orchestration Platform

AI-driven red-team orchestration platform that automates adversarial security testing workflows. RedForge models attacker reasoning chains, generates targeted exploit scenarios, and orchestrates autonomous scanning and reporting pipelines.

| Attribute | Details |
| --- | --- |
| Stack | Python · FastAPI · LangChain · React · PostgreSQL |
| Scale | Multi-target concurrent orchestration · Modular attack chain composition |
| Performance | Autonomous end-to-end workflow execution without human-in-the-loop |
| Security | Scope-bounded execution · Audit logging · Safe mode for non-destructive recon |
| Impact | Reduces red-team engagement cost and time by automating reconnaissance and scenario generation |
| Repository | github.com/pritpatel2412/RedForge |
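The scope-bounded execution, audit logging, and safe mode listed in the table can be illustrated with a small guard that every action must pass before it runs. This is a hypothetical sketch, not RedForge's implementation: the `Scope` class, `ScopeError`, and the log format are invented for the example.

```python
from urllib.parse import urlparse


class ScopeError(Exception):
    """Raised when an action falls outside the authorized engagement."""


class Scope:
    def __init__(self, allowed_hosts: set, safe_mode: bool = True):
        self.allowed_hosts = {host.lower() for host in allowed_hosts}
        self.safe_mode = safe_mode
        self.audit_log: list = []

    def in_scope(self, url: str) -> bool:
        host = (urlparse(url).hostname or "").lower()
        # Accept an exact match or a true subdomain of an allowed host.
        return any(
            host == allowed or host.endswith("." + allowed)
            for allowed in self.allowed_hosts
        )

    def authorize(self, url: str, action: str, destructive: bool = False) -> None:
        """Record every decision, and raise before anything out of bounds runs."""
        if not self.in_scope(url):
            self.audit_log.append(f"DENY {action} {url} (out of scope)")
            raise ScopeError(f"{url} is out of scope")
        if destructive and self.safe_mode:
            self.audit_log.append(f"DENY {action} {url} (safe mode)")
            raise ScopeError(f"{action} is blocked in safe mode")
        self.audit_log.append(f"ALLOW {action} {url}")


scope = Scope({"staging.example.test"})
scope.authorize("https://api.staging.example.test/login", "header-check")
print(scope.audit_log)
```

The subdomain check compares against `"." + allowed` rather than a bare suffix, so a lookalike host such as `evilstaging.example.test` does not pass.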

KemLang — Gujarati-Inspired Programming Language

A complete toy programming language built from scratch, rooted in Gujarati syntax. KemLang ships with a full interpreter, CLI, React-based web playground, and a Python/FastAPI backend — demonstrating end-to-end language tooling implementation as a culturally grounded systems project.

| Attribute | Details |
| --- | --- |
| Stack | Python · FastAPI · React · TypeScript · Tailwind CSS |
| Scale | Full interpreter pipeline: lexer → parser → AST → evaluator |
| Performance | In-browser execution via playground API · Sub-100ms eval for standard programs |
| Security | Sandboxed execution context · No host filesystem access |
| Impact | Demonstrates language design, interpreter construction, and culturally grounded CS education |
| Repository | github.com/pritpatel2412/kemlang |
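The lexer → parser → AST → evaluator pipeline named in the table can be shown end to end with a deliberately tiny grammar. The grammar below is invented for illustration and is not KemLang's real syntax: it borrows the keywords `sharu`, `lakho`, and `samaapt` from the repository description and supports only integer expressions with `+` and `*`.

```python
import re


def lex(source: str) -> list:
    """Lexer: split source into number, word, and operator tokens."""
    return re.findall(r"\d+|[A-Za-z_]+|[+*]", source)


class Parser:
    """Recursive-descent parser producing a tuple-based AST."""

    def __init__(self, tokens: list):
        self.tokens = tokens
        self.pos = 0

    def peek(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else None

    def take(self, expected=None):
        token = self.peek()
        if token is None or (expected is not None and token != expected):
            raise SyntaxError(f"expected {expected!r}, got {token!r}")
        self.pos += 1
        return token

    def program(self) -> list:
        self.take("sharu")
        statements = []
        while self.peek() != "samaapt":
            self.take("lakho")
            statements.append(("print", self.expr()))
        self.take("samaapt")
        return statements

    def expr(self):
        node = self.term()
        while self.peek() == "+":
            self.take("+")
            node = ("add", node, self.term())
        return node

    def term(self):
        node = ("num", int(self.take()))
        while self.peek() == "*":
            self.take("*")
            node = ("mul", node, ("num", int(self.take())))
        return node


def evaluate(node):
    """Evaluator: walk the AST and compute its value."""
    kind = node[0]
    if kind == "num":
        return node[1]
    left, right = evaluate(node[1]), evaluate(node[2])
    return left + right if kind == "add" else left * right


def run(source: str) -> list:
    """Return the values that each lakho statement would print."""
    return [evaluate(expr) for _, expr in Parser(lex(source)).program()]


print(run("sharu lakho 1 + 2 * 3 lakho 4 * 5 samaapt"))  # prints [7, 20]
```

Splitting `expr` and `term` into separate methods is what gives `*` higher precedence than `+`, so `1 + 2 * 3` evaluates to 7 rather than 9.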

Experience

AI Developer Intern  ·  StayChat AI  ·  June 2026 – Present

StayChat AI is a hospitality tech startup that automates hotel bookings and guest communication via WhatsApp AI agents. As AI Developer Intern, I build and maintain the agentic AI pipelines that power real-time guest interactions across hotel properties.

  • Designing and implementing WhatsApp-native conversational agents using LLM orchestration
  • Building agentic booking workflows with tool use, state management, and fallback handling
  • Integrating hotel PMS APIs with AI agent pipelines for real-time availability and reservation management
  • Developing prompt engineering frameworks for hospitality-specific conversation flows
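The combination of tool use, state management, and fallback handling mentioned in the list above can be sketched as a small slot-filling state machine. None of this reflects StayChat AI's actual code: `Conversation`, `check_availability`, `step`, the slot names, and the availability rule are hypothetical, and slot extraction from guest text is assumed to happen in an LLM step that is not shown.

```python
from dataclasses import dataclass, field

REQUIRED_SLOTS = ("check_in", "nights")


@dataclass
class Conversation:
    state: str = "collecting"  # collecting -> confirmed | handoff
    slots: dict = field(default_factory=dict)


def check_availability(check_in: str, nights: int) -> bool:
    """Hypothetical tool; a real one would query the hotel's PMS API."""
    if nights > 14:
        raise RuntimeError("PMS lookup failed")  # simulated outage
    return nights <= 7


def step(convo: Conversation, extracted: dict) -> str:
    """Advance the conversation using slot values already extracted from a message."""
    convo.slots.update(extracted)
    missing = [slot for slot in REQUIRED_SLOTS if slot not in convo.slots]
    if missing:
        return "Could you share: " + ", ".join(missing) + "?"
    try:
        available = check_availability(convo.slots["check_in"], convo.slots["nights"])
    except RuntimeError:
        convo.state = "handoff"  # fallback: route the guest to a human
        return "I am connecting you with the front desk."
    if available:
        convo.state = "confirmed"
        return f"Booked from {convo.slots['check_in']} for {convo.slots['nights']} nights."
    convo.slots.pop("nights")  # keep the date, ask again for the stay length
    return "That stay length is not available. How many nights instead?"


convo = Conversation()
print(step(convo, {"check_in": "2026-07-01"}))
print(step(convo, {"nights": 3}))
```

Keeping the slots on the conversation object lets the agent resume across messages, and the explicit `handoff` state gives tool failures a defined outcome instead of a silent retry loop.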

LangChain · LLM Agents · WhatsApp API · FastAPI · Python · Prompt Engineering · Agentic AI


Achievements

| Recognition | Details |
| --- | --- |
| GPA 9.71 / 10 | B.Tech CSE, CHARUSAT University (2023–2027); top-of-cohort academic performance |
| LeetCode 400+ | 400+ problems solved across DSA, Dynamic Programming, Graphs, and Trees |
| GfG 160 | Completed GeeksforGeeks 160-Day Problem Solving streak without interruption |
| Pull Shark | GitHub Pull Shark Achievement; consistent open-source contribution record |
| 51 Public Repos | Active public portfolio spanning full-stack, AI systems, security tooling, and language design |
| < 60s Course Gen | Kyren generates complete multi-module AI courses in under 60 seconds end-to-end |
| 30 FPS AI Inference | ProctorX runs browser-side proctoring inference at 30 fps using MediaPipe, with no server required |
| Zero-Transmission TTS | ReadAloud achieves enterprise-grade TTS across 31 languages with zero text leaving the user's browser |
| Multi-Agent Simulation | StockMind processes thousands of concurrent LLM-driven agent decisions per market tick in real-time |

Certifications

  • Amazon Web Services: AWS
  • IBM via Coursera: Cybersecurity DevOps
  • Amazon via Coursera: SWE
  • Forage: Walmart
  • Let's Upgrade: ML
  • HackerRank: HackerRank
  • GeeksforGeeks: GfG


Coding Profiles

LeetCode · GeeksforGeeks · HackerRank · GitHub


Current Focus

learning:
  - Advanced LangGraph state machine patterns for production agentic systems
  - On-device ML inference optimization with ONNX Runtime + WebGPU
  - Distributed systems design for high-throughput AI API platforms

building:
  - SearchMind API — AI-native web search infrastructure for LangGraph / RAG developers
  - ReadAloud SaaS — privacy-first on-device multilingual TTS platform
  - StayChat AI — WhatsApp agentic booking and guest communication pipelines

exploring:
  - AI-native developer tooling that replaces incumbent SaaS categories
  - Multi-agent coordination protocols and emergent behavior in LLM agent swarms
  - WebGPU compute shaders for real-time in-browser AI inference

open_to:
  - AI / ML internships and full-time roles
  - AI product collaborations and co-founding opportunities
  - Open source contributions to LangChain, LangGraph, and agentic frameworks
  - Hackathons, research partnerships, and speaking engagements

Connect

I'm always open to a conversation — whether it's about a project, an opportunity, or a problem worth solving.


LinkedIn · Email · Portfolio · GitHub


"The best way to predict the future is to build it — one commit at a time."


Last updated: 2026 · Built with obsession and too much caffeine

Pinned

  1. kemlang (Public)

    A Gujarati programming language — learn to code with desi keywords like 'sharu', 'lakho', 'samaapt'. Built with Python + FastAPI + React.

    JavaScript 8

  2. ARIA (Public)

    🕵️ Autonomous multi-agent web research platform — parallel browser agents, swarm architecture. Groq LLaMA 3.3, Gemini 1.5 Pro, SSE streaming.

    TypeScript 1

  3. RedForge (Public)

    RedForge is an autonomous security orchestration platform that performs real HTTP probing (not simulations) on target URLs. It runs 11 parallel detection modules, correlates results into multi-stag…

    TypeScript 5

  4. SearchMind-API (Public)

    Open-source Tavily alternative for AI agents. Provides real-time search extraction, parallel page scraping, multi-query deep research, and AI answers, bundled with developer portal & admin tools.

    JavaScript 4

  5. tinyfish-io/bigset (Public)

    What if you had all the data in the world?

    TypeScript 1.6k 178

  6. CodeGuard (Public)

    🛡️ AI code security & PR risk analysis — auto-reviews PRs, detects vulnerabilities, opens fix PRs automatically. GPT-4o + GitHub Webhooks.

    TypeScript 7