ChatVRM Agent

A 3D digital-human conversation demo — load a VRM avatar in the browser and chat with an LLM-driven character via voice. Forked from pixiv/ChatVRM.

🌐 Languages: English · 中文 · 日本語

Overview

ChatVRM Agent is a browser-based demo that lets you have a real-time conversation with a 3D character. Import a VRM model, pick a voice, and the avatar will listen to your microphone, generate a response through an LLM, speak it back with emotional expression, and animate its body and face to match.

This repository is a fork of the Pixiv original, repackaged and re-documented for English-first users while keeping Chinese and Japanese translations one click away.

Features

🎭 VRM avatar support — drop in any .vrm file and the character appears in the browser.
🎤 Voice input — uses the browser's Web Speech API (SpeechRecognition).
🤖 LLM responses — replies generated through the OpenAI Chat Completions API.
🗣️ Emotional TTS — Japanese expressive voice synthesis via Koeiromap (Koemotion / rinna).
👀 Facial expressions & idle motion — blinking, gaze tracking, and lip sync driven by @pixiv/three-vrm.
⚙️ Tunable character — adjust voice parameters, speaking style, and system prompt at runtime from the settings panel.

Tech Stack

Layer	Technology
Framework	Next.js 13 (Pages Router)
Language	TypeScript 5
UI	React 18, Tailwind CSS
3D	three.js, @pixiv/three-vrm
LLM	OpenAI Chat Completions API
TTS	Koeiromap (rinna)
STT	Web Speech API

Getting Started

Prerequisites

Node.js 16.14.2 (see engines in package.json) or a compatible 16.x / 18.x
npm (bundled with Node.js)
An OpenAI API key with access to a chat model (e.g. gpt-4o-mini, gpt-4o, gpt-3.5-turbo)

1. Clone and install

git clone https://github.com/badhope/ChatVRM-Agent.git
cd ChatVRM-Agent
npm install

2. Configure environment

Copy the example file and fill in your secrets:

cp .env.example .env

Then edit .env:

# OpenAI API key used by /api/chat
OPEN_AI_KEY=sk-...

💡 You can also paste the key directly into the in-app settings panel at runtime; it will be used for that session only and never written to disk.

3. Run the dev server

npm run dev

Open http://localhost:3000 in a modern Chromium-based browser (Chrome / Edge recommended for full Web Speech API support).

4. Build for production

npm run build
npm run start

To export a fully static site (no API routes, no server):

npm run export

⚠️ The static export cannot call /api/chat or /api/tts server-side. To use those routes in production, deploy to a Node-capable host (Vercel, Railway, Fly.io, your own VPS).

Configuration

All configuration is read from environment variables at server start and from the in-app settings panel at runtime. See .env.example for the full list.

Variable	Required	Default	Description
`OPEN_AI_KEY`	Yes (server)	—	OpenAI API key. Falls back to the value entered in the UI.

Project Structure

ChatVRM-Agent/
├── .github/
│   ├── workflows/          # CI (next build)
│   ├── ISSUE_TEMPLATE/     # Bug report & feature request forms
│   └── PULL_REQUEST_TEMPLATE.md
├── public/                 # Static assets (default VRM, idle animation, OGP image)
├── src/
│   ├── components/         # React UI (chat log, settings, menu, VRM viewer, …)
│   ├── features/
│   │   ├── chat/           # OpenAI client
│   │   ├── emoteController/  # Blink, gaze, expression
│   │   ├── koeiromap/      # TTS client
│   │   ├── lipSync/        # Mouth animation from audio analysis
│   │   ├── messages/       # Message store, TTS orchestration
│   │   └── vrmViewer/      # three.js + three-vrm wrapper
│   ├── lib/                # VRM / VRMA loaders and helpers
│   ├── pages/
│   │   ├── api/            # /api/chat, /api/tts
│   │   ├── _app.tsx
│   │   ├── _document.tsx
│   │   └── index.tsx
│   ├── styles/             # globals.css
│   └── utils/              # Small helpers
├── .editorconfig
├── .env.example
├── .eslintrc.json
├── .gitignore
├── CODE_OF_CONDUCT.md
├── CONTRIBUTING.md
├── LICENSE                 # MIT (inherited from pixiv/ChatVRM)
├── README.md               # You are here
├── README.zh.md
├── README.ja.md
├── SECURITY.md
├── next.config.js
├── package.json
├── postcss.config.js
├── tailwind.config.js
├── tsconfig.json
└── watch.json

Scripts

Command	What it does
`npm run dev`	Start the Next.js dev server on port 3000 with HMR.
`npm run build`	Produce an optimized production build.
`npm run start`	Run the production build.
`npm run export`	Export a static site to `out/`.
`npm run lint`	Run ESLint with the Next.js config.

API Routes

POST /api/chat — proxies a chat completion request to OpenAI. Body: { apiKey?: string, messages: ChatMessage[] }. Falls back to process.env.OPEN_AI_KEY.
POST /api/tts — calls Koeiromap to synthesize Japanese speech. Body: { text: string, speakerX: number, speakerY: number }.

Privacy & Data

Conversation history is kept in-memory in the browser only. Reloading the page clears it.
Audio is captured locally by the Web Speech API. The recognised text is sent to OpenAI (or your configured LLM) and to Koeiromap for synthesis.
No telemetry or analytics are collected by this codebase.
See OpenAI's data usage policy and Koeiromap's terms for how those providers handle your data.

Contributing

Contributions are welcome! Please read CONTRIBUTING.md first, and follow the Code of Conduct. Bug reports and feature requests should use the issue templates in .github/ISSUE_TEMPLATE/.

Security

Please do not file public issues for security problems. See SECURITY.md for the reporting procedure.

License

This project is released under the MIT License — see LICENSE. Originally created by pixiv; this fork is maintained by @badhope.

Acknowledgments

pixiv — original ChatVRM project and @pixiv/three-vrm
rinna / Koemotion — Koeiromap expressive TTS
OpenAI — Chat Completions API
All contributors and the open-source VRM community

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChatVRM Agent

Overview

Features

Tech Stack

Getting Started

Prerequisites

1. Clone and install

2. Configure environment

3. Run the dev server

4. Build for production

Configuration

Project Structure

Scripts

API Routes

Privacy & Data

Contributing

Security

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github		.github
public		public
src		src
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.env.example		.env.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.nvmrc		.nvmrc
.prettierignore		.prettierignore
.prettierrc		.prettierrc
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.ja.md		README.ja.md
README.md		README.md
README.zh.md		README.zh.md
SECURITY.md		SECURITY.md
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
vercel.json		vercel.json
vitest.config.ts		vitest.config.ts
watch.json		watch.json

Folders and files

Latest commit

History

Repository files navigation

ChatVRM Agent

Overview

Features

Tech Stack

Getting Started

Prerequisites

1. Clone and install

2. Configure environment

3. Run the dev server

4. Build for production

Configuration

Project Structure

Scripts

API Routes

Privacy & Data

Contributing

Security

License

Acknowledgments

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages