OpenMOSS (SII)

Introduction 👋

OpenMOSS Team is a research group under the Shanghai Innovation Institution (SII), working in close collaboration with Fudan University and MOSI Intelligence. Led by Prof. Xipeng Qiu, the team conducts cutting-edge research on large language models (LLMs), advancing the frontiers of model architecture, evaluation, and application with a strong commitment to open, collaborative, and impactful AI innovation.

We warmly welcome researchers, students, and collaborators who share our vision to join us in pushing the boundaries of LLM technology. For inquiries or collaboration opportunities, please contact us at openmoss@sii.edu.cn.

🌐 Website: https://openmoss.github.io/ or http://openmoss.sii.edu.cn/

💻 GitHub: https://github.com/OpenMOSS

  • SII is dedicated to fostering innovation in education and research in the field of artificial intelligence.

Pinned

  1. MOSS Public

    An open-source tool-augmented conversational language model from Fudan University

    Python · 12.1k stars · 1.1k forks

  2. MOSS-VL Public

    MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

    Python · 224 stars · 4 forks

  3. MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

    Python · 1.6k stars · 139 forks

  4. MOVA Public

    MOVA: Towards Scalable and Synchronized Video–Audio Generation

    Python · 951 stars · 77 forks

  5. MOSS-TTS-Nano Public

    MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…

    Python · 1.6k stars · 189 forks

  6. MOSS-Audio Public

    MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenarios.

    Python · 117 stars · 3 forks

Repositories

Showing 10 of 50 repositories
  • Llamascopium Public

    Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.

    Python · 213 stars · 28 forks · 8 issues · 0 pull requests · Updated Apr 18, 2026
  • MOSS-TTS-Nano-Reader
    JavaScript · 20 stars · 1 fork · 0 issues · 0 pull requests · Updated Apr 17, 2026
  • MOSS-TTS-Nano Public

    MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run directly on CPU without a GPU, and keeps the deployment stack simple enough for local demos, web serving, and lightweight product integration.

    Python · Apache-2.0 · 1,552 stars · 189 forks · 27 issues · 3 pull requests · Updated Apr 17, 2026
  • MOSS-Audio Public

    MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenarios.

    Python · 117 stars · 3 forks · 1 issue · 0 pull requests · Updated Apr 16, 2026
  • mlx-audio Public (forked from Blaizzy/mlx-audio)

    A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

    Python · MIT · 5 stars · 566 forks · 0 issues · 0 pull requests · Updated Apr 16, 2026
  • MOSS-VL Public

    MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

    Python · Apache-2.0 · 224 stars · 4 forks · 1 issue · 0 pull requests · Updated Apr 14, 2026
  • sglang Public
    Python · Apache-2.0 · 3 stars · 0 forks · 0 issues · 0 pull requests · Updated Apr 14, 2026
  • MOSS-Audio-Tokenizer Public

    MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA reconstruction and strong performance in generation and understanding—serving as a unified interface for next-generation native audio language models.

    Python · Apache-2.0 · 193 stars · 13 forks · 3 issues · 1 pull request · Updated Apr 13, 2026
  • MOSS-TTS-Nano-Demo
    CSS · 1 star · 1 fork · 0 issues · 0 pull requests · Updated Apr 13, 2026
  • MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

    Python · Apache-2.0 · 1,550 stars · 139 forks · 23 issues · 1 pull request · Updated Apr 13, 2026