Skip to content
Change the repository type filter

All

    Repositories list

    • The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.
      Python
      718350Updated Feb 4, 2026Feb 4, 2026
    • Melos

      Public
      Melos: Sentence-to-section Training with Multi-task Learning for LLM-Driven Song Generation
      JavaScript
      0000Updated Jan 30, 2026Jan 30, 2026
    • HASap

      Public
      Official repository for HASAP: HASap: Hierarchical Acoustic-Semantic Annotation Pipeline for Scripted Speech Data
      Python
      0000Updated Jan 28, 2026Jan 28, 2026
    • https://thuhcsi.github.io/
      HTML
      Other
      2400Updated Sep 20, 2025Sep 20, 2025
    • AutoStyle-TTS:Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis
      JavaScript
      1000Updated Jun 26, 2025Jun 26, 2025
    • HTML
      1000Updated Jun 25, 2025Jun 25, 2025
    • AdaMesh

      Public
      Python
      21820Updated Jun 14, 2025Jun 14, 2025
    • StarVC

      Public
      HTML
      0000Updated May 28, 2025May 28, 2025
    • WAKE

      Public
      Python
      21020Updated May 28, 2025May 28, 2025
    • HTML
      MIT License
      0000Updated May 20, 2025May 20, 2025
    • PerTTS

      Public
      HTML
      0000Updated Mar 10, 2025Mar 10, 2025
    • TES-VC

      Public template
      Create a site or blog from your GitHub repositories with GitHub Pages.
      JavaScript
      MIT License
      9.1k000Updated Mar 1, 2025Mar 1, 2025
    • NeuFA

      Public
      Neural network-based forced alignment with bidirectional attention mechanism
      Python
      87850Updated Jan 17, 2025Jan 17, 2025
    • Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>
      Python
      101800Updated Nov 27, 2024Nov 27, 2024
    • VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
      Python
      MIT License
      69640Updated Nov 9, 2024Nov 9, 2024
    • CoVoC2024

      Public
      HTML
      0000Updated Nov 6, 2024Nov 6, 2024
    • ex2 for dpss
      Python
      1700Updated Nov 5, 2024Nov 5, 2024
    • JavaScript
      0010Updated Oct 30, 2024Oct 30, 2024
    • MagicMan

      Public
      Official repository for paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"
      Python
      MIT License
      1231961Updated Sep 16, 2024Sep 16, 2024
    • BinaryAUD

      Public
      HTML
      0000Updated Sep 10, 2024Sep 10, 2024
    • Python
      MIT License
      1400Updated Jul 29, 2024Jul 29, 2024
    • SECap

      Public
      Python
      1217730Updated Jul 9, 2024Jul 9, 2024
    • Python
      MIT License
      713340Updated Jul 8, 2024Jul 8, 2024
    • Please visit https://thuhcsi.github.io/interspeech2024-CSG
      SCSS
      Creative Commons Zero v1.0 Universal
      1000Updated Jun 12, 2024Jun 12, 2024
    • HTML
      0000Updated Jun 12, 2024Jun 12, 2024
    • NeuCoSVC

      Public
      Python
      4329840Updated May 22, 2024May 22, 2024
    • SCNet

      Public
      Python
      MIT License
      1910Updated Apr 18, 2024Apr 18, 2024
    • SCSS
      Creative Commons Zero v1.0 Universal
      0000Updated Jan 5, 2024Jan 5, 2024
    • SCSS
      Creative Commons Zero v1.0 Universal
      0000Updated Jan 5, 2024Jan 5, 2024
    • Please visit https://thuhcsi.github.io/icassp2023-coherent-tts
      SCSS
      Creative Commons Zero v1.0 Universal
      0200Updated Nov 11, 2023Nov 11, 2023