    Repositories list

    • Voice quality manipulation for TTS – controllably modify breathiness, creakiness, and other voice characteristics using a conditioned CNF manipulation block.
      Python
      MIT License
      0100 Updated Mar 6, 2026
    • A collection of common functionality to simplify the design, training, and evaluation of machine learning models based on PyTorch, with an emphasis on speech proc…
      Python
      MIT License
      167245 Updated Feb 26, 2026
    • Python
      MIT License
      94050 Updated Feb 18, 2026
    • meeteval

      Public
      MeetEval - A meeting transcription evaluation toolkit
      Python
      MIT License
      1814761 Updated Jan 27, 2026
    • dlp_mpi

      Public
      Python
      MIT License
      2600 Updated Jan 20, 2026
    • lazy_dataset: Process large datasets as if they were an iterable.
      Python
      MIT License
      81831 Updated Dec 1, 2025
    • local_sqa

      Public
      Localizable Speech Quality Assessment
      Python
      MIT License
      1200 Updated Dec 1, 2025
    • Python
      MIT License
      1700 Updated Oct 7, 2025
    • Python
      MIT License
      2310 Updated Oct 2, 2025
    • Evaluation code for the Interspeech publication "Towards Frame-level Quality Predictions of Synthetic Speech". Evaluate frame-level representations of MOS predi…
      Python
      MIT License
      11400 Updated Aug 15, 2025
    • Jupyter notebooks for the lecture "Nachrichtentechnik" (communications engineering) with explanations in German.
      Jupyter Notebook
      4500 Updated Jul 24, 2025
    • paderbox

      Public
      Paderbox: A collection of utilities for audio / speech processing
      Python
      MIT License
      1274324 Updated Jul 21, 2025
    • https://fgnt.github.io/meeteval_viz/ Demo pages for meeteval alignment visualization
      HTML
      0000 Updated Jun 4, 2025
    • pb_bss

      Public
      Collection of EM algorithms for blind source separation of audio signals
      Python
      MIT License
      6329831 Updated May 19, 2025
    • ci_sdr

      Public
      Python
      MIT License
      85300 Updated May 15, 2025
    • paderwasn

      Public
      Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).
      Python
      MIT License
      71800 Updated May 8, 2025
    • Jupyter Notebook
      4900 Updated Apr 14, 2025
    • nara_wpe

      Public
      Different implementations of "Weighted Prediction Error" for speech dereverberation
      Python
      MIT License
      166557111 Updated Mar 19, 2025
    • pb_chime5

      Public
      Speech enhancement system for the CHiME-5 dinner party scenario
      Python
      MIT License
      3410931 Updated Feb 6, 2025
    • mms_msg

      Public
      Multipurpose Multi Speaker Mixture Signal Generator
      Python
      MIT License
      94600 Updated Feb 6, 2025
    • Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
      Python
      MIT License
      11300 Updated Feb 5, 2025
    • mnist

      Public
      Makefile
      152600 Updated Jan 21, 2025
    • libriwasn

      Public
      Tools and scripts for the LibriWASN data set from zenodo
      Python
      MIT License
      1600 Updated Jul 15, 2024
    • sms_wsj

      Public
      SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
      Python
      MIT License
      2712852 Updated Jun 7, 2024
    • Jupyter Notebook
      0000 Updated Apr 16, 2024
    • pb_sed

      Public
      Paderborn Sound Event Detection
      Python
      MIT License
      97940 Updated Jul 18, 2023
    • sins

      Public
      Python
      MIT License
      1810 Updated Oct 28, 2022
    • graph_pit

      Public
      Python
      Other
      93900 Updated Oct 14, 2022
    • 0000 Updated Oct 7, 2022
    • asnsig

      Public
      ASNSIG – A Signal Generator for Ad-Hoc Acoustic Sensor Networks in Smart Home Environments
      Python
      MIT License
      2200 Updated Aug 31, 2022