Linux Speech Recognition #680

MattEqualsCoder · 2025-03-28T03:42:44Z

I still need to do my own proofreading of the code here, but wanted to get a PR out there finally. The goal here was to create an application and nuget package that could be used as close as possible with the setup we had before. I basically created classes that mirrored what we had before for building grammar, then added a way to build that into C# System.Speech native grammar for Windows users.

For Linux users, this utilizes the PySpeechService application I created, which is a Python application that uses gRPC to communicate back and forth to tracker. For text to speech, this uses Piper and for speech recognition it uses Vosk.

PySpeechService GitHub
Python app documentation
Grammar JSON documentation (This basically took the previous grammar builder we used and created a JSON format for it.)
C# nuget Package documentation

Still have some things I want to do before merging in:

Add SMZ3 documentation for Linux users
Review and potentially clean up code
Remove temp changes to the GitHub action
Test on Fedora and clean Arch & Linux Mint installs to verify setup steps
Create config PR for the speech replacements
Probably need to clean up some warnings

src/TrackerCouncil.Smz3.Tracking/Services/Speech/PySpeechRecognitionService.cs

Vivelin · 2025-03-28T06:30:52Z

src/TrackerCouncil.Smz3.Abstractions/TrackerCouncil.Smz3.Abstractions.csproj

    </ItemGroup>

+    <ItemGroup>
+      <PackageReference Include="PySpeechServiceClient" Version="0.1.0" />


If this is only available on Linux, maybe we should have a compiler symbol for this (if one isn't built-in already) and surrounding the code where it's used. Then we can avoid adding the dependency itself entirely on Windows if it'll never be used there anyway.

Well, it is used for the grammar construction even on Windows, but I could possibly pull that out into a separate project/repo so that just the actual services themselves can be done that way.

That being said, I do plan on making a Windows version just for possible testing for pronunciations if desired for Pink's streams. It's just a lower priority.

I split it out so that grammar building is in its on nuget package so that it could be used on Windows whereas the actual services be Linux only, and I marked all of the functions as Linux only for now until I get it built for Mac and/or Windows. However, I was having issues getting OS dependent compiler constants working for actually being able to make the built code different. Might give it another go.

MattEqualsCoder · 2025-03-31T03:08:08Z

Okay, I think I've got everything sorted out the best I can for right now. I want to do some end-to-end testing with SMZ3 on various OSes, and I want to do a bit of testing just to make sure this didn't break Windows at all. Once that's done, I think this is ready to merge in and have a new build created.

MattEqualsCoder · 2025-04-02T02:32:36Z

Okay, I think this one is about as done as it's going to be for now. I tested SMZ3 with it in Linux Mint, Fedora, and Arch. Also should have fixed the issue Pink ran into with items not being tracked together, and fixed a potential issue that might come up with two identical messages back to back.

MattEqualsCoder and others added 13 commits March 2, 2025 18:29

Add voice recognition and TTS support for Linux

da1eecc

Merge branch 'main' into linux-speech-recognition

6d0e3a9

Update version

7155d2d

Fix freeze when closing and PySpeechService is not setup

89e4f4e

Add volume setting

9c1af21

Merge remote-tracking branch 'origin/main' into linux-speech-recognition

1dbd417

Fix issues from merge

077d733

Merge branch 'main' into linux-speech-recognition

a3a1859

Fix tracker sprite not updating

8f038d6

Merge branch 'main' into linux-speech-recognition

33af914

Merge branch 'main' into linux-speech-recognition

87d9686

Update PyTextToSpeechCommunicator to match recent changes

bb59586

Merge branch 'main' into linux-speech-recognition

fd7ba71

Vivelin previously approved these changes Mar 28, 2025

View reviewed changes

Updates for split nuget packages

f94ace5

MattEqualsCoder dismissed Vivelin’s stale review via f94ace5 March 29, 2025 00:25

MattEqualsCoder added 3 commits March 29, 2025 00:54

Cleanup abstractions project

0832a48

Update documentation and nuget package

24d3030

Update readme and remove test branch from GitHub action

1ac281b

CPColin previously approved these changes Mar 31, 2025

View reviewed changes

Fix PySpeechService communicator not combining item tracking

b317a4b

MattEqualsCoder dismissed CPColin’s stale review via b317a4b April 2, 2025 02:26

MattEqualsCoder and others added 2 commits April 1, 2025 22:30

Add documentation around custom voice models

4e95989

Merge branch 'main' into linux-speech-recognition

355c7b5

Cleanup log statements

a6d7ec4

CPColin approved these changes Apr 2, 2025

View reviewed changes

MattEqualsCoder merged commit e9b6dd8 into main Apr 2, 2025
2 checks passed

MattEqualsCoder deleted the linux-speech-recognition branch April 2, 2025 03:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Linux Speech Recognition #680

Linux Speech Recognition #680

Uh oh!

MattEqualsCoder commented Mar 28, 2025 •

edited

Loading

Uh oh!

Uh oh!

Vivelin Mar 28, 2025

Uh oh!

MattEqualsCoder Mar 28, 2025

Uh oh!

MattEqualsCoder Mar 29, 2025

Uh oh!

MattEqualsCoder commented Mar 31, 2025

Uh oh!

MattEqualsCoder commented Apr 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Linux Speech Recognition #680

Linux Speech Recognition #680

Uh oh!

Conversation

MattEqualsCoder commented Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Vivelin Mar 28, 2025

Choose a reason for hiding this comment

Uh oh!

MattEqualsCoder Mar 28, 2025

Choose a reason for hiding this comment

Uh oh!

MattEqualsCoder Mar 29, 2025

Choose a reason for hiding this comment

Uh oh!

MattEqualsCoder commented Mar 31, 2025

Uh oh!

MattEqualsCoder commented Apr 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

MattEqualsCoder commented Mar 28, 2025 •

edited

Loading