Description
Describe the bug
After installing the NuGet package Catalyst.Models.English (version 1.0.30952), I cannot find where the model .bin files are located. Only empty folders are created.
To Reproduce
Steps to reproduce the behavior
Create a C# console project with the following code:
using System;
using System.Diagnostics;
using System.IO;
using System.Linq;
using System.Text;
using System.Threading.Tasks;
using Catalyst;
using Catalyst.Models;
using Mosaik.Core;

namespace CatalystExample
{
    class Program
    {
        static async Task Main(string[] args)
        {
            // Set up local storage for models
            string storagePath =
                Path.Combine(Environment.GetFolderPath(Environment.SpecialFolder.ApplicationData), "CatalystModels");
            Console.WriteLine($"Storage Path: {storagePath}");
            Debug.Print($"Storage Path: {storagePath}");
            Storage.Current = new DiskStorage(storagePath);

            // Initialize the NLP pipeline
            Console.WriteLine("Loading NLP pipeline...");
            Debug.Print("Loading NLP pipeline...");
            var pipeline = await Pipeline.ForAsync(Language.English);

            // Add a part-of-speech (POS) tagger to the pipeline
            Console.WriteLine("Adding POS tagger to pipeline...");
            Debug.Print("Adding POS tagger to pipeline...");
            pipeline.Add(await AveragePerceptronTagger.FromStoreAsync(Language.English, Mosaik.Core.Version.Latest, tag: ""));

            // Text for processing
            string text = "Which city is the biggest city of Ireland?";
            Console.WriteLine($"Text for processing: {text}");
            Debug.Print($"Text for processing: {text}");
            var document = new Document(text, Language.English);

            // Process the document
            Console.WriteLine("Processing text...");
            Debug.Print("Processing text...");
            pipeline.ProcessSingle(document);

            // Display the tokens and their part-of-speech (POS) tags
            Console.WriteLine("Token\t\tPOS Tag");
            Console.WriteLine("-----------------------------");
            Debug.Print("Token\t\tPOS Tag");
            Debug.Print("-----------------------------");
            foreach (var sentence in document)
            {
                foreach (var token in sentence)
                {
                    Console.WriteLine($"{token.Value}\t\t{token.POS}");
                    Debug.Print($"{token.Value}\t\t{token.POS}");
                }
            }
        }
    }
}
Expected behavior
For the question "Which city is the biggest city of Ireland?", I expected to get the answer "Dublin".
Additional context
The directory listings below show that the folders are still empty after installing "Catalyst.Models.English" Version="1.0.30952":
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
01/12/2025  09:21 PM    <DIR>          Models
0 File(s) 0 bytes
3 Dir(s) 39,037,837,312 bytes free
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels\Models
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels\Models
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
01/12/2025  09:21 PM    <DIR>          en
0 File(s) 0 bytes
3 Dir(s) 39,037,833,216 bytes free
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
01/12/2025  09:21 PM    <DIR>          AveragePerceptronTaggerModel
01/12/2025  09:21 PM    <DIR>          SentenceDetectorModel
01/12/2025  09:21 PM    <DIR>          SentenceDetectorModel-Version
0 File(s) 0 bytes
5 Dir(s) 39,037,833,216 bytes free
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\AveragePerceptronTaggerModel
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\AveragePerceptronTaggerModel
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
01/12/2025  09:21 PM    <DIR>          v000000
0 File(s) 0 bytes
3 Dir(s) 39,037,698,048 bytes free
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\AveragePerceptronTaggerModel\v000000
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\AveragePerceptronTaggerModel\v000000
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
0 File(s) 0 bytes
2 Dir(s) 39,037,763,584 bytes free
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\SentenceDetectorModel
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\SentenceDetectorModel
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
01/12/2025  09:21 PM    <DIR>          v000000
0 File(s) 0 bytes
3 Dir(s) 39,037,820,928 bytes free
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\SentenceDetectorModel\v000000
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\SentenceDetectorModel\v000000
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
0 File(s) 0 bytes
2 Dir(s) 39,037,693,952 bytes free
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\SentenceDetectorModel-Version
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\SentenceDetectorModel-Version
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
01/12/2025  09:21 PM    <DIR>          v-000001
0 File(s) 0 bytes
3 Dir(s) 39,037,562,880 bytes free
C:\Users\zydjo>dir C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\SentenceDetectorModel-Version\v-000001
Volume in drive C is Windows
Volume Serial Number is 2E05-450C
Directory of C:\Users\zydjo\AppData\Roaming\CatalystModels\Models\en\SentenceDetectorModel-Version\v-000001
01/12/2025  09:21 PM    <DIR>          .
01/12/2025  09:21 PM    <DIR>          ..
0 File(s) 0 bytes
2 Dir(s) 39,037,628,416 bytes free
C:\Users\zydjo>
I am using Windows 11 with Visual Studio 2022, version 17.12.3.