Skip to content

Chinese word segmentation model for spaCy #12923

@PythonCancer

Description

@PythonCancer

The Chinese word segmentation model zh_core_web_sm-3.5.0 in spaCy has two files. One is weights.npz, which contains dimensions and model weight values, and I can understand that. The other file is features.msgpack; what is this file for? Is it for features? Because I want to train my own word segmentation model and embed it into spaCy, can you explain it?

Metadata

Metadata

Assignees

No one assigned

    Labels

    lang / zhChinese language data and modelsmodelsIssues related to the statistical modelsthird-partyThird-party packages and services

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions