Skip to content
@kuhumcst

Centre for Language Technology, University of Copenhagen

Popular repositories Loading

  1. cstlemma cstlemma Public

    Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervis…

    C++ 36 7

  2. stucco stucco Public archive

    An experimental adaptive UI toolkit.

    Clojure 30 1

  3. DanNet DanNet Public

    The Danish WordNet as an RDF graph.

    Clojure 24

  4. xml-hiccup xml-hiccup Public

    Convert XML into Hiccup in Clojure and ClojureScript.

    Clojure 22 1

  5. taggerXML taggerXML Public

    Modernized version of Eric Brill's Part Of Speech tagger.

    C++ 15 6

  6. tf-idf tf-idf Public

    A reasonably performant TF-IDF implementation.

    Clojure 12 1

Repositories

Showing 10 of 66 repositories
  • DanNet Public

    The Danish WordNet as an RDF graph.

    kuhumcst/DanNet’s past year of commit activity
    Clojure 24 MIT 0 30 0 Updated Feb 12, 2026
  • dspace-angular Public Forked from ufal/dspace-angular

    DSpace 7.x (and above) User Interface built on Angular.io

    kuhumcst/dspace-angular’s past year of commit activity
    TypeScript 0 BSD-3-Clause 513 0 0 Updated Feb 6, 2026
  • OpenPose2tab Public

    Read all OpenPose json files and create single tab separated file for pose_keypoints_2d for single person

    kuhumcst/OpenPose2tab’s past year of commit activity
    0 GPL-3.0 0 0 0 Updated Jan 14, 2026
  • letterfunc Public

    Functions for upper/lower casing, for testing whether a character is a letter and for conversion between Unicode encodings UTF-8 and UTF-16

    kuhumcst/letterfunc’s past year of commit activity
    C 2 GPL-2.0 1 0 0 Updated Nov 18, 2025
  • kuhumcst/danish-semantic-reasoning-benchmark’s past year of commit activity
    3 0 0 0 Updated Oct 31, 2025
  • texton Public

    Text Tonsorium - a toolbox that automatically arranges NLP tools in workflows and enacts them with user's inputs

    kuhumcst/texton’s past year of commit activity
    PHP 5 0 1 0 Updated Jul 11, 2025
  • cstlemma Public

    Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.

    kuhumcst/cstlemma’s past year of commit activity
    C++ 36 GPL-2.0 7 0 0 Updated Jun 26, 2025
  • texton-linguistic-resources Public

    Linguistic resources for several of the tools included in the Text Tonsorium

    kuhumcst/texton-linguistic-resources’s past year of commit activity
    Roff 1 0 0 0 Updated Jun 23, 2025
  • texton-bin Public

    Binary executable files used by services in the Text Tonsorium.

    kuhumcst/texton-bin’s past year of commit activity
    0 0 0 0 Updated May 6, 2025
  • taggerXML Public

    Modernized version of Eric Brill's Part Of Speech tagger.

    kuhumcst/taggerXML’s past year of commit activity
    C++ 15 GPL-2.0 6 1 0 Updated May 6, 2025

Top languages

Loading…

Most used topics

Loading…