CLDF dataset derived from Ritchie et al.'s "UniNum: A Database of Number Names for 186 Languages" from 2019


How to cite

If you use these data, please cite:

  • the original source

    Ritchie, S., Sproat, R., Gorman, K., van Esch, D., Schallhart, C., Bampounis, N., Brard, B., Mortensen, J. F., Holt, M., and Mahon, E. 2019. Unified verbalization for speech recognition & synthesis across languages. In Proc. INTERSPEECH, pages 3530-3534.

  • the derived dataset, using the DOI of the particular released version you used

Description

A collection of numerals ranging from 0 to 100,000,000,000 (10^11), inclusive, provided by Google and language experts.

This dataset is licensed under a CC-BY-4.0 license.

Available online at https://github.com/google/uninum

Conceptlists in Concepticon:

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 100%

  • Varieties: 182
  • Concepts: 111
  • Lexemes: 19,877
  • Sources: 1
  • Synonymy: 1.01
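
The figures above can in principle be recomputed from the dataset's form table. The following is a minimal sketch, not the tooling that produced the numbers in this README. It assumes the default CLDF FormTable column names `Language_ID` and `Parameter_ID`, and it assumes that synonymy means the number of forms per concept, averaged over varieties. The function name `form_table_stats` and the sample rows are hypothetical and purely illustrative.

```python
import csv
import io
from collections import Counter, defaultdict


def form_table_stats(rows):
    """Summarize FormTable rows given as dicts (hypothetical helper)."""
    # Count forms per concept, separately for each variety.
    per_variety = defaultdict(Counter)
    lexemes = 0
    for row in rows:
        per_variety[row["Language_ID"]][row["Parameter_ID"]] += 1
        lexemes += 1

    concepts = {c for counts in per_variety.values() for c in counts}

    # Assumed definition: forms per concept within a variety,
    # then the mean of that ratio across all varieties.
    ratios = [sum(c.values()) / len(c) for c in per_variety.values()]
    synonymy = sum(ratios) / len(ratios) if ratios else 0.0

    return {
        "varieties": len(per_variety),
        "concepts": len(concepts),
        "lexemes": lexemes,
        "synonymy": round(synonymy, 2),
    }


# Made-up placeholder rows, not taken from the dataset.
SAMPLE = """ID,Language_ID,Parameter_ID,Form
1,lang_a,one,aaa
2,lang_a,two,bbb
3,lang_b,one,ccc
4,lang_b,one,ddd
5,lang_b,two,eee
"""

stats = form_table_stats(csv.DictReader(io.StringIO(SAMPLE)))
print(stats)
```

To try this on the real data, pass a `csv.DictReader` opened on the form table in the `cldf` directory (the file name `forms.csv` is the usual default, but that is an assumption here). The result may differ slightly from the published figures if the assumed synonymy definition does not match the one used to generate them.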

Contributors

Name | GitHub user | Description | Role
Christoph Rzymski | @chrzyki | patron, code | Other

CLDF Datasets

The following CLDF datasets are available in cldf: