Commit 60253f3

mention how language data can be generated
1 parent bb9ff6f commit 60253f3

1 file changed: +9 -0 lines changed

README.rst

Lines changed: 9 additions & 0 deletions
@@ -155,3 +155,12 @@ Unit test status
 
 .. image:: https://travis-ci.org/pear/Text_LanguageDetect.svg?branch=master
     :target: https://travis-ci.org/pear/Text_LanguageDetect
+
+
+Notes
+=====
+Where are the data from?
+
+I don't recall where I got the original data set.
+It's just the frequencies of 3-letter combinations in each supported language.
+It could be generated from a few random wikipedia pages from each language.
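
The note above describes the language data as frequencies of 3-letter combinations. As a rough illustration of how such data might be regenerated from sample text (for example, text copied from a few Wikipedia pages), here is a minimal Python sketch. It is not the library's own generator, and it does not produce the library's data file format. The function name ``trigram_frequencies``, the ``top_n`` cutoff, and the choice to count trigrams only inside words are all assumptions made for this example.

```python
import re
from collections import Counter


def trigram_frequencies(text, top_n=300):
    """Return the top_n most common 3-letter combinations in text,
    mapped to their relative frequency among all trigrams counted.

    Hypothetical helper: the name, the top_n cutoff, and the
    within-word counting rule are assumptions, not the library's method.
    """
    counts = Counter()
    # Lowercase the text and keep only runs of letters (Unicode-aware).
    for word in re.findall(r"[^\W\d_]+", text.lower()):
        # Slide a 3-character window over each word.
        for i in range(len(word) - 2):
            counts[word[i:i + 3]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tri: n / total for tri, n in counts.most_common(top_n)}
```

Running this over sample text for each language and storing one frequency table per language would give data of the general shape the note describes. How closely it matches the original data set is unknown, since the source of that set is not recorded.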
