Current alphabet support is limited to ISO 8859-1 8-bit ASCII. Many word corpora are based on other character sets.