Skip to content

fula language datasets - peul resources for natural language applications

Notifications You must be signed in to change notification settings

flutter-painter/awesome_fula_nl_resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

awesome_fula_nl_resources

1. context

Fula/Fulani is a language spoken by 40 million people across 18 countries in West and Central Africa.

It belongs to the Niger-Congo family, specifically the Atlantic-Congo branch, under the Atlantic group known as Senegambian languages. It is composed of numerous dialects, including Pulaar, Fulfulde, and Maasina.

Resources mentioned here favour Pular spoken in Guinea.

ISO-639 codes

  • 639-1 : ff - Fula/Fulah
  • ISO 639-2 : ful – Fula/Fulah
  • ISO 639-3 codes according to ethnologue.com and sil.org:
    • fuc – Pulaar (Senegambia, Mauritania)
    • fuf – Pular (Guinea, Sierra Leone)
    • ffm – Maasina Fulfulde (Mali, Ivory Coast, and Ghana by 1.6 m)
    • fue – Borgu Fulfulde (Benin, Togo)
    • fuh – Western Niger Fulfulde (Burkina, Niger)
    • fuq – Central–Eastern Niger Fulfulde (Niger)
    • fuv – Nigerian Fulfulde (Nigeria)
    • fub – Adamawa Fulfulde (Cameroon, Chad, Nigeria)
    • fui – Bagirmi Fulfulde (CAR)(Chad)

alphabet

Fula Language Speakers Map

map from maria-kosogorova FulaLanguageMap

2. tools

2.1. machine learning models

2.2. crowdsourcing platforms

2.3. android apps

2.4. github repo

3. datasets

3.1. datasets translation

3.1.1. Bible

Text already extracted and available in dataset/fra-ful/bible_fr_ff.txt

3.1.2. Coran

Text already extracted and available in dataset/fra-ful/quran_fr_ff.txt

sources :

3.1.3. NLLB

3.1.4. wikimedia

3.1.5. QED

3.1.6. copyrighted translations

3.2. dictionaries

3.2.1. online dictionaries

3.2.2. pdf dictionaries

3.3. dataset unlabeled text

3.4. datasets audio

4. other resources

4.1. fulfulde resources

About

fula language datasets - peul resources for natural language applications

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages