-
Notifications
You must be signed in to change notification settings - Fork 141
Future Implementations for classes - Measure, Money and Date #257
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
Closed
Changes from 250 commits
Commits
Show all changes
688 commits
Select commit
Hold shift + click to select a range
bab1fcc
Disables Hindi ITN L0 checks
zoobereq a152461
Reapplies ITN CI Checks
zoobereq b8c592f
Adds missing inits
zoobereq ec94af1
resolved the failing sparrowhawk test cases failed
ngachchi 2ea146f
added new graph for symbols
ngachchi 2a9e3d2
Hindi TN Support for Cardinal, Decimal, Fraction, Date, Time, Money a…
ngachchi d93bf4b
added into(x) symbol dependency for measure class
ngachchi f33e847
working on measure class
ngachchi 5310a01
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 866a7c0
Hindi TN changes
ngachchi e67a6d1
Updated date for Hindi TN cache
ngachchi 0760796
Combined Hindi TN and ITN seperate blocks into single
ngachchi 3fad604
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 1df58a7
Added init.py files and removed unused commented lines
ngachchi 34e96c6
Whitelist and Word class changes
ngachchi 4d04ad4
post processor changes with minor fixes
ngachchi 9414172
removed unused imports and statements
ngachchi c651d42
Hindi ITN - Addition of Whitelist and Word (#248)
ngachchi 3692ad6
refactoring minor currency instead of direct implementation of paise
ngachchi bf8aa47
Implements support for minor currency denominations
zoobereq c169881
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 4664fa2
added unit test cases and minor fixes
ngachchi 32efeec
added missing units to improve accuracy for measure class
ngachchi 27aab2c
Updates the cache
ngachchi 66e7b0e
fixed the sparrowhawk to trim extra space
ngachchi 1c02443
removed unused english whitelist files
ngachchi d88a361
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] f9bcfc9
reverted to previous logic
ngachchi 9fd1d72
Jp tn 20241017 (#240)
ngachchi cf76fa6
Hindi TN changes
ngachchi 1fa34b1
Updated date for Hindi TN cache
ngachchi a5bc674
additional whitelist class .tsv files and unused imports removed
ngachchi 6739784
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 8f85b3f
incorporated suggestions for unused statements and another for closin…
ngachchi 8a97026
Hindi ITN Support for Cardinal, Decimal, Ordinal, Fraction, Date, Tim…
tarushi2k2 48c5233
Combined Hindi TN and ITN seperate blocks into single
ngachchi bec0590
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 4802ab1
Added init.py files and removed unused commented lines
ngachchi 84e7fe5
commented irrevelant references and unused snippets from whitelist an…
ngachchi eb6b66d
Whitelist and Word class changes
ngachchi 00ab03f
post processor changes with minor fixes
ngachchi 34e3535
remove space before punctuation for sparrowhawk file
ngachchi e265b02
minor fixes for measure class
ngachchi 6abf375
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 1118d83
Updated Jenkinsfile
ngachchi 5952b47
removed unused imports and statements
ngachchi e17b6d2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 98d8090
updated date stamp for HI cache and commented ITN grammars
ngachchi 45c95ec
Updates the cache
zoobereq 4b66164
Disables Hindi ITN L0 checks
zoobereq f09bf2a
Reapplies ITN CI Checks
zoobereq 14a8a70
resolved the failing sparrowhawk test cases failed
ngachchi bcd2cb7
added new graph for symbols
ngachchi 8be056d
Hindi TN Support for Cardinal, Decimal, Fraction, Date, Time, Money a…
ngachchi ab3c797
added into(x) symbol dependency for measure class
ngachchi 55c03da
working on measure class
ngachchi 52015ab
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 3972f76
Hindi TN changes
ngachchi 59bd066
Updated date for Hindi TN cache
ngachchi 965031f
Combined Hindi TN and ITN seperate blocks into single
ngachchi a7ce4ce
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 7f2f71c
Added init.py files and removed unused commented lines
ngachchi 5dcce8a
Whitelist and Word class changes
ngachchi 7b790af
post processor changes with minor fixes
ngachchi 2c79083
removed unused imports and statements
ngachchi 003718b
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 443cfc3
refactoring minor currency instead of direct implementation of paise
ngachchi 451b4c0
Implements support for minor currency denominations
zoobereq fc838c6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] db3f56c
added unit test cases and minor fixes
ngachchi 49b90a6
added missing units to improve accuracy for measure class
ngachchi 2890815
Updates the cache
zoobereq 8276843
fixed the sparrowhawk to trim extra space
ngachchi cb0207b
removed unused english whitelist files
ngachchi 2b2b16e
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] adf1bbc
reverted to previous logic
ngachchi 88e400f
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 1018d06
Updates the cache
zoobereq 2337f57
Updates the cache again
zoobereq 866c6ab
dedh and dhai implementation approach
ngachchi 14e65fd
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 200717e
Fix space issue with ZH ITN (#244)
zoobereq 18d1293
contributing update (#251)
tbartley94 d086d0b
fix bug #111 (ar currencies) (#117)
mgrafu 68a6e4a
Logging clean up + IT TN fix (#118)
ekmb 2613972
Time_IT_TN (#105)
GiacomoLeoneMaria 301e1ac
IT TN improvement on tests (#120)
mgrafu 01e4d71
add single letter exception for roman numerals (#121)
mgrafu 382ec01
Increase weights for serial (en TN) (#128)
anand-nv 2e218a0
add measures file for FR TN (#131)
mgrafu 1def7ed
Sh jenkins (#127)
anand-nv abce5aa
update isort - fix precommit (#138)
ekmb b0b527d
Armenian itn (#136)
davidks13 6d3f40b
Fix CI (#142)
ekmb 692cbbf
Armenian TN (#137)
davidks13 55c3004
Marathi ITN (#134)
ChinmayPatil11 a9c30c0
jenkins fix (#150)
tbartley94 8587845
r0.3.0 release (#151)
ekmb b7f923b
remove unused function from ar tn decimals (#165)
mgrafu 9ec2dd3
ZH sentence-level TN (#112)
BuyuanCui 6eefc3e
preparing release, updating change log (#168)
tbartley94 fbaf7d2
hotfix (#169)
ekmb 119dc1b
hotfix (#170)
tbartley94 5eba76c
DE TN Fixes (#177)
zoobereq b9c5049
Tts en tech terms (#167)
mgrafu 836d229
FR TN Fixes (#181)
zoobereq 1079175
EN TN fixes for Issue #166 (#185)
zoobereq 39a0d3d
IT TN Fixes for #166 (#183)
zoobereq 11804f1
HU TN Fixes issue #166 (#184)
zoobereq 449a2f4
Jp itn 20240221 (#141)
BuyuanCui 52a356b
update en tn folder to see if CI tests run - DO NOT MERGE (#199)
anand-nv e80c174
Reverts EN TN fixes for Issue #166 (#202)
zoobereq 7b0f5f7
es and es_en changes for unified models (#143)
mgrafu 973417a
ES TN Fixes for Issue #166 (#206)
zoobereq 579f90a
Zh tn bug 240712 (#187)
BuyuanCui e6cf450
EN TN Fixes for Issue 166 (#207)
zoobereq fda11ca
Fix for nv-bug 4786175 (#213)
zoobereq 40a8871
Release commit r1.1.0 (#217)
tbartley94 3cd1d04
EN TN Fixes for nv-bug 4786225 (#218)
zoobereq f123cf3
Applies fixes for nv-bug 4786263 (#220)
zoobereq 728eb6d
Fix invalid escape sequences (#219)
TheKevJames 24dc2d9
IT TN Fixes for Issue #166 (#221)
zoobereq 30c61c8
ES TN Fix for Issue #166 (#224)
zoobereq a413124
Expands per/unit mappings and updates the cache (#227)
zoobereq 0eb5ab1
Cardinals up to a hundred trillions, timeFST and transliteration (#209)
kurt0cougar b9b4702
Fix for issue #211 (#232)
mgrafu ac6bb08
Jp itn update 240805 (#208)
BuyuanCui 00511fe
DE TN Fix for Issue #228 (#237)
zoobereq 212103a
Jp tn 20241017 (#240)
BuyuanCui 115b1b9
Fixes issue 228 (#234)
zoobereq 95d73a8
Hindi TN changes
ngachchi ef5db91
Updated date for Hindi TN cache
ngachchi f639e83
additional whitelist class .tsv files and unused imports removed
ngachchi 9c647bd
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 050e752
incorporated suggestions for unused statements and another for closin…
ngachchi 78ef757
Hindi ITN Support for Cardinal, Decimal, Ordinal, Fraction, Date, Tim…
ngachchi 5c1a612
Combined Hindi TN and ITN seperate blocks into single
ngachchi b47777b
[pre-commit.ci] auto fixes from pre-commit.com hooks
ngachchi dd7a2b5
Added init.py files and removed unused commented lines
ngachchi 81f410e
commented irrevelant references and unused snippets from whitelist an…
ngachchi 55e59eb
Whitelist and Word class changes
ngachchi 926c393
post processor changes with minor fixes
ngachchi 05d7cde
remove space before punctuation for sparrowhawk file
ngachchi 7005d73
minor fixes for measure class
ngachchi 33464f6
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 9f9c95b
Updated Jenkinsfile
ngachchi 274b091
removed unused imports and statements
ngachchi 23969cc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] c4a14e9
updated date stamp for HI cache and commented ITN grammars
ngachchi 93eff1c
Updates the cache
zoobereq b488d07
Disables Hindi ITN L0 checks
zoobereq 62573fb
Reapplies ITN CI Checks
zoobereq e5bc86a
resolved the failing sparrowhawk test cases failed
ngachchi 5c399be
added new graph for symbols
ngachchi 8d6c805
Hindi TN Support for Cardinal, Decimal, Fraction, Date, Time, Money a…
ngachchi 56b33c5
added into(x) symbol dependency for measure class
ngachchi ef60c0a
working on measure class
ngachchi 3cd0ff9
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 cdac50c
Hindi TN changes
ngachchi 8554f19
Updated date for Hindi TN cache
ngachchi dec902c
Combined Hindi TN and ITN seperate blocks into single
ngachchi 9fe3cbd
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 3ab01cb
Added init.py files and removed unused commented lines
ngachchi 844b017
Whitelist and Word class changes
ngachchi 9a85a11
post processor changes with minor fixes
ngachchi 700aed1
removed unused imports and statements
ngachchi 66cedf0
Hindi ITN - Addition of Whitelist and Word (#248)
ngachchi 47f9cf2
refactoring minor currency instead of direct implementation of paise
ngachchi a140a42
Implements support for minor currency denominations
zoobereq 0274037
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] b075f54
added unit test cases and minor fixes
ngachchi e879ffc
added missing units to improve accuracy for measure class
ngachchi cd98a2f
Updates the cache
ngachchi 6c3eedf
fixed the sparrowhawk to trim extra space
ngachchi 47a1011
removed unused english whitelist files
ngachchi 4e2bf6f
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 5c4a6a9
reverted to previous logic
ngachchi a6c4583
Jp tn 20241017 (#240)
BuyuanCui 17df864
Hindi TN changes
ngachchi a82ff77
Updated date for Hindi TN cache
ngachchi bd9e52d
additional whitelist class .tsv files and unused imports removed
ngachchi dc81c3c
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 90c88df
incorporated suggestions for unused statements and another for closin…
ngachchi 06e5506
Hindi ITN Support for Cardinal, Decimal, Ordinal, Fraction, Date, Tim…
tarushi2k2 35509e9
Combined Hindi TN and ITN seperate blocks into single
ngachchi f525ee5
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 0bd332c
Added init.py files and removed unused commented lines
ngachchi 15a0cd2
commented irrevelant references and unused snippets from whitelist an…
ngachchi f53dc2a
Whitelist and Word class changes
ngachchi a9ac512
post processor changes with minor fixes
ngachchi 9d7c9ef
remove space before punctuation for sparrowhawk file
ngachchi 015aabe
minor fixes for measure class
ngachchi 13456b4
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] b7962fc
Updated Jenkinsfile
ngachchi 00c33e6
removed unused imports and statements
ngachchi 8f2b671
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 33587e4
updated date stamp for HI cache and commented ITN grammars
ngachchi 52cbdb5
Updates the cache
zoobereq ba6f768
Disables Hindi ITN L0 checks
zoobereq f9da1ef
Reapplies ITN CI Checks
zoobereq e738dda
resolved the failing sparrowhawk test cases failed
ngachchi 88a6640
added new graph for symbols
ngachchi e4f03f1
Hindi TN Support for Cardinal, Decimal, Fraction, Date, Time, Money a…
ngachchi f452d1a
added into(x) symbol dependency for measure class
ngachchi bbfb927
working on measure class
ngachchi 9853d9f
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 8e71b39
Hindi TN changes
ngachchi a125b2c
Updated date for Hindi TN cache
ngachchi d148cf5
Combined Hindi TN and ITN seperate blocks into single
ngachchi 4320ec2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] a58bd70
Added init.py files and removed unused commented lines
ngachchi 8eb9d34
Whitelist and Word class changes
ngachchi f28fea6
post processor changes with minor fixes
ngachchi b59fc5a
removed unused imports and statements
ngachchi e0e109d
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 a12f533
refactoring minor currency instead of direct implementation of paise
ngachchi 0153cd7
Implements support for minor currency denominations
zoobereq 3095fb1
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 7cba72e
added unit test cases and minor fixes
ngachchi 082041d
added missing units to improve accuracy for measure class
ngachchi 3e9fe1f
Updates the cache
zoobereq dc61b1d
fixed the sparrowhawk to trim extra space
ngachchi 01755c5
removed unused english whitelist files
ngachchi b491ba9
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 1681992
reverted to previous logic
ngachchi 2af7575
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 172a314
Updates the cache
zoobereq 6e60b0a
Updates the cache again
zoobereq 4b33dc2
dedh and dhai implementation approach
ngachchi cb94776
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] ee38e85
Fix space issue with ZH ITN (#244)
zoobereq 2bedc07
reverted code and added zero to the hour tsv file
ngachchi 6a82f32
reverted to previous logic
ngachchi ff8ed1b
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 8736a9e
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 ecd4a2c
Hindi ITN - Addition of Whitelist and Word (#248)
tarushi2k2 175a6ec
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] f57aa18
Date further implementation (BC, B.C.) added
ngachchi 5e6df1e
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 08ce71d
added date range implementation
ngachchi 20d7520
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] ec48901
working unit test cases
ngachchi 6c9b6d9
removed the conflicted test case for the instance
ngachchi 66712bc
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 2b5bfe5
Update Dockerfile (#254)
anand-nv f9013a4
updated Jenkins file
ngachchi c8715da
Merge branch 'hi_tn' of https://github.com/ngachchi/NeMo-text-process…
ngachchi 2e7c2a0
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] 9bf29e1
minor fixes
ngachchi 55f2ee8
reformatting changes
ngachchi File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -67,4 +67,4 @@ | |
| /shift per shift | ||
| /project per project | ||
| /class per class | ||
| /session per session | ||
| /session per session | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
3 changes: 3 additions & 0 deletions
3
nemo_text_processing/text_normalization/hi/data/date/year_suffix.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,3 @@ | ||
| ई. पू. ईसा पूर्व | ||
| ई. ईसवी | ||
| तक तक |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
3 changes: 1 addition & 2 deletions
3
nemo_text_processing/text_normalization/hi/data/money/currency.tsv
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,10 +1,9 @@ | ||
| ₹ रुपए | ||
| P पैसे | ||
| £ पाउंड | ||
| ₩ वॉन | ||
| $ डॉलर | ||
| ₺ लीरा | ||
| ৳ टका | ||
| ¥ येन | ||
| ₦ नाइरा | ||
| € यूरो | ||
| € यूरो |
11 changes: 11 additions & 0 deletions
11
nemo_text_processing/text_normalization/hi/data/money/major_minor_currencies.py
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,11 @@ | ||
| major_minor_currencies = { | ||
ngachchi marked this conversation as resolved.
Outdated
Show resolved
Hide resolved
|
||
| "रुपए": "पैसे", | ||
| "पाउंड": "पेंस", | ||
| "वॉन": "जिओन", | ||
| "डॉलर": "सेंट", | ||
| "लीरा": "कुरस", | ||
| "टका": "पैसे", | ||
| "येन": "सेन", | ||
| "नाइरा": "कोबो", | ||
| "यूरो": "सेंट", | ||
| } | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,3 +1,4 @@ | ||
| ० शून्य | ||
| १ एक | ||
| २ दो | ||
| ३ तीन | ||
|
|
||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.