Commit e503a6e
Initial Chunking (#14321)
* adding nfa to canary
Signed-off-by: Monica Sekoyan <msekoyan@nvidia.com>
* remove comments
Signed-off-by: Monica Sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* modify external model loading
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* fix audio padding
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* reseting
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* handle non-possible alignment
Signed-off-by: Monica Sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* add offset refinement
Signed-off-by: Monica Sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* Initial Chunking
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Adding comments and docstrings
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Changes in doctrings
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Changes in doctrings
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Updates to the algrithm
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Update with timestamps
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Remove join_text
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Final
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Remove pdb
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Adjust timestamps
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Adjust timestamps
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Support for long audio
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Refactoring to keep model clean
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Small changes
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Removing changes from mixin
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* small updates
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Back to main for mixin
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Fix for hypotheses
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Revert "Fix for hypotheses"
This reverts commit 61fb893.
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Fix for hypotheses
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Revert "Revert "Fix for hypotheses""
This reverts commit 3c62a2d.
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Resolve
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Allowing user to control chunking
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Doc changes
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Forcing true for chunking
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Revert "reseting"
This reverts commit 6d74ad0.
Signed-off-by: monica-sekoyan <msekoyan@vidia.com>
* Revert "Apply isort and black reformatting"
This reverts commit 1d8c363.
Signed-off-by: monica-sekoyan <msekoyan@vidia.com>
* handle merge case for timestamps
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* add timestamp_type
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* add timestamps support chunked inference
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* refactor ctc timestamps to use utils
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* correct restore_token_cased with unk_token
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* use timestamps utils in rnnt_decoding
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* change external timestamps asr model loading
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* add forced aligned method tests
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* modify nfa to match new setup and utils
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* remove unused imports
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* merge conflicts
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* remove unused errors
Signed-off-by: monica-sekoyan <msekoyan@vidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* remove unused import
Signed-off-by: monica-sekoyan <msekoyan@vidia.com>
* addressing comments, linting and flake8
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* handle decode_ids_to_str change
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* correct usage of decode_tokens_to_str
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* update nfa docs
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* revert jupyter settings
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Merge and Tests
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* Unit tests
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* change decoding_tokens_to_str
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* change decoding_tokens_to_str
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* Update
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Doc updates
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* Doc updates
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Doc change for speech_to_text_aed_chunked_infer
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Remove some import
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Copyright
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Remove some import
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* correct description
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* make private
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* rewrite restore_timestamps_asr_model
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* Update timestamps
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* Small updates
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* fix word offset logic
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
* Tests update after the fix
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Cases for monotonicity
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
* Tests fix
Signed-off-by: Nune <ntadevosyan@nvidia.com>
* Increase L0_Unit_Tests_GPU_ASR timeout to 30
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
---------
Signed-off-by: Monica Sekoyan <msekoyan@nvidia.com>
Signed-off-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
Signed-off-by: monica-sekoyan <msekoyan@nvidia.com>
Signed-off-by: Nune <ntadevosyan@nvidia.com>
Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
Signed-off-by: monica-sekoyan <msekoyan@vidia.com>
Signed-off-by: nune-tadevosyan <152167970+nune-tadevosyan@users.noreply.github.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Co-authored-by: Monica Sekoyan <msekoyan@nvidia.com>
Co-authored-by: monica-sekoyan <monica-sekoyan@users.noreply.github.com>
Co-authored-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>
Co-authored-by: monica-sekoyan <msekoyan@vidia.com>
Co-authored-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Charlie Truong <chtruong@nvidia.com>1 parent 454fabc commit e503a6e
File tree
11 files changed
+889
-40
lines changed- .github/workflows
- examples/asr/asr_chunked_inference/aed
- nemo/collections/asr
- data
- models
- parts
- submodules
- utils
- tests/collections/asr
- utils
11 files changed
+889
-40
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
40 | | - | |
| 40 | + | |
41 | 41 | | |
42 | 42 | | |
43 | 43 | | |
| |||
Lines changed: 9 additions & 6 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
20 | | - | |
| 20 | + | |
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
28 | | - | |
29 | | - | |
30 | | - | |
| 28 | + | |
| 29 | + | |
31 | 30 | | |
32 | 31 | | |
33 | 32 | | |
| |||
41 | 40 | | |
42 | 41 | | |
43 | 42 | | |
44 | | - | |
45 | | - | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
46 | 49 | | |
47 | 50 | | |
48 | 51 | | |
| |||
Lines changed: 110 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | | - | |
| 15 | + | |
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| |||
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
24 | | - | |
| 24 | + | |
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| |||
61 | 61 | | |
62 | 62 | | |
63 | 63 | | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
64 | 69 | | |
65 | 70 | | |
66 | 71 | | |
67 | 72 | | |
68 | 73 | | |
69 | 74 | | |
| 75 | + | |
70 | 76 | | |
71 | 77 | | |
72 | 78 | | |
73 | 79 | | |
74 | 80 | | |
75 | 81 | | |
| 82 | + | |
76 | 83 | | |
77 | 84 | | |
78 | 85 | | |
79 | 86 | | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
80 | 101 | | |
81 | 102 | | |
82 | 103 | | |
| |||
110 | 131 | | |
111 | 132 | | |
112 | 133 | | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
113 | 221 | | |
114 | 222 | | |
115 | 223 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
40 | 40 | | |
41 | 41 | | |
42 | 42 | | |
| 43 | + | |
43 | 44 | | |
44 | 45 | | |
45 | 46 | | |
| |||
110 | 111 | | |
111 | 112 | | |
112 | 113 | | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
113 | 118 | | |
114 | 119 | | |
115 | 120 | | |
| |||
119 | 124 | | |
120 | 125 | | |
121 | 126 | | |
| 127 | + | |
122 | 128 | | |
123 | 129 | | |
124 | 130 | | |
| |||
495 | 501 | | |
496 | 502 | | |
497 | 503 | | |
| 504 | + | |
498 | 505 | | |
499 | 506 | | |
500 | 507 | | |
| |||
525 | 532 | | |
526 | 533 | | |
527 | 534 | | |
528 | | - | |
| 535 | + | |
529 | 536 | | |
530 | 537 | | |
531 | 538 | | |
| |||
561 | 568 | | |
562 | 569 | | |
563 | 570 | | |
564 | | - | |
| 571 | + | |
| 572 | + | |
| 573 | + | |
| 574 | + | |
| 575 | + | |
| 576 | + | |
| 577 | + | |
| 578 | + | |
| 579 | + | |
| 580 | + | |
| 581 | + | |
| 582 | + | |
| 583 | + | |
| 584 | + | |
| 585 | + | |
565 | 586 | | |
566 | 587 | | |
| 588 | + | |
567 | 589 | | |
568 | 590 | | |
569 | 591 | | |
570 | 592 | | |
571 | 593 | | |
572 | 594 | | |
| 595 | + | |
| 596 | + | |
| 597 | + | |
| 598 | + | |
| 599 | + | |
573 | 600 | | |
574 | 601 | | |
575 | 602 | | |
576 | 603 | | |
577 | 604 | | |
578 | 605 | | |
579 | 606 | | |
| 607 | + | |
580 | 608 | | |
581 | 609 | | |
582 | 610 | | |
| |||
889 | 917 | | |
890 | 918 | | |
891 | 919 | | |
892 | | - | |
893 | 920 | | |
894 | 921 | | |
895 | | - | |
| 922 | + | |
| 923 | + | |
| 924 | + | |
| 925 | + | |
896 | 926 | | |
897 | 927 | | |
898 | 928 | | |
| |||
979 | 1009 | | |
980 | 1010 | | |
981 | 1011 | | |
| 1012 | + | |
| 1013 | + | |
982 | 1014 | | |
983 | 1015 | | |
984 | 1016 | | |
| |||
988 | 1020 | | |
989 | 1021 | | |
990 | 1022 | | |
| 1023 | + | |
991 | 1024 | | |
992 | 1025 | | |
993 | 1026 | | |
| |||
996 | 1029 | | |
997 | 1030 | | |
998 | 1031 | | |
999 | | - | |
1000 | | - | |
| 1032 | + | |
| 1033 | + | |
| 1034 | + | |
| 1035 | + | |
| 1036 | + | |
1001 | 1037 | | |
1002 | 1038 | | |
1003 | 1039 | | |
1004 | 1040 | | |
1005 | 1041 | | |
1006 | 1042 | | |
| 1043 | + | |
1007 | 1044 | | |
1008 | 1045 | | |
1009 | 1046 | | |
| |||
1013 | 1050 | | |
1014 | 1051 | | |
1015 | 1052 | | |
1016 | | - | |
| 1053 | + | |
1017 | 1054 | | |
1018 | 1055 | | |
1019 | 1056 | | |
1020 | 1057 | | |
1021 | 1058 | | |
1022 | 1059 | | |
| 1060 | + | |
| 1061 | + | |
| 1062 | + | |
| 1063 | + | |
| 1064 | + | |
| 1065 | + | |
| 1066 | + | |
| 1067 | + | |
| 1068 | + | |
| 1069 | + | |
| 1070 | + | |
| 1071 | + | |
| 1072 | + | |
| 1073 | + | |
| 1074 | + | |
| 1075 | + | |
1023 | 1076 | | |
1024 | 1077 | | |
1025 | 1078 | | |
| |||
1035 | 1088 | | |
1036 | 1089 | | |
1037 | 1090 | | |
| 1091 | + | |
1038 | 1092 | | |
1039 | 1093 | | |
1040 | 1094 | | |
| |||
1059 | 1113 | | |
1060 | 1114 | | |
1061 | 1115 | | |
| 1116 | + | |
1062 | 1117 | | |
1063 | 1118 | | |
1064 | 1119 | | |
| |||
0 commit comments