Commit 119593e
Add Hindi (hi-IN) support for TTS (#15248)
* add Hindi char tokenizer, IPA G2P, and Unicode Hindi support
Signed-off-by: quanpham <youngkwan199@gmail.com>
* Add Hindi chars tokenizer
Signed-off-by: quanpham <youngkwan199@gmail.com>
* hindi grapheme and ipa sets
Signed-off-by: quanpham <youngkwan199@gmail.com>
* remove ipa hindi
Signed-off-by: quanpham <youngkwan199@gmail.com>
* remove hindi ipa
Signed-off-by: quanpham <youngkwan199@gmail.com>
* Restore file to base version
Signed-off-by: quanpham <youngkwan199@gmail.com>
* hindi chartokenizer unit test
Signed-off-by: quanpham <youngkwan199@gmail.com>
* Restore tokenizer_utils.py to base version
Signed-off-by: quanpham <youngkwan199@gmail.com>
* Apply suggestion from @Copilot
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Apply isort and black reformatting
Signed-off-by: XuesongYang <XuesongYang@users.noreply.github.com>
* Update nemo/collections/common/tokenizers/text_to_speech/ipa_lexicon.py
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Add simple docstrings for helper funcs
Signed-off-by: Jason <jasoli@nvidia.com>
---------
Signed-off-by: quanpham <youngkwan199@gmail.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: XuesongYang <XuesongYang@users.noreply.github.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: XuesongYang <XuesongYang@users.noreply.github.com>
Co-authored-by: Jason <jasoli@nvidia.com>1 parent 3bf62de commit 119593e
File tree
3 files changed
+123
-6
lines changed- nemo/collections/common/tokenizers/text_to_speech
- tests/collections/common/tokenizers/text_to_speech
3 files changed
+123
-6
lines changedLines changed: 36 additions & 6 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
18 | | - | |
| 18 | + | |
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
| |||
90 | 90 | | |
91 | 91 | | |
92 | 92 | | |
93 | | - | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
94 | 110 | | |
95 | 111 | | |
96 | 112 | | |
| |||
147 | 163 | | |
148 | 164 | | |
149 | 165 | | |
150 | | - | |
| 166 | + | |
151 | 167 | | |
152 | | - | |
| 168 | + | |
153 | 169 | | |
154 | 170 | | |
155 | 171 | | |
156 | 172 | | |
157 | | - | |
158 | | - | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
159 | 185 | | |
160 | 186 | | |
161 | 187 | | |
| |||
165 | 191 | | |
166 | 192 | | |
167 | 193 | | |
| 194 | + | |
168 | 195 | | |
169 | 196 | | |
170 | 197 | | |
171 | 198 | | |
172 | 199 | | |
| 200 | + | |
173 | 201 | | |
174 | 202 | | |
175 | 203 | | |
| |||
193 | 221 | | |
194 | 222 | | |
195 | 223 | | |
| 224 | + | |
196 | 225 | | |
197 | 226 | | |
198 | 227 | | |
| |||
202 | 231 | | |
203 | 232 | | |
204 | 233 | | |
| 234 | + | |
205 | 235 | | |
206 | 236 | | |
207 | 237 | | |
| |||
Lines changed: 75 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
380 | 380 | | |
381 | 381 | | |
382 | 382 | | |
| 383 | + | |
| 384 | + | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
| 388 | + | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
| 392 | + | |
| 393 | + | |
| 394 | + | |
| 395 | + | |
| 396 | + | |
| 397 | + | |
| 398 | + | |
| 399 | + | |
| 400 | + | |
| 401 | + | |
| 402 | + | |
| 403 | + | |
| 404 | + | |
| 405 | + | |
| 406 | + | |
| 407 | + | |
| 408 | + | |
| 409 | + | |
| 410 | + | |
| 411 | + | |
| 412 | + | |
| 413 | + | |
| 414 | + | |
| 415 | + | |
| 416 | + | |
| 417 | + | |
| 418 | + | |
| 419 | + | |
| 420 | + | |
| 421 | + | |
| 422 | + | |
| 423 | + | |
| 424 | + | |
| 425 | + | |
| 426 | + | |
| 427 | + | |
| 428 | + | |
| 429 | + | |
| 430 | + | |
| 431 | + | |
| 432 | + | |
| 433 | + | |
| 434 | + | |
| 435 | + | |
| 436 | + | |
| 437 | + | |
| 438 | + | |
| 439 | + | |
| 440 | + | |
| 441 | + | |
| 442 | + | |
| 443 | + | |
| 444 | + | |
| 445 | + | |
| 446 | + | |
| 447 | + | |
| 448 | + | |
| 449 | + | |
| 450 | + | |
| 451 | + | |
| 452 | + | |
| 453 | + | |
| 454 | + | |
| 455 | + | |
| 456 | + | |
| 457 | + | |
383 | 458 | | |
384 | 459 | | |
385 | 460 | | |
| |||
Lines changed: 12 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| 21 | + | |
21 | 22 | | |
22 | 23 | | |
23 | 24 | | |
| |||
302 | 303 | | |
303 | 304 | | |
304 | 305 | | |
| 306 | + | |
| 307 | + | |
| 308 | + | |
| 309 | + | |
| 310 | + | |
| 311 | + | |
| 312 | + | |
| 313 | + | |
| 314 | + | |
| 315 | + | |
| 316 | + | |
0 commit comments