-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Per-Stream Phrase Boosting in ASR Decoding (Transducers) #15125
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 81 commits
Commits
Show all changes
82 commits
Select commit
Hold shift + click to select a range
b4c33dd
Fused biasing ids in interface
artbataev 0e8f25d
Bugfix
artbataev 066d5d3
Fused model: non-batched reference implementation
artbataev 9d65743
Change api
artbataev 28aca3d
Temporary bugfix
artbataev fe42d8a
Allow partial hypothesis without dec_state
artbataev ca2cf4b
Multi-model biasing: implement with RNN-T
artbataev a290f32
Multi-model biasing: implement for TDT, fix RNN-T
artbataev e288228
Add and remove boosting model from multi-model: move logic to Hypothesis
artbataev 8d36596
Clean up per-hyp biasing config
artbataev b8df95e
Auto manage biasing requests in transcribe
artbataev e74f7b1
Disable CUDA graphs with per-stream biasing
artbataev 558dba2
Support specifying lang per phrase
artbataev 972fc29
Fix customization options
artbataev 352027f
Fix streaming decoding
artbataev e4e7fc3
Fix streaming decoding
artbataev ac88aa3
Fix type
artbataev 768e51a
Implement custom requests in nemo inference
artbataev e24fc99
Merge branch 'main' into vbataev/multi_biasing_models
artbataev 7209d63
Use biasing options from state instead of request
artbataev 736f942
Fix f-strings
artbataev e5f9c71
Merge branch 'main' into vbataev/multi_biasing_models
artbataev d8649b7
Expose parameter: enable_per_stream_biasing
artbataev 285ac29
key_phrase_items_list: use parameter in config
artbataev 59139ef
More documentation
artbataev 3680f07
Clean up biasing_cfg management
artbataev 8aea8ad
Specify todo
artbataev 2b20d8e
Clean up
artbataev 4c55a79
Stubs for optimized model
artbataev 5875202
Multi-model: implement adding model
artbataev 1c04627
Multi-model: stubs for advance
artbataev 6840f0f
Multi-model: stubs for advance using Triton
artbataev 494c4c1
Multi-model: implement Triton kernel
artbataev db84e03
Multi-model: implement advance in Pytorch
artbataev 225a644
Multi-model: use optimized implementation
artbataev 586221e
Multi-model: implement model removal
artbataev 398e4ed
Clean up
artbataev b9a4ba2
Clean up
artbataev cce5b26
Fix triton implementation
artbataev 13af31b
Fix flake8 suggestions
artbataev 985ccb4
Merge branch 'main' into vbataev/multi_biasing_models
artbataev 5cfc624
Support CUDA graphs for RNN-T
artbataev 7a00812
Support CUDA graphs for TDT
artbataev 0d44900
Fix TDT
artbataev dfdb876
Merge branch 'main' into vbataev/multi_biasing_models
artbataev bfcf441
Fix vocab size
artbataev fb20279
Fix edge case
artbataev dac3ef8
Add test
artbataev 4aa7b9d
Remove OneLogger causing issues
artbataev 71638dc
Clean up logging
artbataev 90e2ed3
Fix model removal
artbataev 3c9b2db
Efficient addition and removal of models
artbataev 103c4af
Fix Triton implementation
artbataev bc80bf5
Fix CUDA graphs
artbataev 01b361c
Clean up
artbataev 0c62cb5
Bugfix
artbataev f47ebd5
Bugfix
artbataev bec685c
Clean up
artbataev b7e78ef
Merge branch 'main' into vbataev/multi_biasing_models
artbataev 28bc0d3
Add docstrings
artbataev ca58d79
Fix moving fusion models to the device
artbataev 39d73f6
Fix inference mode issues
artbataev 4cae6dd
Merge branch 'main' into vbataev/multi_biasing_models
artbataev 526e0c8
Revert back One Logger callback
artbataev bf06b3b
Implement memory cache
artbataev 9acac89
Fix model removal
artbataev a36f3fa
Clean up
artbataev adf29aa
Add nemo inference test with boosting ground truth
artbataev 5e0bd80
Add test with `asr_model.transcribe`
artbataev a4e1582
Clean up
artbataev cc0c3a1
Clean up
artbataev fad3cad
Clean up decoders
artbataev 8e9a986
Merge branch 'main' into vbataev/multi_biasing_models
artbataev 1908759
Add unit tests
artbataev d49b342
Fix copyright
artbataev 5589360
Fix tests
artbataev 60f53aa
Support biasing request in manifest for streaming inference scripts
artbataev 6c3981c
Merge branch 'main' into vbataev/multi_biasing_models
artbataev e33942a
Use `asr_streaming_infer.py` in functional tests. Add test with per-s…
artbataev e8d0524
Remove redundant WER calculation
artbataev 5470fc7
Clean up. Add docstring
artbataev 9b479f8
Remove unused import
artbataev File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.