Fangjun Kuang
|
669f5ef441
|
Add C++ runtime and Python APIs for Moonshine models (#1473)
|
2024-10-26 14:34:07 +08:00 |
|
Robin Zhong
|
62c4d4ab62
|
Add emotion, event of SenseVoice. (#1257)
* Add emotion, event of SenseVoice.
* Fix tokens size check and update java api.
https://github.com/k2-fsa/sherpa-onnx/pull/1257
|
2024-08-14 15:50:13 +08:00 |
|
ivan provalov
|
de04b3b9bf
|
Allow modify model config at decode time for ASR (#1124)
|
2024-07-13 22:30:47 +08:00 |
|
Fangjun Kuang
|
117cd7bb8c
|
Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#1114)
|
2024-07-12 23:47:39 +08:00 |
|
Fangjun Kuang
|
1a43d1e37f
|
Support getting word IDs for CTC HLG decoding. (#978)
|
2024-06-06 14:22:39 +08:00 |
|
Fangjun Kuang
|
17cd3a5f01
|
Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854)
|
2024-05-10 12:15:39 +08:00 |
|
Fangjun Kuang
|
c1608b3524
|
Support CED models (#792)
|
2024-04-19 15:20:37 +08:00 |
|
Fangjun Kuang
|
f20291cadc
|
Support audio tagging using zipformer (#747)
|
2024-04-10 14:47:06 +08:00 |
|
Fangjun Kuang
|
b83b3e3cd1
|
Support non-streaming WeNet CTC models. (#426)
|
2023-11-15 14:23:20 +08:00 |
|
Fangjun Kuang
|
45b9d4ab37
|
Support whisper models (#238)
|
2023-08-07 12:34:18 +08:00 |
|
Wei Kang
|
8562711252
|
Implement context biasing with an Aho-Corasick automaton (#145)
* Implement context graph
* Modify the interface to support context biasing
* Support context biasing in modified beam search; add python wrapper
* Support context biasing in python api example
* Minor fixes
* Fix context graph
* Minor fixes
* Fix tests
* Fix style
* Fix style
* Fix comments
* Minor fixes
* Add missing header
* Replace std::shared_ptr with std::unique_ptr for efficiency
* Build graph in constructor
* Fix comments
* Minor fixes
* Fix docs
|
2023-06-16 14:26:36 +08:00 |
|
cooldoomsday
|
0bc571f6ee
|
Return timestamp info and tokens in offline ASR
Co-authored-by: zhangbaofeng@npnets.com <41259@Zbf>
|
2023-05-06 10:20:46 +08:00 |
|
Fangjun Kuang
|
80060c276d
|
Begin to support CTC models (#119)
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
|
2023-04-07 23:11:34 +08:00 |
|
manyeyes
|
3f7e0c23ac
|
adding a python api for offline decode (#110)
|
2023-04-02 13:17:43 +08:00 |
|
Fangjun Kuang
|
423d89e9a5
|
Support paraformer. (#95)
|
2023-03-28 17:59:54 +08:00 |
|
Fangjun Kuang
|
5572246253
|
Add non-streaming ASR (#92)
|
2023-03-26 08:53:42 +08:00 |
|