Fangjun Kuang
|
209eaaae1d
|
Limit number of tokens per second for whisper. (#1958)
Otherwise, it spends lots of time in the loop if the EOT token
is not predicted.
|
2025-03-04 15:45:28 +08:00 |
|
ivan provalov
|
94728bfbee
|
Fixing Whisper Model Token Normalization (#1904)
|
2025-02-21 12:58:01 +08:00 |
|
Fangjun Kuang
|
298b6b6fda
|
Add non-streaming ASR support for HarmonyOS. (#1564)
|
2024-11-26 16:38:35 +08:00 |
|
ivan provalov
|
de04b3b9bf
|
Allow modify model config at decode time for ASR (#1124)
|
2024-07-13 22:30:47 +08:00 |
|
Fangjun Kuang
|
117cd7bb8c
|
Support whisper large/large-v1/large-v2/large-v3 and distil-large-v2 (#1114)
|
2024-07-12 23:47:39 +08:00 |
|
Fangjun Kuang
|
b0f7ed3ee3
|
Add inverse text normalization for non-streaming ASR (#1017)
|
2024-06-17 14:28:53 +08:00 |
|
Fangjun Kuang
|
17cd3a5f01
|
Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854)
|
2024-05-10 12:15:39 +08:00 |
|
Fangjun Kuang
|
0d258dd150
|
Support spoken language identification with whisper (#694)
|
2024-03-24 22:57:00 +08:00 |
|
Fangjun Kuang
|
0e23f82691
|
Give an informative log for whisper on exceptions. (#473)
|
2023-12-08 14:33:59 +08:00 |
|
Fangjun Kuang
|
868c339e5e
|
Support distil-small.en whisper (#472)
|
2023-12-08 11:59:20 +08:00 |
|
Fangjun Kuang
|
3ae984f148
|
Remove the 30-second constraint from whisper. (#471)
|
2023-12-07 17:47:08 +08:00 |
|
Fangjun Kuang
|
606cb26a62
|
Catch exception from whisper (#408)
|
2023-11-05 11:10:24 +08:00 |
|
Fangjun Kuang
|
552a267c23
|
Set is_final and start_time for online websocket server. (#342)
* Set is_final and start_time for online websocket server.
* Convert timestamps to a json array
|
2023-09-25 15:12:07 +08:00 |
|
Fangjun Kuang
|
debab7c091
|
Add two-pass speech recognition Android/iOS demo (#304)
|
2023-09-12 15:40:16 +08:00 |
|
Fangjun Kuang
|
f709c95c5f
|
Support multilingual whisper models (#274)
|
2023-08-16 00:28:52 +08:00 |
|
Fangjun Kuang
|
45b9d4ab37
|
Support whisper models (#238)
|
2023-08-07 12:34:18 +08:00 |
|