Fangjun Kuang
8b7928e7d6
Fix computing features for whisper. ( #617 )
2024-02-29 16:56:29 +08:00
Fangjun Kuang
85d59b5840
Use hub.nuaa.cf to replace huggingface URL to download dependencies. ( #614 )
2024-02-28 17:48:51 +08:00
Fangjun Kuang
0cb6d1b474
support using xnnpack as execution provider ( #612 )
2024-02-28 17:32:48 +08:00
Fangjun Kuang
87a7030c08
Support using alsa to access the microphone with non-streaming ASR models ( #517 )
2024-02-26 21:17:26 +08:00
Fangjun Kuang
ee37d9bd92
Support RISC-V ( #609 )
2024-02-26 06:57:18 +08:00
Fangjun Kuang
67acd34dcd
Use alsa to read microphone in speaker identification demo. ( #605 )
2024-02-23 19:27:51 +08:00
Fangjun Kuang
5f075d0fce
Support MinSizeRel and RelWithDebInfo build on Windows. ( #586 )
2024-02-20 10:22:02 +08:00
Fangjun Kuang
c68f39bd3c
Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 ( #587 )
2024-02-20 09:31:37 +08:00
Fangjun Kuang
64007a6193
Support building debug version on Windows ( #583 )
2024-02-18 10:39:55 +08:00
Fangjun Kuang
81da0fb7a6
Update onnxruntime from 1.16.3 to 1.17.0 ( #581 )
2024-02-17 12:43:42 +08:00
Fangjun Kuang
d771762868
Support WebAssembly for text-to-speech ( #577 )
2024-02-08 23:39:12 +08:00
Fangjun Kuang
0b18ccfbb2
C++ API demo for speaker identification with portaudio. ( #561 )
2024-01-30 11:21:43 +08:00
Fangjun Kuang
a9e7747736
Fix cmake variables to point to the project root directory. ( #545 )
2024-01-24 19:21:23 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting ( #505 )
...
* various fixes to ContextGraph to support open vocabulary keywords decoder
* Add keyword spotter runtime
* Add binary
* First version works
* Minor fixes
* update text2token
* default values
* Add jni for kws
* add kws android project
* Minor fixes
* Remove unused interface
* Minor fixes
* Add workflow
* handle extra info in texts
* Minor fixes
* Add more comments
* Fix ci
* fix cpp style
* Add input box in android demo so that users can specify their keywords
* Fix cpp style
* Fix comments
* Minor fixes
* Minor fixes
* minor fixes
* Minor fixes
* Minor fixes
* Add CI
* Fix code style
* cpplint
* Fix comments
* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
2024e96639
Add C++ runtime for speaker verification models from NeMo ( #527 )
2024-01-13 21:42:09 +08:00
Fangjun Kuang
33c03f78b2
Fix CI ( #485 )
2023-12-15 10:25:03 +08:00
Fangjun Kuang
9ff6185b7c
fix building linux x86 wheels ( #484 )
2023-12-14 21:37:40 +08:00
Fangjun Kuang
b18812ceff
Play generated audio using alsa for TTS ( #482 )
2023-12-13 22:28:03 +08:00
Fangjun Kuang
cae0231f93
Fix releasing go packages ( #476 )
2023-12-09 00:07:52 +08:00
Fangjun Kuang
99ff6a834c
Play generated audio as it is generating. ( #457 )
2023-12-02 15:35:11 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs ( #453 )
2023-11-30 23:57:43 +08:00
Fangjun Kuang
db41778e99
Support piper-phonemize ( #452 )
2023-11-28 19:12:58 +08:00
Fangjun Kuang
8444d54c4e
Update to onnxruntime 1.16.3 ( #446 )
2023-11-24 14:39:03 +08:00
Fangjun Kuang
eeda1e190e
Build building for iOS ( #430 )
2023-11-16 21:14:25 +08:00
Fangjun Kuang
9884cf71e7
Update onnxruntime to v1.16.2 ( #421 )
2023-11-12 11:29:33 +08:00
Fangjun Kuang
68f0e59688
Add a C++ example to show streaming VAD + non-streaming ASR. ( #420 )
2023-11-11 22:54:27 +08:00
Fangjun Kuang
86baf43c6b
support reading rule FST for Android TTS ( #410 )
2023-11-06 10:38:40 +08:00
Fangjun Kuang
d1a450bf82
Support text normalization via rule FST ( #407 )
2023-11-05 08:59:03 +08:00
Fangjun Kuang
cca744e34e
Update to onnxruntime v1.16.1 ( #406 )
2023-11-01 16:23:31 +08:00
Fangjun Kuang
27db015c8e
Use a single static lib file for onnxruntime on Windows ( #404 )
2023-10-31 21:50:56 +08:00
Fangjun Kuang
b80b7e5144
Support linking onnxruntime statically for macOS ( #403 )
2023-10-31 20:24:43 +08:00
Fangjun Kuang
fabbc70633
Support static linking onnxruntime for 64-bit ARM ( #402 )
2023-10-31 16:51:04 +08:00
Fangjun Kuang
2f2d3bbd82
Support static linking onnxruntime lib for 32-bit arm ( #401 )
2023-10-31 11:19:01 +08:00
Fangjun Kuang
6e5efa48c5
Fix splitting utf8 string into words ( #385 )
2023-10-25 11:49:27 +08:00
Fangjun Kuang
1ee79e3ff5
Support Chinese vits models ( #368 )
2023-10-18 10:19:10 +08:00
Fangjun Kuang
0df0a735d7
Fix building wheels for windows ( #365 )
2023-10-14 23:14:45 +08:00
Fangjun Kuang
655e0fa836
add python API and examples for TTS ( #364 )
2023-10-14 14:21:53 +08:00
Fangjun Kuang
efd3cd3312
Fix pkg-config for linux ( #358 )
2023-10-12 16:21:06 +08:00
Fangjun Kuang
407602445d
Add CTC HLG decoding using OpenFst ( #349 )
2023-10-08 11:32:39 +08:00
Fangjun Kuang
552a267c23
Set is_final and start_time for online websocket server. ( #342 )
...
* Set is_final and start_time for online websocket server.
* Convert timestamps to a json array
2023-09-25 15:12:07 +08:00
Fangjun Kuang
f91c49a7ba
Provide pre-built wheels for 32-bit arm (e.g., 32-bit Raspberry Pi) ( #338 )
2023-09-25 09:33:43 +08:00
Fangjun Kuang
65ec4dc741
Update alternative URLs for onnxruntime 1.16.0 ( #334 )
2023-09-22 18:50:52 +08:00
Fangjun Kuang
cf199ad466
Support onnxruntime 1.16.0 ( #330 )
2023-09-21 20:39:24 +08:00
Fangjun Kuang
532ed142d2
Support linking onnxruntime lib statically on Linux ( #326 )
2023-09-21 10:15:42 +08:00
Fangjun Kuang
c471423125
Add Silero VAD ( #313 )
2023-09-17 14:54:38 +08:00
Fangjun Kuang
865fd1e017
Support pkg-config ( #253 )
2023-08-10 11:22:36 +08:00
xiaott
a04221a1ce
fix cargs cmake typo ( #252 )
...
Co-authored-by: xiaott <xiaott@LAPTOP-3UTS3SBS.localdomain >
2023-08-09 22:49:22 +08:00
Fangjun Kuang
6061318e3f
fix building on linux with GPU ( #249 )
2023-08-09 20:21:28 +08:00
Fangjun Kuang
45b9d4ab37
Support whisper models ( #238 )
2023-08-07 12:34:18 +08:00
Fangjun Kuang
c88e672733
Fix building wheels on Windows ( #229 )
2023-08-01 13:45:02 +08:00