Commit Graph

72 Commits

Author SHA1 Message Date
Fangjun Kuang
9efe69720d Support VITS VCTK models (#367)
* Support VITS VCTK models

* Release v1.8.1
2023-10-16 17:22:30 +08:00
Fangjun Kuang
655e0fa836 add python API and examples for TTS (#364) 2023-10-14 14:21:53 +08:00
Fangjun Kuang
efd3cd3312 Fix pkg-config for linux (#358) 2023-10-12 16:21:06 +08:00
Fangjun Kuang
407602445d Add CTC HLG decoding using OpenFst (#349) 2023-10-08 11:32:39 +08:00
Fangjun Kuang
552a267c23 Set is_final and start_time for online websocket server. (#342)
* Set is_final and start_time for online websocket server.

* Convert timestamps to a json array
2023-09-25 15:12:07 +08:00
poor1017
c2518a5826 Supports cmake compilation compatible with v3.13. (#340)
Co-authored-by: chenyu <cheny65@chinatelecom.cn>
2023-09-25 11:48:55 +08:00
Fangjun Kuang
7649bd862c Fix building APKs (#337) 2023-09-24 14:16:14 +08:00
Fangjun Kuang
4ca60e2603 Add VAD + Non-streaming ASR model for Android (#336) 2023-09-23 22:48:38 +08:00
Fangjun Kuang
6e60a77d89 Add Android APK for Silero VAD (#335) 2023-09-23 20:39:13 +08:00
Fangjun Kuang
43b2b7760d Fix tokens processing for byte-level BPE (#333) 2023-09-22 13:28:19 +08:00
Fangjun Kuang
cf199ad466 Support onnxruntime 1.16.0 (#330) 2023-09-21 20:39:24 +08:00
Fangjun Kuang
532ed142d2 Support linking onnxruntime lib statically on Linux (#326) 2023-09-21 10:15:42 +08:00
Fangjun Kuang
6afa9c85f6 Fix tokens for byte-level BPE token. (#324) 2023-09-20 07:49:53 +08:00
Fangjun Kuang
c471423125 Add Silero VAD (#313) 2023-09-17 14:54:38 +08:00
Fangjun Kuang
1e95fbf15f release v1.7.14 (#311) 2023-09-14 19:35:29 +08:00
Fangjun Kuang
087367d7fe Build APKs for two-pass speech recognition (#308) 2023-09-12 18:18:24 +08:00
Fangjun Kuang
ffeff3b8a3 Fix a typo for Go (#298) 2023-09-07 11:29:00 +08:00
Fangjun Kuang
49ec7e8f57 Fix building APK (#286) 2023-08-18 15:28:06 +08:00
Fangjun Kuang
e955f9c752 Release pre-built APKs (#285) 2023-08-18 14:28:44 +08:00
Fangjun Kuang
eb22b4845a Fix a bug for multilingual ASR (#281) 2023-08-17 10:43:26 +08:00
Fangjun Kuang
e31f9e48c2 Fix various language binding APIs for tdnn and whisper models (#278) 2023-08-16 22:15:10 +08:00
Fangjun Kuang
f709c95c5f Support multilingual whisper models (#274) 2023-08-16 00:28:52 +08:00
Fangjun Kuang
496c5dd7f5 Fix pip install on Windows for Python 3.11 (#271) 2023-08-14 22:22:46 +08:00
Fangjun Kuang
bc791d4996 Fix C api for Go and MFC to support streaming paraformer (#268) 2023-08-14 17:02:23 +08:00
Fangjun Kuang
6038e2aa62 Support streaming paraformer (#263) 2023-08-14 10:32:14 +08:00
Fangjun Kuang
a4bff28e21 Support TDNN models from the yesno recipe from icefall (#262) 2023-08-12 19:50:22 +08:00
Fangjun Kuang
b094868fb8 Add non-streaming websocket server for python (#259) 2023-08-11 15:56:24 +08:00
Fangjun Kuang
865fd1e017 Support pkg-config (#253) 2023-08-10 11:22:36 +08:00
Fangjun Kuang
d26bba1ec5 release v1.7.0 (#251) 2023-08-09 20:31:04 +08:00
Fangjun Kuang
aa48b76d4b Fix initial tokens to decoding (#246) 2023-08-09 12:33:47 +08:00
Fangjun Kuang
6235cb9f1b Fix building wheels for Linux (#240) 2023-08-07 20:16:05 +08:00
Fangjun Kuang
45b9d4ab37 Support whisper models (#238) 2023-08-07 12:34:18 +08:00
Fangjun Kuang
c88e672733 Fix building wheels on Windows (#229) 2023-08-01 13:45:02 +08:00
Fangjun Kuang
b96837d766 Release v1.5.4 (#221) 2023-07-21 18:05:26 +08:00
Fangjun Kuang
5d820b88ef Publish pre-compiled libs for Android. (#217) 2023-07-15 12:25:18 +08:00
Fangjun Kuang
fe0630fe1f release-v1.5.2 (#214) 2023-07-14 18:18:42 +08:00
Fangjun Kuang
bebc1f1398 Use static libraries for MFC examples (#210) 2023-07-13 14:52:43 +08:00
Fangjun Kuang
4eacbed0b2 Update onnxruntime from v1.14.0 to v1.15.1 (#204) 2023-07-09 14:42:19 +08:00
Fangjun Kuang
a40697f633 Fix building for GPU support. (#195) 2023-07-05 18:18:31 +08:00
Fangjun Kuang
2c436606bd Support .Netstandard 2.0 (#193) 2023-07-02 22:57:48 +08:00
Fangjun Kuang
0dd2d41f27 Real-time speech recognition from microphone for .Net (#192)
* Real-time speech recognition from microphone for .Net supporting Linux, macOS, and Windows

* Use PortAudioSharp2.

We will maintain it by ourselves. The project is at
https://github.com/csukuangfj/PortAudioSharp2

* minor fixes
2023-07-01 18:55:41 +08:00
Fangjun Kuang
b2e0c4c9c2 Update iOS and Android demos to include the latest streaming zipformer (#189) 2023-06-27 18:11:37 +08:00
Fangjun Kuang
157b6b801b Fix C# APIs (#183)
* Fix c# APIs

* reformat
2023-06-24 00:56:53 +08:00
Fangjun Kuang
13b33fcc08 Release v1.4.4 (#172) 2023-06-08 09:41:04 +08:00
Fangjun Kuang
959f13eac8 Fix typos in .Net APIs (#156) 2023-05-14 22:32:01 +08:00
Yuekai Zhang
b8fbf8e5ce Add onnxruntime gpu for cmake (#153)
* add onnxruntime gpu for cmake

* fix clang

* fix typo

* cpplint
2023-05-12 22:30:47 +08:00
Fangjun Kuang
7969cf44ac Refactor C# code and support building nuget packages for cross-platforms (#144) 2023-05-10 14:53:04 +08:00
Fangjun Kuang
86017f9833 Add RNN LM rescore for offline ASR with modified_beam_search (#125) 2023-04-23 17:15:18 +08:00
Fangjun Kuang
4024bfab32 Add examples for Kotlin API (#124) 2023-04-19 17:29:35 +08:00
Fangjun Kuang
ad05f52666 Add timestamps for streaming ASR. (#123) 2023-04-19 16:02:37 +08:00