Commit Graph

197 Commits

Author SHA1 Message Date
Fangjun Kuang
0b5fa24134 Upload pre-compiled binaries and libraries on release (#206) 2023-07-10 17:59:08 +08:00
Fangjun Kuang
4eacbed0b2 Update onnxruntime from v1.14.0 to v1.15.1 (#204) 2023-07-09 14:42:19 +08:00
Fangjun Kuang
fa86597f98 Publish pre-compiled xcframework for iOS via GitHub actions (#203) 2023-07-08 13:46:28 +08:00
Fangjun Kuang
ba26783eb0 Support 32-bit ARM Linux (#202) 2023-07-07 17:30:06 +08:00
Jingzhao Ou
27762bb169 fixed broken build when user provides their own onnxruntime library (#201) 2023-07-07 14:26:09 +08:00
Fangjun Kuang
a8599507ca Publish universal wheel for macOS (#200) 2023-07-06 18:05:52 +08:00
Fangjun Kuang
adcb5776d8 Publish pre-compiled wheels for Win32 (#199) 2023-07-06 15:08:52 +08:00
Fangjun Kuang
33bf8dc1f4 Support specifying providers in Python API (#198) 2023-07-06 10:14:01 +08:00
Fangjun Kuang
3a08191a43 Build pre-compiled wheels for aarch64 (Linux) (#197) 2023-07-06 08:40:25 +08:00
Fangjun Kuang
a40697f633 Fix building for GPU support. (#195) 2023-07-05 18:18:31 +08:00
Fangjun Kuang
1f02f7c349 Support recognition from URLs. (#194) 2023-07-04 10:16:11 +08:00
Fangjun Kuang
2c436606bd Support .Netstandard 2.0 (#193) 2023-07-02 22:57:48 +08:00
Fangjun Kuang
0dd2d41f27 Real-time speech recognition from microphone for .Net (#192)
* Real-time speech recognition from microphone for .Net supporting Linux, macOS, and Windows

* Use PortAudioSharp2.

We will maintain it by ourselves. The project is at
https://github.com/csukuangfj/PortAudioSharp2

* minor fixes
2023-07-01 18:55:41 +08:00
Wei Kang
513dfaa552 Support contextual-biasing for streaming model (#184)
* Support contextual-biasing for streaming model

* The whole pipeline runs normally

* Fix comments
2023-06-30 16:46:24 +08:00
Fangjun Kuang
b2e0c4c9c2 Update iOS and Android demos to include the latest streaming zipformer (#189) 2023-06-27 18:11:37 +08:00
James
dab22c4764 Add detection conditions under the x86 platform. (#186)
When downloading onnxruntime, add detection conditions under the x86 platform.
2023-06-26 12:05:03 +08:00
danfu
1c3dac9001 support streaming zipformer2 (#185)
Co-authored-by: danfu <danfu@tencent.com>
2023-06-26 11:09:43 +08:00
Fangjun Kuang
157b6b801b Fix C# APIs (#183)
* Fix c# APIs

* reformat
2023-06-24 00:56:53 +08:00
fx
81579bbddd fix numpy bug (#181) 2023-06-20 20:55:47 +08:00
Fangjun Kuang
d82251cba6 Support the latest offline zipformer model (#180) 2023-06-19 14:09:24 +08:00
Wei Kang
8562711252 Implement context biasing with a Aho Corasick automata (#145)
* Implement context graph

* Modify the interface to support context biasing

* Support context biasing in modified beam search; add python wrapper

* Support context biasing in python api example

* Minor fixes

* Fix context graph

* Minor fixes

* Fix tests

* Fix style

* Fix style

* Fix comments

* Minor fixes

* Add missing header

* Replace std::shared_ptr with std::unique_ptr for effciency

* Build graph in constructor

* Fix comments

* Minor fixes

* Fix docs
2023-06-16 14:26:36 +08:00
keanu
1a1b9fd236 RNNLM model support lm_num_thread and lm_provider setting (#173)
* rnnlm model inference supports num_threads setting

* rnnlm params decouple num_thread and provider with Transducer.

* fix python csrc bug which offline-lm-config.cc and online-lm-config.cc arguments problem

* lm_num_threads and lm_provider set default values

---------

Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-06-12 15:51:27 +08:00
Fangjun Kuang
13b33fcc08 Release v1.4.4 (#172) 2023-06-08 09:41:04 +08:00
keanu
9c017c2ccb rnnlm model inference supports num_threads setting (#169)
Co-authored-by: cuidongcai1035 <cuidongcai1035@wezhuiyi.com>
2023-06-07 09:32:27 +08:00
Fangjun Kuang
8fad17c87e Fix iOS build (#167) 2023-06-04 23:03:22 +08:00
Jingzhao Ou
fdd49d0538 add batch processing to sherpa-onnx (#166) 2023-06-04 14:13:55 +08:00
Jingzhao Ou
0ed501b8f1 Added provider option to sherpa-onnx and decode-file-c-api (#162) 2023-06-03 04:57:48 +08:00
Fangjun Kuang
5e2dc5ceea add streaming-server with web client (#164)
* add streaming-server with web client

* small fixes
2023-05-30 22:46:52 +08:00
Fangjun Kuang
d7114da441 Minor fixes (#161) 2023-05-23 15:57:33 +08:00
Fangjun Kuang
44821ae2fb Use fixed decimal point for offline timestamp (#158) 2023-05-22 16:52:38 +08:00
zhaomingwork
b70d40f4ab Add java websocket support (#137)
* add decode example for mic

* some changes to README.md

* add java websocket srv

* change to readwav to static

* make some changes to code comments

* little change for readme.md

* fix bug about multiple threads

* made little modification

* add protocol in readme, removed static Queue and add lmConfig

---------

Co-authored-by: root <root@localhost.localdomain>
2023-05-18 10:35:40 +08:00
PF Luo
655c619bf3 Fix lm fusion (#157)
* share GetHypsRowSplits interface and fix getting Topk not taking logprob

* fix lm score of lm fusion and make padding len same with 'icefall/egs/librispeech/ASR/pruned_transducer_stateless7_streaming/decode.py'
2023-05-15 10:48:45 +08:00
Fangjun Kuang
959f13eac8 Fix typos in .Net APIs (#156) 2023-05-14 22:32:01 +08:00
Yuekai Zhang
b8fbf8e5ce Add onnxruntime gpu for cmake (#153)
* add onnxruntime gpu for cmake

* fix clang

* fix typo

* cpplint
2023-05-12 22:30:47 +08:00
Fangjun Kuang
cea718e3d8 Support CoreML for macOS (#151) 2023-05-12 15:57:44 +08:00
Jingzhao Ou
de1880948b Added streaming conformer test (#150) 2023-05-12 08:37:24 +08:00
Jingzhao Ou
0992063de8 Stack and streaming conformer support (#141)
* added csrc/stack.cc

* stack: added checks

* added copyright info

* passed cpp style checks

* formatted code

* added some support for streaming conformer model support (not verified)

* code lint

* made more progress with streaming conformer support (not working yet)

* passed style check

* changes as suggested by @csukuangfj

* added some debug info

* fixed style check

* Use Cat to replace Stack

* remove debug statements

---------

Co-authored-by: Jingzhao Ou (jou2019) <jou2019@cisco.com>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-05-11 14:30:39 +08:00
PF Luo
824b0809a4 add shallow fusion (#147) 2023-05-10 22:30:57 +08:00
Fangjun Kuang
7969cf44ac Refactor C# code and support building nuget packages for cross-platforms (#144) 2023-05-10 14:53:04 +08:00
cooldoomsday
0bc571f6ee Return timestamp info and tokens in offline ASR
Co-authored-by: zhangbaofeng@npnets.com <41259@Zbf>
2023-05-06 10:20:46 +08:00
Fangjun Kuang
8ce6fff9b6 ignore style check for files in csharp-api (#143) 2023-05-06 08:04:19 +08:00
PF Luo
8c6a6768d5 Add lm rescore to online-modified-beam-search (#133) 2023-05-05 21:23:54 +08:00
manyeyes
3b9c3db31d add online-api for csharp-api (#139)
Co-authored-by: zx <12345678>
2023-05-05 11:07:53 +08:00
manyeyes
0c23962129 Add a C# api for offline-recognizer of sherpa-onnx (#129) 2023-05-05 10:25:06 +08:00
PF Luo
aa7108729b share GetHypsRowSplits interface and fix getting Topk not taking logprob (#131) 2023-04-26 11:41:04 +08:00
Fangjun Kuang
86017f9833 Add RNN LM rescore for offline ASR with modified_beam_search (#125) 2023-04-23 17:15:18 +08:00
manyeyes
d49a597431 Fix bug in DecodeMultipleOnlineStreams method in c-api.cc file (#128)
Co-authored-by: zx <12345678>
2023-04-21 13:48:44 +08:00
zhaomingwork
cd3c58c320 java decode example for microphone (#122) 2023-04-20 09:10:47 +08:00
Fangjun Kuang
4024bfab32 Add examples for Kotlin API (#124) 2023-04-19 17:29:35 +08:00
Fangjun Kuang
ad05f52666 Add timestamps for streaming ASR. (#123) 2023-04-19 16:02:37 +08:00