Commit Graph

117 Commits

Author SHA1 Message Date
zhaomingwork
b70d40f4ab Add java websocket support (#137)
* add decode example for mic

* some changes to README.md

* add java websocket srv

* change to readwav to static

* make some changes to code comments

* little change for readme.md

* fix bug about multiple threads

* made little modification

* add protocol in readme, removed static Queue and add lmConfig

---------

Co-authored-by: root <root@localhost.localdomain>
2023-05-18 10:35:40 +08:00
PF Luo
655c619bf3 Fix lm fusion (#157)
* share GetHypsRowSplits interface and fix getting Topk not taking logprob

* fix lm score of lm fusion and make padding len same with 'icefall/egs/librispeech/ASR/pruned_transducer_stateless7_streaming/decode.py'
2023-05-15 10:48:45 +08:00
Fangjun Kuang
959f13eac8 Fix typos in .Net APIs (#156) 2023-05-14 22:32:01 +08:00
Yuekai Zhang
b8fbf8e5ce Add onnxruntime gpu for cmake (#153)
* add onnxruntime gpu for cmake

* fix clang

* fix typo

* cpplint
2023-05-12 22:30:47 +08:00
Fangjun Kuang
cea718e3d8 Support CoreML for macOS (#151) 2023-05-12 15:57:44 +08:00
Jingzhao Ou
de1880948b Added streaming conformer test (#150) 2023-05-12 08:37:24 +08:00
Jingzhao Ou
0992063de8 Stack and streaming conformer support (#141)
* added csrc/stack.cc

* stack: added checks

* added copyright info

* passed cpp style checks

* formatted code

* added some support for streaming conformer model support (not verified)

* code lint

* made more progress with streaming conformer support (not working yet)

* passed style check

* changes as suggested by @csukuangfj

* added some debug info

* fixed style check

* Use Cat to replace Stack

* remove debug statements

---------

Co-authored-by: Jingzhao Ou (jou2019) <jou2019@cisco.com>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-05-11 14:30:39 +08:00
PF Luo
824b0809a4 add shallow fusion (#147) 2023-05-10 22:30:57 +08:00
Fangjun Kuang
7969cf44ac Refactor C# code and support building nuget packages for cross-platforms (#144) 2023-05-10 14:53:04 +08:00
cooldoomsday
0bc571f6ee Return timestamp info and tokens in offline ASR
Co-authored-by: zhangbaofeng@npnets.com <41259@Zbf>
2023-05-06 10:20:46 +08:00
Fangjun Kuang
8ce6fff9b6 ignore style check for files in csharp-api (#143) 2023-05-06 08:04:19 +08:00
PF Luo
8c6a6768d5 Add lm rescore to online-modified-beam-search (#133) 2023-05-05 21:23:54 +08:00
manyeyes
3b9c3db31d add online-api for csharp-api (#139)
Co-authored-by: zx <12345678>
2023-05-05 11:07:53 +08:00
manyeyes
0c23962129 Add a C# api for offline-recognizer of sherpa-onnx (#129) 2023-05-05 10:25:06 +08:00
PF Luo
aa7108729b share GetHypsRowSplits interface and fix getting Topk not taking logprob (#131) 2023-04-26 11:41:04 +08:00
Fangjun Kuang
86017f9833 Add RNN LM rescore for offline ASR with modified_beam_search (#125) 2023-04-23 17:15:18 +08:00
manyeyes
d49a597431 Fix bug in DecodeMultipleOnlineStreams method in c-api.cc file (#128)
Co-authored-by: zx <12345678>
2023-04-21 13:48:44 +08:00
zhaomingwork
cd3c58c320 java decode example for microphone (#122) 2023-04-20 09:10:47 +08:00
Fangjun Kuang
4024bfab32 Add examples for Kotlin API (#124) 2023-04-19 17:29:35 +08:00
Fangjun Kuang
ad05f52666 Add timestamps for streaming ASR. (#123) 2023-04-19 16:02:37 +08:00
zhaomingwork
4b5d2887cb add java wrapper suppport (#117) 2023-04-15 22:17:28 +08:00
Fangjun Kuang
916e5adc1e Release v1.4.1 (#121) 2023-04-12 19:03:37 +08:00
彭震东
d781fcdeef Use log probs for paraformer (#120)
* Use log probs for paraformer

* Fix
2023-04-09 23:28:10 +08:00
Fangjun Kuang
80060c276d Begin to support CTC models (#119)
Please see https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-ctc/nemo/index.html for a list of pre-trained CTC models from NeMo.
2023-04-07 23:11:34 +08:00
Fangjun Kuang
9ac747248b Add SwiftUI demo project (#118)
* Commit after creating the project

* Add sherpa-onnx related files

* copy and modify files from sherpa-ncnn

* add app icon
2023-04-05 22:16:29 +08:00
Fangjun Kuang
ae1f9e7914 Publish sdist only when OS is Linux in CI (#116) 2023-04-04 23:00:14 +08:00
Fangjun Kuang
80b504ba24 Release v1.4.0 (#115)
- Support non-streaming ASR
- Support paraformer
- bugfixes
2023-04-04 19:40:06 +08:00
Fangjun Kuang
283e41c557 Fix displaying English words for paraformer models. (#114) 2023-04-04 16:44:37 +08:00
Fangjun Kuang
726680c5e0 Install binaries via pip install (#112)
When pepole use pip install sherpa-onnx, they also get the following binaries:

(py38) fangjuns-MacBook-Pro:bin fangjun$ ls -lh  sherpa-onnx*
-rwxr-xr-x  1 fangjun  staff    36K Apr  4 13:48 sherpa-onnx
-rwxr-xr-x  1 fangjun  staff    52K Apr  4 13:48 sherpa-onnx-microphone
-rwxr-xr-x  1 fangjun  staff    54K Apr  4 13:48 sherpa-onnx-microphone-offline
-rwxr-xr-x  1 fangjun  staff    37K Apr  4 13:48 sherpa-onnx-offline
-rwxr-xr-x  1 fangjun  staff   634K Apr  4 13:48 sherpa-onnx-offline-websocket-server
-rwxr-xr-x  1 fangjun  staff   710K Apr  4 13:48 sherpa-onnx-online-websocket-client
-rwxr-xr-x  1 fangjun  staff   651K Apr  4 13:48 sherpa-onnx-online-websocket-server
(py38) fangjuns-MacBook-Pro:bin fangjun$ pwd
/Users/fangjun/py38/bin
2023-04-04 15:45:59 +08:00
KajiMaCN
7f7e3680c3 Modify the rule attribute data type of OnlineRecognizer (#113) 2023-04-04 15:42:56 +08:00
Fangjun Kuang
5d3c8edbc9 add python tests (#111) 2023-04-02 23:05:30 +08:00
manyeyes
3f7e0c23ac adding a python api for offline decode (#110) 2023-04-02 13:17:43 +08:00
eee
94d77fa52e remove sherpa_onnx.Display (#109)
* fix garbled console output with chinese characters

* use print to instead  sherpa_onnx.Display
2023-04-01 18:14:33 +08:00
Fangjun Kuang
f9f9f20d37 Test int8 models (#107)
* Test int8 models

* Fix displaying help messages

* small fixes

* Fix jni test
2023-03-31 22:34:10 +08:00
eee
c0620a1fe1 fix garbled console output with chinese characters (#108) 2023-03-31 22:26:47 +08:00
Fangjun Kuang
f69b4e9396 Add tests for zipformer (#106) 2023-03-31 04:47:20 +08:00
Fangjun Kuang
b535d6db59 Fix building on Linux (#105)
* Fix building on Linux

* Fix style issues
2023-03-30 19:56:59 +08:00
Fangjun Kuang
b911915a32 Add microphone support for offline recognizer (#104) 2023-03-30 19:43:05 +08:00
Fangjun Kuang
6707ec4124 add offline websocket server/client (#98) 2023-03-29 21:48:45 +08:00
Jason's Lab
5e5620ea23 add zipformer model_type (#97)
Co-authored-by: lichao <www.563042811@qq.com>
2023-03-29 20:36:39 +08:00
Fangjun Kuang
423d89e9a5 Support paraformer. (#95) 2023-03-28 17:59:54 +08:00
Fangjun Kuang
dffb0fd43c Refactor offline recognizer. (#94)
* Refactor offline recognizer.

The purpose is to make it easier to support different types of models.
2023-03-27 14:59:40 +08:00
Fangjun Kuang
5572246253 Add non-streaming ASR (#92) 2023-03-26 08:53:42 +08:00
Fangjun Kuang
6f92bc7362 Fix issues about DLLs when installing pre-compiled wheels on windows (#91) 2023-03-18 13:55:59 +08:00
Fangjun Kuang
355c5ef541 fix typos in comments (#90) 2023-03-18 10:44:10 +08:00
manyeyes
2f9cd1007e add "import sys", (#89) 2023-03-16 10:49:37 +08:00
Fangjun Kuang
8c6a289e3d Add PackPaddedSequence (#85) 2023-03-08 14:12:20 +08:00
frankyoujian
3a79115884 fix cmake parse path string error (#88)
Co-authored-by: Jian You (jianyou) <jianyou@cisco.com>
2023-03-07 18:01:29 +08:00
Fangjun Kuang
1cf2c0417a Ci windows x86 (#87)
* CI for 32-bit windows

* small fixes
2023-03-07 17:14:52 +08:00
frankyoujian
822e05f169 Support windows x86 platform (#86)
* support windows x86 platform

* improve cmake

* resolve comments

---------

Co-authored-by: Jian You (jianyou) <jianyou@cisco.com>
2023-03-07 16:47:47 +08:00