Fangjun Kuang
fd5a0d1e00
Add C++ runtime for Tele-AI/TeleSpeech-ASR ( #970 )
2024-06-05 00:26:40 +08:00
Fangjun Kuang
f1cff83ef9
Add address sanitizer and undefined behavior sanitizer ( #951 )
2024-05-31 13:17:01 +08:00
Fangjun Kuang
bcaa6df389
Add VAD demo for Java API ( #928 )
2024-05-28 14:59:47 +08:00
Wei Kang
b012b78ceb
Encode hotwords in C++ side ( #828 )
* Encode hotwords in C++ side
2024-05-20 19:41:36 +08:00
Fangjun Kuang
65635b09d8
Fix a typo in jni ( #885 )
2024-05-16 14:31:45 +08:00
linziguan
d2745698c5
Support building JNI on Windows ( #881 )
2024-05-16 06:25:53 +08:00
Fangjun Kuang
db85b2c1d8
Add Android APKs for NeMo CTC models. ( #866 )
2024-05-12 14:58:36 +08:00
Fangjun Kuang
fcd6024200
Fix typos in JNI TTS ( #824 )
2024-05-01 14:14:24 +08:00
Fangjun Kuang
5407f880c0
Add Java and Kotlin API for punctuation models ( #818 )
2024-04-26 22:06:48 +08:00
Fangjun Kuang
f7b3735621
Add CTC HLG decoding for JNI ( #810 )
2024-04-25 17:20:02 +08:00
Fangjun Kuang
c3a2e8a67c
Refactor Java API ( #806 )
2024-04-24 18:41:48 +08:00
Fangjun Kuang
9b67a476e6
Refactor the JNI interface to make it more modular and maintainable ( #802 )
2024-04-24 09:48:42 +08:00
Fangjun Kuang
7f3b9ffe5d
Refactor TTS Android code to support jieba for Chinese TTS models ( #800 )
2024-04-22 17:21:05 +08:00
Fangjun Kuang
c1608b3524
Support CED models ( #792 )
2024-04-19 15:20:37 +08:00
Fangjun Kuang
d97a283dbb
Add Android demo for spoken language identification using Whisper multilingual models ( #783 )
2024-04-18 14:33:59 +08:00
Fangjun Kuang
3a43049ba1
Add JNI support for spoken language identification ( #782 )
2024-04-17 19:27:15 +08:00
Fangjun Kuang
bcd9e48150
Add Android demo for audio tagging ( #776 )
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
2024-04-16 20:47:16 +08:00
Fangjun Kuang
5981adf454
Add Kotlin API for audio tagging ( #770 )
2024-04-15 13:49:35 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
2e0bccad36
Add C API for speaker embedding extractor. ( #711 )
2024-03-28 18:05:40 +08:00
Leo Huang
638f48f47a
Added progress for callback of tts generator ( #712 )
Co-authored-by: leohwang <leohwang@360converter.com>
2024-03-28 17:12:20 +08:00
longshiming
de655e838e
delete incorrect logs ( #714 )
Co-authored-by: longshiming <longshiming@greesoft.com>
2024-03-28 10:49:45 +08:00
Fangjun Kuang
4e040c596e
Support including TTS conditionally. ( #699 )
2024-03-26 17:21:35 +08:00
GaryLaurenceauAva
ac43c2d7b6
Expose 'language' 'task' 'tailPaddings' in OfflineWhisperModelConfig ( #643 )
Co-authored-by: Gary <gary.laurenceau@gmail.com>
2024-03-08 19:52:30 +08:00
Fangjun Kuang
fb04366179
Fix #608 ( #610 )
Fix java tests.
2024-02-26 13:49:37 +08:00
ductranminh
665b869f03
Add context biasing for mobile ( #568 )
2024-02-01 21:33:22 +08:00
Fangjun Kuang
bbd7c7fc18
Add Android demo for speaker recognition ( #536 )
See pre-built Android APKs at
https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
2024-01-23 16:50:52 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting ( #505 )
* various fixes to ContextGraph to support open vocabulary keywords decoder
* Add keyword spotter runtime
* Add binary
* First version works
* Minor fixes
* update text2token
* default values
* Add jni for kws
* add kws android project
* Minor fixes
* Remove unused interface
* Minor fixes
* Add workflow
* handle extra info in texts
* Minor fixes
* Add more comments
* Fix ci
* fix cpp style
* Add input box in android demo so that users can specify their keywords
* Fix cpp style
* Fix comments
* Minor fixes
* Minor fixes
* minor fixes
* Minor fixes
* Minor fixes
* Add CI
* Fix code style
* cpplint
* Fix comments
* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
d7e10bb3f8
Replace Android system TTS engine ( #508 )
2023-12-31 23:02:35 +08:00
Fangjun Kuang
e475e750ac
Support streaming zipformer CTC ( #496 )
* Support streaming zipformer CTC
* test online zipformer2 CTC
* Update doc of sherpa-onnx.cc
* Add Python APIs for streaming zipformer2 ctc
* Add Python API examples for streaming zipformer2 ctc
* Swift API for streaming zipformer2 CTC
* NodeJS API for streaming zipformer2 CTC
* Kotlin API for streaming zipformer2 CTC
* Golang API for streaming zipformer2 CTC
* C# API for streaming zipformer2 CTC
* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
0f053d8040
Support playing as it is generating for Android ( #477 )
2023-12-09 16:36:38 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs ( #453 )
2023-11-30 23:57:43 +08:00
Fangjun Kuang
86baf43c6b
support reading rule FST for Android TTS ( #410 )
2023-11-06 10:38:40 +08:00
Fangjun Kuang
0fdb2044e1
Add jni interface and kotlin API examples for TTS. ( #381 )
2023-10-23 12:31:54 +08:00
yujinqiu
d01682d968
Add vad clear api for better performance ( #366 )
* Add vad clear api for better performance
* rename to make naming consistent and remove macro
* Fix linker error
* Fix Vad.kt
2023-10-16 14:40:47 +08:00
Fangjun Kuang
6e60a77d89
Add Android APK for Silero VAD ( #335 )
2023-09-23 20:39:13 +08:00
Wei Kang
d7eab95439
Add java api for hotwords ( #319 )
* Add java api
* support websocket
* Fix kotlin
2023-09-18 22:44:29 +08:00
Fangjun Kuang
debab7c091
Add two-pass speech recognition Android/iOS demo ( #304 )
2023-09-12 15:40:16 +08:00
Fangjun Kuang
35526e26e1
Support paraformer on Android ( #264 )
2023-08-14 12:26:15 +08:00
Fangjun Kuang
79c2ce5dd4
Refactor online recognizer ( #250 )
* Refactor online recognizer.
Make it easier to support other streaming models.
Note that it is a breaking change for the Python API.
`sherpa_onnx.OnlineRecognizer()` used before should be
replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.
2023-08-09 20:27:31 +08:00
Wilson Wongso
64efbd82af
Implement Tokens in Swift and Kotlin ( #227 )
Co-authored-by: duc <duc@appiphany.com.au>
2023-08-05 18:37:03 +08:00
Fangjun Kuang
de2673680e
Fix model_type for jni, c# and iOS. ( #216 )
2023-07-14 22:24:38 +08:00
PF Luo
8c6a6768d5
Add lm rescore to online-modified-beam-search ( #133 )
2023-05-05 21:23:54 +08:00
Fangjun Kuang
4024bfab32
Add examples for Kotlin API ( #124 )
2023-04-19 17:29:35 +08:00
Fangjun Kuang
ad05f52666
Add timestamps for streaming ASR. ( #123 )
2023-04-19 16:02:37 +08:00
zhaomingwork
4b5d2887cb
add java wrapper support ( #117 )
2023-04-15 22:17:28 +08:00
Fangjun Kuang
5572246253
Add non-streaming ASR ( #92 )
2023-03-26 08:53:42 +08:00
Fangjun Kuang
5f31b22c12
Fix modified beam search for iOS and android ( #76 )
* Use Int type for sampling rate
* Fix swift
* Fix iOS
2023-03-03 15:18:31 +08:00
Fangjun Kuang
9064b3f016
Support Android ( #59 )
2023-02-24 13:57:03 +08:00
Fangjun Kuang
5a5d029490
Add build script for Android armv8a ( #58 )
2023-02-22 22:36:05 +08:00