Commit Graph

51 Commits

Author SHA1 Message Date
Fangjun Kuang
fd5a0d1e00 Add C++ runtime for Tele-AI/TeleSpeech-ASR (#970) 2024-06-05 00:26:40 +08:00
Fangjun Kuang
f1cff83ef9 Add address sanitizer and undefined behavior sanitizer (#951) 2024-05-31 13:17:01 +08:00
Fangjun Kuang
bcaa6df389 Add VAD demo for Java API (#928) 2024-05-28 14:59:47 +08:00
Wei Kang
b012b78ceb Encode hotwords in C++ side (#828)
* Encode hotwords in C++ side
2024-05-20 19:41:36 +08:00
Fangjun Kuang
65635b09d8 Fix a typo in jni (#885) 2024-05-16 14:31:45 +08:00
linziguan
d2745698c5 Support building JNI on Windows (#881) 2024-05-16 06:25:53 +08:00
Fangjun Kuang
db85b2c1d8 Add Android APKs for NeMo CTC models. (#866) 2024-05-12 14:58:36 +08:00
Fangjun Kuang
fcd6024200 Fix typos in JNI TTS (#824) 2024-05-01 14:14:24 +08:00
Fangjun Kuang
5407f880c0 Add Java and Kotlin API for punctuation models (#818) 2024-04-26 22:06:48 +08:00
Fangjun Kuang
f7b3735621 Add CTC HLG decoding for JNI (#810) 2024-04-25 17:20:02 +08:00
Fangjun Kuang
c3a2e8a67c Refactor Java API (#806) 2024-04-24 18:41:48 +08:00
Fangjun Kuang
9b67a476e6 Refactor the JNI interface to make it more modular and maintainable (#802) 2024-04-24 09:48:42 +08:00
Fangjun Kuang
7f3b9ffe5d Refactor TTS Android code to support jieba for Chinese TTS models (#800) 2024-04-22 17:21:05 +08:00
Fangjun Kuang
c1608b3524 Support CED models (#792) 2024-04-19 15:20:37 +08:00
Fangjun Kuang
d97a283dbb Add Android demo for spoken language identification using Whisper multilingual models (#783) 2024-04-18 14:33:59 +08:00
Fangjun Kuang
3a43049ba1 Add JNI support for spoken language identification (#782) 2024-04-17 19:27:15 +08:00
Fangjun Kuang
bcd9e48150 Add Android demo for audio tagging (#776)
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
2024-04-16 20:47:16 +08:00
Fangjun Kuang
5981adf454 Add Kotlin API for audio tagging (#770) 2024-04-15 13:49:35 +08:00
Fangjun Kuang
a5f8fbc83f Support heteronyms in Chinese TTS (#738) 2024-04-08 11:01:30 +08:00
Fangjun Kuang
2e0bccad36 Add C API for speaker embedding extractor. (#711) 2024-03-28 18:05:40 +08:00
Leo Huang
638f48f47a Added progress for callback of tts generator (#712)
Co-authored-by: leohwang <leohwang@360converter.com>
2024-03-28 17:12:20 +08:00
longshiming
de655e838e delete incorrect logs (#714)
Co-authored-by: longshiming <longshiming@greesoft.com>
2024-03-28 10:49:45 +08:00
Fangjun Kuang
4e040c596e Support including TTS conditionally. (#699) 2024-03-26 17:21:35 +08:00
GaryLaurenceauAva
ac43c2d7b6 Expose 'language' 'task' 'tailPaddings' in OfflineWhisperModelConfig (#643)
Co-authored-by: Gary <gary.laurenceau@gmail.com>
2024-03-08 19:52:30 +08:00
Fangjun Kuang
fb04366179 Fix #608 (#610)
Fix java tests.
2024-02-26 13:49:37 +08:00
ductranminh
665b869f03 Add context biasing for mobile (#568) 2024-02-01 21:33:22 +08:00
Fangjun Kuang
bbd7c7fc18 Add Android demo for speaker recognition (#536)
See pre-built Android APKs at 
https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
2024-01-23 16:50:52 +08:00
Wei Kang
b6c020901a decoder for open vocabulary keyword spotting (#505)
* various fixes to ContextGraph to support open vocabulary keywords decoder

* Add keyword spotter runtime

* Add binary

* First version works

* Minor fixes

* update text2token

* default values

* Add jni for kws

* add kws android project

* Minor fixes

* Remove unused interface

* Minor fixes

* Add workflow

* handle extra info in texts

* Minor fixes

* Add more comments

* Fix ci

* fix cpp style

* Add input box in android demo so that users can specify their keywords

* Fix cpp style

* Fix comments

* Minor fixes

* Minor fixes

* minor fixes

* Minor fixes

* Minor fixes

* Add CI

* Fix code style

* cpplint

* Fix comments

* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
d7e10bb3f8 Replace Android system TTS engine (#508) 2023-12-31 23:02:35 +08:00
Fangjun Kuang
e475e750ac Support streaming zipformer CTC (#496)
* Support streaming zipformer CTC

* test online zipformer2 CTC

* Update doc of sherpa-onnx.cc

* Add Python APIs for streaming zipformer2 ctc

* Add Python API examples for streaming zipformer2 ctc

* Swift API for streaming zipformer2 CTC

* NodeJS API for streaming zipformer2 CTC

* Kotlin API for streaming zipformer2 CTC

* Golang API for streaming zipformer2 CTC

* C# API for streaming zipformer2 CTC

* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
0f053d8040 Support playing as it is generating for Android (#477) 2023-12-09 16:36:38 +08:00
Fangjun Kuang
62dc3c3e46 Use piper-phonemize to convert text to token IDs (#453) 2023-11-30 23:57:43 +08:00
Fangjun Kuang
86baf43c6b support reading rule FST for Android TTS (#410) 2023-11-06 10:38:40 +08:00
Fangjun Kuang
0fdb2044e1 Add jni interface and kotlin API examples for TTS. (#381) 2023-10-23 12:31:54 +08:00
yujinqiu
d01682d968 Add vad clear api for better performance (#366)
* Add vad clear api for better performance

* rename to make naming consistent and remove macro

* Fix linker error

* Fix Vad.kt
2023-10-16 14:40:47 +08:00
Fangjun Kuang
6e60a77d89 Add Android APK for Silero VAD (#335) 2023-09-23 20:39:13 +08:00
Wei Kang
d7eab95439 Add java api for hotwords (#319)
* Add java api

* support websocket

* Fix kotlin
2023-09-18 22:44:29 +08:00
Fangjun Kuang
debab7c091 Add two-pass speech recognition Android/iOS demo (#304) 2023-09-12 15:40:16 +08:00
Fangjun Kuang
35526e26e1 Support paraformer on Android (#264) 2023-08-14 12:26:15 +08:00
Fangjun Kuang
79c2ce5dd4 Refactor online recognizer (#250)
* Refactor online recognizer.

Make it easier to support other streaming models.

Note that it is a breaking change for the Python API.
`sherpa_onnx.OnlineRecognizer()` used before should be
replaced by `sherpa_onnx.OnlineRecognizer.from_transducer()`.
2023-08-09 20:27:31 +08:00
Wilson Wongso
64efbd82af Implement Tokens in Swift and Kotlin (#227)
Co-authored-by: duc <duc@appiphany.com.au>
2023-08-05 18:37:03 +08:00
Fangjun Kuang
de2673680e Fix model_type for jni, c# and iOS. (#216) 2023-07-14 22:24:38 +08:00
PF Luo
8c6a6768d5 Add lm rescore to online-modified-beam-search (#133) 2023-05-05 21:23:54 +08:00
Fangjun Kuang
4024bfab32 Add examples for Kotlin API (#124) 2023-04-19 17:29:35 +08:00
Fangjun Kuang
ad05f52666 Add timestamps for streaming ASR. (#123) 2023-04-19 16:02:37 +08:00
zhaomingwork
4b5d2887cb add java wrapper suppport (#117) 2023-04-15 22:17:28 +08:00
Fangjun Kuang
5572246253 Add non-streaming ASR (#92) 2023-03-26 08:53:42 +08:00
Fangjun Kuang
5f31b22c12 Fix modified beam search for iOS and android (#76)
* Use Int type for sampling rate

* Fix swift

* Fix iOS
2023-03-03 15:18:31 +08:00
Fangjun Kuang
9064b3f016 Support Android (#59) 2023-02-24 13:57:03 +08:00
Fangjun Kuang
5a5d029490 Add build script for Android armv8a (#58) 2023-02-22 22:36:05 +08:00