Commit Graph

57 Commits

Author SHA1 Message Date
Fangjun Kuang
9b67a476e6 Refactor the JNI interface to make it more modular and maintainable (#802) 2024-04-24 09:48:42 +08:00
Fangjun Kuang
7f3b9ffe5d Refactor TTS Android code to support jieba for Chinese TTS models (#800) 2024-04-22 17:21:05 +08:00
Fangjun Kuang
c1608b3524 Support CED models (#792) 2024-04-19 15:20:37 +08:00
Fangjun Kuang
d97a283dbb Add Android demo for spoken language identification using Whisper multilingual models (#783) 2024-04-18 14:33:59 +08:00
Fangjun Kuang
3a43049ba1 Add JNI support for spoken language identification (#782) 2024-04-17 19:27:15 +08:00
Fangjun Kuang
69440e481f Add WearOS demo for audio tagging (#777) 2024-04-17 12:22:17 +08:00
Fangjun Kuang
bcd9e48150 Add Android demo for audio tagging (#776)
See https://k2-fsa.github.io/sherpa/onnx/audio-tagging/apk.html
2024-04-16 20:47:16 +08:00
Fangjun Kuang
042976ea6e Add C++ microphone examples for audio tagging (#749) 2024-04-10 21:00:35 +08:00
Fangjun Kuang
6fb8ceda57 Add VAD examples using ALSA for recording (#739) 2024-04-08 16:41:01 +08:00
Fangjun Kuang
a5f8fbc83f Support heteronyms in Chinese TTS (#738) 2024-04-08 11:01:30 +08:00
Fangjun Kuang
bd66f7a7d0 Build Android TTS APKs for coqui-ai/TTS models (#704) 2024-03-26 14:05:26 +08:00
Masoud
e60c897ce7 Update MainActivity.kt (#693)
fix read-only test text box
2024-03-24 07:29:14 +08:00
Masoud
1c77457d61 Update MainActivity.kt (#687)
Appending a default text to test field.
To faster check the voices
2024-03-22 19:04:14 +08:00
GaryLaurenceauAva
ac43c2d7b6 Expose 'language' 'task' 'tailPaddings' in OfflineWhisperModelConfig (#643)
Co-authored-by: Gary <gary.laurenceau@gmail.com>
2024-03-08 19:52:30 +08:00
Fangjun Kuang
d2cc48ded5 Add more Chinese TTS models (Mandarin and Cantonese) (#589) 2024-02-20 15:05:35 +08:00
Paolo
92a8fd64f0 updated the icon on TTS engine for android (#579) 2024-02-19 10:25:01 +08:00
ductranminh
665b869f03 Add context biasing for mobile (#568) 2024-02-01 21:33:22 +08:00
Fangjun Kuang
44efff4e47 Fix CI tests for Python and JNI. (#554) 2024-01-27 13:01:54 +08:00
Fangjun Kuang
7ae73e75ba Run TTS engine service without starting the app. (#553) 2024-01-26 22:28:21 +08:00
Fangjun Kuang
4fbad6a368 Ensure input for speaker ID is a valid number. (#552)
Fix #547
2024-01-26 20:42:10 +08:00
Fangjun Kuang
bbd7c7fc18 Add Android demo for speaker recognition (#536)
See pre-built Android APKs at 
https://k2-fsa.github.io/sherpa/onnx/speaker-identification/apk.html
2024-01-23 16:50:52 +08:00
Wei Kang
b6c020901a decoder for open vocabulary keyword spotting (#505)
* various fixes to ContextGraph to support open vocabulary keywords decoder

* Add keyword spotter runtime

* Add binary

* First version works

* Minor fixes

* update text2token

* default values

* Add jni for kws

* add kws android project

* Minor fixes

* Remove unused interface

* Minor fixes

* Add workflow

* handle extra info in texts

* Minor fixes

* Add more comments

* Fix ci

* fix cpp style

* Add input box in android demo so that users can specify their keywords

* Fix cpp style

* Fix comments

* Minor fixes

* Minor fixes

* minor fixes

* Minor fixes

* Minor fixes

* Add CI

* Fix code style

* cpplint

* Fix comments

* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
bf1dd3daf6 Refactor the UI of Android TTS engine (#533) 2024-01-17 12:12:50 +08:00
Fangjun Kuang
f4e3f45664 Fix setting speaker ID for Android TTS Engine. (#530) 2024-01-15 11:46:57 +08:00
Fangjun Kuang
d01142173a Add missing field for two-pass APK. (#511) 2024-01-03 12:51:54 +08:00
Fangjun Kuang
581eceb4d5 Build text-to-speech engine APKs (#509) 2024-01-01 12:44:20 +08:00
Fangjun Kuang
d7e10bb3f8 Replace Android system TTS engine (#508) 2023-12-31 23:02:35 +08:00
Fangjun Kuang
e475e750ac Support streaming zipformer CTC (#496)
* Support streaming zipformer CTC

* test online zipformer2 CTC

* Update doc of sherpa-onnx.cc

* Add Python APIs for streaming zipformer2 ctc

* Add Python API examples for streaming zipformer2 ctc

* Swift API for streaming zipformer2 CTC

* NodeJS API for streaming zipformer2 CTC

* Kotlin API for streaming zipformer2 CTC

* Golang API for streaming zipformer2 CTC

* C# API for streaming zipformer2 CTC

* Release v1.9.6
2023-12-22 13:46:33 +08:00
Fangjun Kuang
80d0192325 Fix android tts audio buffer size and fix CI. (#478) 2023-12-10 18:25:50 +08:00
Fangjun Kuang
0f053d8040 Support playing as it is generating for Android (#477) 2023-12-09 16:36:38 +08:00
Fangjun Kuang
23cf92daf7 Use espeak-ng for coqui-ai/TTS VITS English models. (#466) 2023-12-06 11:00:38 +08:00
Fangjun Kuang
62dc3c3e46 Use piper-phonemize to convert text to token IDs (#453) 2023-11-30 23:57:43 +08:00
Fangjun Kuang
86baf43c6b support reading rule FST for Android TTS (#410) 2023-11-06 10:38:40 +08:00
Fangjun Kuang
1544a577e0 Upload TTS APKs to huggingface (#400) 2023-10-29 18:30:43 +08:00
Fangjun Kuang
fbf4c903e1 Support German TTS (#394) 2023-10-27 11:12:45 +08:00
Fangjun Kuang
789a8be73b Add Android TTS demo (#383) 2023-10-24 21:31:28 +08:00
yujinqiu
d01682d968 Add vad clear api for better performance (#366)
* Add vad clear api for better performance

* rename to make naming consistent and remove macro

* Fix linker error

* Fix Vad.kt
2023-10-16 14:40:47 +08:00
longshiming
55b6893885 add SherpaOnnxWebSocket (#362)
* add SherpaOnnxWebSocket

* add SherpaOnnxWebSocket

* update text show style

---------

Co-authored-by: longshiming <longshiming@greesoft.com>
2023-10-16 14:39:32 +08:00
Fangjun Kuang
7649bd862c Fix building APKs (#337) 2023-09-24 14:16:14 +08:00
Fangjun Kuang
4ca60e2603 Add VAD + Non-streaming ASR model for Android (#336) 2023-09-23 22:48:38 +08:00
Fangjun Kuang
6e60a77d89 Add Android APK for Silero VAD (#335) 2023-09-23 20:39:13 +08:00
Wei Kang
d7eab95439 Add java api for hotwords (#319)
* Add java api

* support websocket

* Fix kotlin
2023-09-18 22:44:29 +08:00
Fangjun Kuang
3a20e332bf Fix symlink for Windows (#312) 2023-09-15 16:03:09 +08:00
Fangjun Kuang
087367d7fe Build APKs for two-pass speech recognition (#308) 2023-09-12 18:18:24 +08:00
Fangjun Kuang
debab7c091 Add two-pass speech recognition Android/iOS demo (#304) 2023-09-12 15:40:16 +08:00
Fangjun Kuang
49ec7e8f57 Fix building APK (#286) 2023-08-18 15:28:06 +08:00
Fangjun Kuang
e955f9c752 Release pre-built APKs (#285) 2023-08-18 14:28:44 +08:00
Fangjun Kuang
35526e26e1 Support paraformer on Android (#264) 2023-08-14 12:26:15 +08:00
Fangjun Kuang
de2673680e Fix model_type for jni, c# and iOS. (#216) 2023-07-14 22:24:38 +08:00
Fangjun Kuang
b2e0c4c9c2 Update iOS and Android demos to include the latest streaming zipformer (#189) 2023-06-27 18:11:37 +08:00