Fangjun Kuang
716ba8317b
Add C++ runtime for spleeter source separation ( #2242 )
2025-05-23 22:30:57 +08:00
Karel Vesely
f3d23aa170
cmake build, configurable from env ( #2115 )
- make sure the defaults in `cmake/cmake_extension.py` variable
`extra_cmake_args` can be overridden by `cmake_args` from
`SHERPA_ONNX_CMAKE_ARGS` env variable
- fix a bug in `sherpa-onnx/csrc/parse-options.cc` which appears
when using `-DSHERPA_ONNX_ENABLE_CHECK=ON`
- avoid copying binaries when these are disabled
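
The override behavior described in the first bullet can be sketched roughly as follows. This is a minimal illustration, not the actual code in `cmake/cmake_extension.py`: the helper name `merge_cmake_args` and the sample default flags are hypothetical, and it assumes options are written in the `-DNAME=VALUE` form so that a later definition can replace an earlier one.

```python
import os
import shlex


def merge_cmake_args(defaults: str, overrides: str) -> str:
    """Merge two strings of cmake flags; for -DNAME=VALUE, the override wins.

    Hypothetical helper for illustration only; the real logic in
    cmake/cmake_extension.py may differ.
    """
    merged = {}       # option name -> full "-DNAME=VALUE" token (insertion-ordered)
    passthrough = []  # any other tokens (e.g. "-G", "Ninja"), kept in order
    for token in shlex.split(defaults) + shlex.split(overrides):
        if token.startswith("-D") and "=" in token:
            name = token[2:].split("=", 1)[0]
            name = name.split(":", 1)[0]  # drop a type suffix such as NAME:BOOL
            merged[name] = token          # a later definition replaces an earlier one
        else:
            passthrough.append(token)
    return " ".join(list(merged.values()) + passthrough)


# Sample defaults (assumed values, standing in for `extra_cmake_args`)
extra_cmake_args = "-DSHERPA_ONNX_ENABLE_CHECK=OFF -DBUILD_SHARED_LIBS=ON"
# User-supplied flags from the environment (standing in for `cmake_args`)
cmake_args = os.environ.get("SHERPA_ONNX_CMAKE_ARGS", "")
print(merge_cmake_args(extra_cmake_args, cmake_args))
```

With `SHERPA_ONNX_CMAKE_ARGS="-DSHERPA_ONNX_ENABLE_CHECK=ON"` set in the environment, the sketch keeps a single `-DSHERPA_ONNX_ENABLE_CHECK` entry carrying the user's value rather than passing both definitions to `cmake`.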
2025-04-16 21:26:54 +08:00
Fangjun Kuang
0703bc1b86
Add CXX API for VAD ( #2077 )
2025-04-01 14:51:43 +08:00
Fangjun Kuang
1f52ac2126
add alsa example for vad+offline asr ( #2020 )
2025-03-18 20:06:24 +08:00
Fangjun Kuang
362ddf2c07
Add C++ demo for VAD+non-streaming ASR ( #1964 )
2025-03-07 11:49:46 +08:00
Fangjun Kuang
effd5ef2be
Add C++ API for streaming ASR. ( #1455 )
It is a wrapper around the C API.
2024-10-23 12:07:43 +08:00
Fangjun Kuang
59407edcad
C++ API for speaker diarization ( #1396 )
2024-10-09 12:01:20 +08:00
Fangjun Kuang
375c055ff8
Fix style issues for online punctuation source files ( #1225 )
2024-08-06 17:43:24 +08:00
Fangjun Kuang
dd0ff2ca06
Support onnxruntime 1.18.0 ( #906 )
2024-07-10 17:05:26 +08:00
Fangjun Kuang
a25075101c
Build sherpa-onnx as a single shared library ( #1078 )
When `-D BUILD_SHARED_LIBS=ON` is passed to `cmake`, it builds a single shared library.
Specifically,
- For C APIs, it builds `libsherpa-onnx-c-api.so`
- For Python APIs, it builds `_sherpa_onnx.cpython-xx-xx.so`
- For Kotlin and Java APIs, it builds `libsherpa-onnx-jni.so`
There is no `libsherpa-onnx-core.so` any longer.
Note it affects only shared libraries.
2024-07-06 16:41:54 +08:00
Fangjun Kuang
598c12c4e5
Fix CI tests ( #1061 )
2024-06-27 18:05:18 +08:00
Fangjun Kuang
d08cc04567
Add VAD example for Dart API ( #996 )
2024-06-14 10:37:16 +08:00
Fangjun Kuang
2db777587e
Fix CI tests. ( #907 )
2024-05-23 14:49:37 +08:00
Fangjun Kuang
c7691650d7
Fix CI tests ( #804 )
2024-04-24 13:01:06 +08:00
Fangjun Kuang
329fe1aa8b
Support adding punctuations to the speech recognition result ( #761 )
2024-04-13 12:15:57 +08:00
Fangjun Kuang
042976ea6e
Add C++ microphone examples for audio tagging ( #749 )
2024-04-10 21:00:35 +08:00
Fangjun Kuang
f20291cadc
Support audio tagging using zipformer ( #747 )
2024-04-10 14:47:06 +08:00
Fangjun Kuang
6fb8ceda57
Add VAD examples using ALSA for recording ( #739 )
2024-04-08 16:41:01 +08:00
Fangjun Kuang
a5f8fbc83f
Support heteronyms in Chinese TTS ( #738 )
2024-04-08 11:01:30 +08:00
Fangjun Kuang
0d258dd150
Support spoken language identification with whisper ( #694 )
2024-03-24 22:57:00 +08:00
Fangjun Kuang
87a7030c08
Support using alsa to access the microphone with non-streaming ASR models ( #517 )
2024-02-26 21:17:26 +08:00
Fangjun Kuang
67acd34dcd
Use alsa to read microphone in speaker identification demo. ( #605 )
2024-02-23 19:27:51 +08:00
Fangjun Kuang
0b18ccfbb2
C++ API demo for speaker identification with portaudio. ( #561 )
2024-01-30 11:21:43 +08:00
Wei Kang
b6c020901a
decoder for open vocabulary keyword spotting ( #505 )
* various fixes to ContextGraph to support open vocabulary keywords decoder
* Add keyword spotter runtime
* Add binary
* First version works
* Minor fixes
* update text2token
* default values
* Add jni for kws
* add kws android project
* Minor fixes
* Remove unused interface
* Minor fixes
* Add workflow
* handle extra info in texts
* Minor fixes
* Add more comments
* Fix ci
* fix cpp style
* Add input box in android demo so that users can specify their keywords
* Fix cpp style
* Fix comments
* Minor fixes
* Minor fixes
* minor fixes
* Minor fixes
* Minor fixes
* Add CI
* Fix code style
* cpplint
* Fix comments
* Fix error
2024-01-20 22:52:41 +08:00
Fangjun Kuang
33c03f78b2
Fix CI ( #485 )
2023-12-15 10:25:03 +08:00
Fangjun Kuang
9ff6185b7c
fix building linux x86 wheels ( #484 )
2023-12-14 21:37:40 +08:00
Fangjun Kuang
b18812ceff
Play generated audio using alsa for TTS ( #482 )
2023-12-13 22:28:03 +08:00
Fangjun Kuang
99ff6a834c
Play generated audio as it is generating. ( #457 )
2023-12-02 15:35:11 +08:00
Fangjun Kuang
62dc3c3e46
Use piper-phonemize to convert text to token IDs ( #453 )
2023-11-30 23:57:43 +08:00
Fangjun Kuang
68f0e59688
Add a C++ example to show streaming VAD + non-streaming ASR. ( #420 )
2023-11-11 22:54:27 +08:00
Fangjun Kuang
0df0a735d7
Fix building wheels for windows ( #365 )
2023-10-14 23:14:45 +08:00
Fangjun Kuang
655e0fa836
add python API and examples for TTS ( #364 )
2023-10-14 14:21:53 +08:00
Fangjun Kuang
c471423125
Add Silero VAD ( #313 )
2023-09-17 14:54:38 +08:00
Fangjun Kuang
c88e672733
Fix building wheels on Windows ( #229 )
2023-08-01 13:45:02 +08:00
Fangjun Kuang
7969cf44ac
Refactor C# code and support building nuget packages for cross-platforms ( #144 )
2023-05-10 14:53:04 +08:00
Fangjun Kuang
726680c5e0
Install binaries via pip install ( #112 )
When people use `pip install sherpa-onnx`, they also get the following binaries:
(py38) fangjuns-MacBook-Pro:bin fangjun$ ls -lh sherpa-onnx*
-rwxr-xr-x 1 fangjun staff 36K Apr 4 13:48 sherpa-onnx
-rwxr-xr-x 1 fangjun staff 52K Apr 4 13:48 sherpa-onnx-microphone
-rwxr-xr-x 1 fangjun staff 54K Apr 4 13:48 sherpa-onnx-microphone-offline
-rwxr-xr-x 1 fangjun staff 37K Apr 4 13:48 sherpa-onnx-offline
-rwxr-xr-x 1 fangjun staff 634K Apr 4 13:48 sherpa-onnx-offline-websocket-server
-rwxr-xr-x 1 fangjun staff 710K Apr 4 13:48 sherpa-onnx-online-websocket-client
-rwxr-xr-x 1 fangjun staff 651K Apr 4 13:48 sherpa-onnx-online-websocket-server
(py38) fangjuns-MacBook-Pro:bin fangjun$ pwd
/Users/fangjun/py38/bin
2023-04-04 15:45:59 +08:00
Fangjun Kuang
6f92bc7362
Fix issues about DLLs when installing pre-compiled wheels on windows ( #91 )
2023-03-18 13:55:59 +08:00
Fangjun Kuang
8d7455a978
Fix building pre-compiled wheels ( #68 )
2023-02-28 11:11:26 +08:00
Fangjun Kuang
ea09d5fbc5
Add Python API ( #31 )
2023-02-19 19:36:03 +08:00