Commit Graph

  • b445956675 Fix CI tests. (#898) Fangjun Kuang 2024-05-21 20:37:29 +08:00
  • fdcae56a14 Fix Go tests (#897) Fangjun Kuang 2024-05-21 11:50:13 +08:00
  • b012b78ceb Encode hotwords in C++ side (#828) Wei Kang 2024-05-20 19:41:36 +08:00
  • 8af2af8466 Add tail_paddings to Whisper C API. (#886) Fangjun Kuang 2024-05-17 09:20:07 +08:00
  • 65635b09d8 Fix a typo in jni (#885) Fangjun Kuang 2024-05-16 14:31:45 +08:00
  • a421f8c1df Fix Java API examples (#883) Fangjun Kuang 2024-05-16 12:16:17 +08:00
  • d2745698c5 Support building JNI on Windows (#881) linziguan 2024-05-16 06:25:53 +08:00
  • c2dcdabab1 Fix sherpa-onnx-node-version in node examples (#879) Fangjun Kuang 2024-05-15 14:32:30 +08:00
  • 03c956a317 Add keyword spotting API for node-addon-api (#877) Fangjun Kuang 2024-05-14 20:26:48 +08:00
  • 75630b986b Support adding puncutations to text for node-addon-api (#876) Fangjun Kuang 2024-05-14 19:28:56 +08:00
  • d19f50b799 Add audio tagging APIs for node-addon-api (#875) Fangjun Kuang 2024-05-14 17:32:30 +08:00
  • 388e6a98fc Add speaker identification APIs for node-addon-api (#874) Fangjun Kuang 2024-05-14 13:28:50 +08:00
  • 0895b64850 Refactor node-addon-api to remove duplicate. (#873) Fangjun Kuang 2024-05-14 10:08:11 +08:00
  • 939fdd942c Add spoken language identification for node-addon-api (#872) Fangjun Kuang 2024-05-13 20:26:11 +08:00
  • 031134b4d4 Add TTS for node-addon-api (#871) Fangjun Kuang 2024-05-13 19:24:09 +08:00
  • 740d7ae9d6 fixing bug and compiler error (#870) Manix 2024-05-13 15:14:03 +05:30
  • 697b960768 Add non-streaming ASR APIs for node-addon-api (#868) Fangjun Kuang 2024-05-13 16:03:34 +08:00
  • 384f96c40f Add streaming CTC ASR APIs for node-addon-api (#867) Fangjun Kuang 2024-05-13 11:58:25 +08:00
  • db85b2c1d8 Add Android APKs for NeMo CTC models. (#866) Fangjun Kuang 2024-05-12 14:58:36 +08:00
  • 7322f4e0a3 Fix node addon tests (#865) Fangjun Kuang 2024-05-12 12:03:43 +08:00
  • eee5d8a15c Add node-addon-api for VAD (#864) Fangjun Kuang 2024-05-11 20:58:23 +08:00
  • 677bc1da3e Add Speaker ID demo for C# (#862) Fangjun Kuang 2024-05-11 13:27:33 +08:00
  • a88b3bac21 Fix Python TTS examples for models using jieba. (#861) Fangjun Kuang 2024-05-11 09:21:51 +08:00
  • 65f5161456 Add more streaming ASR methods for node-addon-api (#860) Fangjun Kuang 2024-05-10 18:21:05 +08:00
  • 46e4e5b7ac Add C++ support for streaming NeMo CTC models. (#857) Fangjun Kuang 2024-05-10 16:26:43 +08:00
  • 1eb60e8711 Solve the issue of missing the last sentence with punctuation (#856) yh646492956 2024-05-10 15:41:42 +08:00
  • 17cd3a5f01 Add C++ runtime for non-streaming faster conformer transducer from NeMo. (#854) Fangjun Kuang 2024-05-10 12:15:39 +08:00
  • 5d8c35e44e Add C++ support for non-streaming NeMo fast conformer hybrid transducer ctc (the ctc branch) (#848) Fangjun Kuang 2024-05-09 15:32:22 +08:00
  • 5ed3ec1c04 Export non-streaming NeMo faster conformer hybrid transducer and ctc to sherpa-onnx (#847) Fangjun Kuang 2024-05-09 13:59:47 +08:00
  • 68b25abf27 Export NeMo FastConformer Hybrid Transducer Large Streaming to ONNX (#844) Fangjun Kuang 2024-05-08 19:07:49 +08:00
  • a9f936e92b Export NeMo FastConformer Hybrid Transducer-CTC Large Streaming to ONNX. (#843) Fangjun Kuang 2024-05-08 12:33:46 +08:00
  • dbaa26ff4b Publish node-addon-api npm package for linux arm64 (#841) Fangjun Kuang 2024-05-07 23:05:40 +08:00
  • d2e86b0415 Add links to pre-built APKs and pre-trained models to README. (#840) Fangjun Kuang 2024-05-07 12:28:42 +08:00
  • 37a4135dd7 Publish npm package with node-addon-api for Windows (#838) Fangjun Kuang 2024-05-06 16:21:29 +08:00
  • e1bb928805 Upload two more 3d-speaker models (#837) Fangjun Kuang 2024-05-06 12:23:49 +08:00
  • 9c8255fdb2 Update 3dspeaker/export-onnx.py (#836) chiiyeh 2024-05-06 12:10:35 +08:00
  • 4f758e6cd3 Publish node-addon-api wrapper for sherpa-onnx as npm packages (#829) Fangjun Kuang 2024-05-04 13:27:39 +08:00
  • 2f9553d838 Begin to add node-addon-api for sherpa-onnx (#826) Fangjun Kuang 2024-05-03 14:47:40 +08:00
  • fcd6024200 Fix typos in JNI TTS (#824) Fangjun Kuang 2024-05-01 14:14:24 +08:00
  • cff207623e Add Java API for speaker identification (#822) Fangjun Kuang 2024-04-29 21:23:56 +08:00
  • 88202f05bb Add Java API for audio tagging (#820) Fangjun Kuang 2024-04-28 22:26:04 +08:00
  • 5407f880c0 Add Java and Kotlin API for punctuation models (#818) Fangjun Kuang 2024-04-26 22:06:48 +08:00
  • db25986240 Add Java API for spoken language identification with whisper multilingual models (#817) Fangjun Kuang 2024-04-26 19:05:39 +08:00
  • f2d074aea9 Fix a bug for offline paraformer (#816) Fangjun Kuang 2024-04-26 16:40:42 +08:00
  • 612002da57 Fix C# to support Chinese tts models using jieba (#815) Fangjun Kuang 2024-04-26 11:50:07 +08:00
  • c693676d20 Fix building wheels for macOS (#814) Fangjun Kuang 2024-04-26 10:05:39 +08:00
  • 2e45d327a5 Adding temperature scaling on Joiner logits: (#789) Karel Vesely 2024-04-26 03:44:26 +02:00
  • 15772d2150 Add Java API for text-to-speech (#811) Fangjun Kuang 2024-04-26 09:26:39 +08:00
  • fa2429920f Add function 'tolowerUnicode' in sherpa-onnx-microphone (fix #791) (#812) Daniel Doña 2024-04-26 03:19:32 +02:00
  • f7b3735621 Add CTC HLG decoding for JNI (#810) Fangjun Kuang 2024-04-25 17:20:02 +08:00
  • 6686c7d3e6 Add dict_dir arg to c api to support Chinese TTS models using jieba (#809) Fangjun Kuang 2024-04-25 12:28:31 +08:00
  • 83cd533f67 Add Java API for non-streaming ASR (#807) Fangjun Kuang 2024-04-24 21:03:26 +08:00
  • c3a2e8a67c Refactor Java API (#806) Fangjun Kuang 2024-04-24 18:41:48 +08:00
  • c7691650d7 Fix CI tests (#804) Fangjun Kuang 2024-04-24 13:01:06 +08:00
  • 9b67a476e6 Refactor the JNI interface to make it more modular and maintainable (#802) Fangjun Kuang 2024-04-24 09:48:42 +08:00
  • dc5af04830 wget 续传 (#801) 布宝 2024-04-22 20:19:08 +08:00
  • 7f3b9ffe5d Refactor TTS Android code to support jieba for Chinese TTS models (#800) Fangjun Kuang 2024-04-22 17:21:05 +08:00
  • 494cb5c733 Fix the last character not being recognized for streaming paraformer models. (#799) Fangjun Kuang 2024-04-22 15:10:39 +08:00
  • 9a68b92ce6 Increase CED's max frame length to 3000 (#798) Fangjun Kuang 2024-04-22 10:18:47 +08:00
  • 6b353bfb42 Add jieba for Chinese TTS models (#797) Fangjun Kuang 2024-04-21 14:47:13 +08:00
  • 2e0ee0e8c8 fix a typo in building language ID apk (#795) Fangjun Kuang 2024-04-19 20:16:48 +08:00
  • 37831fe89c Release v1.9.22 (#794) Fangjun Kuang 2024-04-19 18:37:47 +08:00
  • 54bc504065 Add Python API example for CED audio tagging. (#793) Fangjun Kuang 2024-04-19 18:33:18 +08:00
  • c1608b3524 Support CED models (#792) Fangjun Kuang 2024-04-19 15:20:37 +08:00
  • d97a283dbb Add Android demo for spoken language identification using Whisper multilingual models (#783) Fangjun Kuang 2024-04-18 14:33:59 +08:00
  • 3a43049ba1 Add JNI support for spoken language identification (#782) Fangjun Kuang 2024-04-17 19:27:15 +08:00
  • 69440e481f Add WearOS demo for audio tagging (#777) Fangjun Kuang 2024-04-17 12:22:17 +08:00
  • bcd9e48150 Add Android demo for audio tagging (#776) Fangjun Kuang 2024-04-16 20:47:16 +08:00
  • aa2d695fd2 Add score function to speaker identification (#775) chiiyeh 2024-04-16 17:29:46 +08:00
  • 6bf2099781 Fix code style issues (#774) Fangjun Kuang 2024-04-16 09:46:15 +08:00
  • 81b7f1d529 Fix display for sherpa-onnx-microphone (#773) Fangjun Kuang 2024-04-16 09:17:23 +08:00
  • fb4aee83ac Adding warm up for Zipformer2 (#766) Manix 2024-04-16 06:46:55 +05:30
  • 5981adf454 Add Kotlin API for audio tagging (#770) Fangjun Kuang 2024-04-15 13:49:35 +08:00
  • 13730ecbd8 Add C API for punctuation (#768) Fangjun Kuang 2024-04-14 19:02:34 +08:00
  • b0265b258d Replace torchaudio with soundfile in python-api-examples (#765) gtf35 2024-04-13 23:39:07 +08:00
  • 983df28a83 Fix a punctuation bug (#764) Fangjun Kuang 2024-04-13 19:08:46 +08:00
  • b6ad0436fa Release v1.9.18 (#763) Fangjun Kuang 2024-04-13 16:34:15 +08:00
  • 68b8b88b5a Add Python API for punctuation models. (#762) Fangjun Kuang 2024-04-13 13:28:17 +08:00
  • 329fe1aa8b Support adding punctuations to the speech recogntion result (#761) Fangjun Kuang 2024-04-13 12:15:57 +08:00
  • 0f4705f775 Fix WASM for kws (#758) Fangjun Kuang 2024-04-12 18:57:21 +08:00
  • be4a2488a8 Use batch size 1 in generating subtitles. (#756) Fangjun Kuang 2024-04-11 15:58:11 +08:00
  • 399d920b47 [feature] Configurable padding length in online websocket server (#755) Manix 2024-04-11 12:27:11 +05:30
  • f204e62b44 Add C API for audio tagging (#754) Fangjun Kuang 2024-04-11 14:18:43 +08:00
  • 34d70a259f Add Python API and Python examples for audio tagging (#753) Fangjun Kuang 2024-04-11 11:12:48 +08:00
  • 904a3cc8a9 Fix a bug in mean calculation of 'ys_probs' (#748) AHN Sung Hwan 2024-04-11 11:34:44 +09:00
  • d21c45d0ea Add --continue to wget (#750) 布宝 2024-04-11 09:07:31 +08:00
  • 042976ea6e Add C++ microphone examples for audio tagging (#749) Fangjun Kuang 2024-04-10 21:00:35 +08:00
  • f20291cadc Support audio tagging using zipformer (#747) Fangjun Kuang 2024-04-10 14:47:06 +08:00
  • c9ae7595d5 Fix go API examples with portaudio on Windows. (#746) Fangjun Kuang 2024-04-10 09:56:35 +08:00
  • db1b3ab1f3 Fix building OpenFst on Windows. (#744) Fangjun Kuang 2024-04-09 11:17:46 +08:00
  • 0d90b34e4a Support Chinese heteronyms on Android for TTS. (#742) Fangjun Kuang 2024-04-08 21:36:47 +08:00
  • 6b3d2b87f9 Fix releasing GIL (#741) Fangjun Kuang 2024-04-08 17:22:48 +08:00
  • 6fb8ceda57 Add VAD examples using ALSA for recording (#739) Fangjun Kuang 2024-04-08 16:41:01 +08:00
  • a5f8fbc83f Support heteronyms in Chinese TTS (#738) Fangjun Kuang 2024-04-08 11:01:30 +08:00
  • c1c0f5bafd return timestamps for WebAssembly (#737) Fangjun Kuang 2024-04-05 20:24:27 +08:00
  • dbff2eaadb Add C API for streaming HLG decoding (#734) Fangjun Kuang 2024-04-05 10:31:20 +08:00
  • db67e00c77 Add HLG decoding for streaming CTC models (#731) Fangjun Kuang 2024-04-03 21:31:42 +08:00
  • f8832cb5f2 Add language identification swiftui demo (#729) yujinqiu 2024-04-01 20:34:14 +08:00
  • fabd30e3bb Fix microphone privacy config (#727) yujinqiu 2024-04-01 14:59:40 +08:00
  • 3acf373b07 add more piper models (#725) Fangjun Kuang 2024-04-01 11:39:52 +08:00