4 Commits

Author SHA1 Message Date
Fangjun Kuang
e2b2d5ea57 Add CXX examples for NeMo TDT ASR. (#2363)
# New Features
- Added new example programs demonstrating streaming speech recognition from a microphone using Parakeet-TDT CTC and Zipformer Transducer models with voice activity detection.
- These examples support microphone input via PortAudio and display recognized text incrementally.

# Bug Fixes
- Improved error handling and logic when opening microphone devices in several example programs for more reliable device initialization.

# Chores
- Updated build configuration to include new executable examples when PortAudio support is enabled.
2025-07-09 18:30:42 +08:00
Fangjun Kuang
2b2788332e Add C++ support for UVR models (#2269) 2025-06-01 17:22:08 +08:00
mtdxc
613e8084c2 move portaudio common record code to microphone (#2264)
Co-authored-by: cqm <cqm@97kid.com>
2025-05-31 21:48:41 +08:00
Fangjun Kuang
b269e5cccc Add C++ example for real-time ASR with nvidia/parakeet-tdt-0.6b-v2. (#2201) 2025-05-11 16:30:38 +08:00