This repository has been archived on 2025-08-26. You can view files and clone it, but cannot push or open issues or pull requests.
Files
enginex-mr_series-sherpa-onnx/sherpa-onnx/csrc
Karel Vesely 2e45d327a5 Adding temperature scaling on Joiner logits: (#789)
* Adding temperature scaling on Joiner logits:

- T hard-coded to 2.0
- so far best result NCE 0.122 (still not so high)
    - the BPE scores were rescaled with 0.2 (but then also incorrect words
      get high confidence, visually reasonable histograms are for 0.5 scale)
    - BPE->WORD score merging done by min(.) function
      (tried also prob-product, and also arithmetic, geometric, harmonic mean)

- without temperature scaling (i.e. scale 1.0), the best NCE was 0.032 (here product merging was best)

Results seem consistent with: https://arxiv.org/abs/2110.15222

Everything tuned on a very-small set of 100 sentences with 813 words and 10.2% WER, a Czech model.

I also experimented with blank posteriors mixed into the BPE confidences,
but no NCE improvement found, so not pushing that.

Temperature scling added also to the Greedy search confidences.

* making `temperature_scale` configurable from outside
2024-04-26 09:44:26 +08:00
..
2023-08-07 12:34:18 +08:00
2023-08-07 12:34:18 +08:00
2023-02-21 20:00:03 +08:00
2023-02-26 14:54:01 +08:00
2023-02-21 20:00:03 +08:00
2023-09-17 14:54:38 +08:00
2023-03-03 12:10:59 +08:00
2023-02-22 15:35:55 +08:00
2023-02-22 15:35:55 +08:00
2024-04-19 15:20:37 +08:00
2024-02-29 06:28:45 +08:00
2023-02-22 15:35:55 +08:00
2023-02-22 15:35:55 +08:00
2024-04-19 15:20:37 +08:00
2024-04-19 15:20:37 +08:00
2024-04-19 15:20:37 +08:00
2024-04-19 15:20:37 +08:00
2024-04-16 09:46:15 +08:00
2023-03-26 08:53:42 +08:00
2023-03-26 08:53:42 +08:00
2023-03-05 22:02:50 +08:00
2023-03-05 22:02:50 +08:00
2023-03-05 22:02:50 +08:00
2023-03-31 22:34:10 +08:00
2024-03-01 16:39:48 +08:00
2024-03-01 16:39:48 +08:00
2023-09-17 14:54:38 +08:00
2023-03-26 08:53:42 +08:00
2024-04-24 18:41:48 +08:00
2023-03-26 08:53:42 +08:00
2023-03-26 08:53:42 +08:00
2023-03-26 08:53:42 +08:00
2024-04-16 09:46:15 +08:00
2023-05-12 15:57:44 +08:00
2023-08-07 12:34:18 +08:00
2023-04-07 23:11:34 +08:00
2023-02-21 20:00:03 +08:00
2023-02-26 14:54:01 +08:00
2023-02-21 20:00:03 +08:00
2023-09-17 14:54:38 +08:00
2023-09-17 14:54:38 +08:00
2023-03-26 08:53:42 +08:00
2023-03-26 08:53:42 +08:00

File descriptions