Improved speech-to-text module
- Migration to Faster-Whisper after benchmarking:
- Improved speed (halved translation time).
- Higher accuracy in noisy environments.
- Dynamic Integration of "Hot Words":
- Context-specific vocabulary dynamically adjusted.
- Increases robustness and accuracy for uncommon terms.
STT Benchmark
| File (10s) |
Size (MB) |
Faster-whisper accuracy |
Time (s) |
Whisper accuracy |
Time (s) |
| test1.wav |
1.22 |
85.7% |
0.64 |
71.4% |
1.25 |
| test2.wav |
1.22 |
77.8% |
0.71 |
33.3% |
1.44 |
| test3.wav |
1.22 |
71.4% |
0.66 |
57.1% |
1.13 |
| test4.wav |
1.22 |
80% |
0.70 |
60% |
1.36 |
| test5.wav |
1.53 |
71.4% |
4.68 |
71.4% |
4.5 |
| test6.wav |
1.83 |
42.9% |
0.63 |
28.6% |
1.03 |
| test7.wav |
1.83 |
90% |
0.64 |
90% |
0.87 |
| test8.wav |
1.83 |
83.3% |
0.61 |
66.7% |
0.99 |
| test9.wav |
1.83 |
100% |
0.62 |
100% |
0.94 |
| test10.wav |
1.83 |
100% |
0.58 |
100% |
0.77 |