Improved speech-to-text module
- Migration to Faster-Whisper after benchmarking:
- Improved speed (halved translation time).
- Higher accuracy in noisy environments.
- Dynamic Integration of "Hot Words":
- Context-specific vocabulary dynamically adjusted.
- Increases robustness and accuracy for uncommon terms.
STT Benchmark
| File (10s) | Size (MB) | Faster-whisper accuracy | Time (s) | Whisper accuracy | Time (s) | 
|---|---|---|---|---|---|
| test1.wav | 1.22 | 85.7% | 0.64 | 71.4% | 1.25 | 
| test2.wav | 1.22 | 77.8% | 0.71 | 33.3% | 1.44 | 
| test3.wav | 1.22 | 71.4% | 0.66 | 57.1% | 1.13 | 
| test4.wav | 1.22 | 80% | 0.70 | 60% | 1.36 | 
| test5.wav | 1.53 | 71.4% | 4.68 | 71.4% | 4.5 | 
| test6.wav | 1.83 | 42.9% | 0.63 | 28.6% | 1.03 | 
| test7.wav | 1.83 | 90% | 0.64 | 90% | 0.87 | 
| test8.wav | 1.83 | 83.3% | 0.61 | 66.7% | 0.99 | 
| test9.wav | 1.83 | 100% | 0.62 | 100% | 0.94 | 
| test10.wav | 1.83 | 100% | 0.58 | 100% | 0.77 |