Commit 93c9058
Add MedASR medical speech recognition notebook
This notebook demonstrates converting Google's MedASR model to OpenVINO
with FP16 and INT8 quantization for efficient medical speech recognition.
Features:
- HuggingFace authentication with notebook_login for gated model access
- Model conversion using torch.export and ov.convert_model
- INT8 quantization with NNCF using real audio calibration data
- Comprehensive accuracy validation (97.98% token-level accuracy)
- Performance benchmarking on CPU and GPU
- Model compression: 402 MB -> 102 MB (3.9x reduction)
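The accuracy and compression figures in the list above could be reproduced along these lines. This is a minimal sketch, not the notebook's code: `token_accuracy` and `compression_ratio` are hypothetical helpers, and the position-by-position comparison of token IDs is an assumption about how "token-level accuracy" is defined here. The token IDs in the demo are made-up illustration values; only the 402 MB and 102 MB sizes come from the commit message.

```python
def token_accuracy(reference_ids, candidate_ids):
    """Fraction of positions where the candidate token ID equals the reference.

    Assumption: both sequences are decoded from the same fixed-length input,
    so they are compared position by position. Positions present in only one
    sequence count as mismatches.
    """
    total = max(len(reference_ids), len(candidate_ids))
    if total == 0:
        return 1.0  # two empty sequences agree trivially
    matches = sum(r == c for r, c in zip(reference_ids, candidate_ids))
    return matches / total


def compression_ratio(original_mb, compressed_mb):
    """Size reduction factor, e.g. 402 MB -> 102 MB."""
    if compressed_mb <= 0:
        raise ValueError("compressed size must be positive")
    return original_mb / compressed_mb


if __name__ == "__main__":
    # Hypothetical token IDs from a reference model and a quantized model.
    reference = [5, 9, 9, 2, 0, 7]
    quantized = [5, 9, 4, 2, 0, 7]
    print(f"token accuracy: {token_accuracy(reference, quantized):.2%}")

    # Sizes taken from the commit message.
    print(f"compression: {compression_ratio(402, 102):.1f}x")  # prints "compression: 3.9x"
```

An alignment-based metric such as word error rate would handle insertions and deletions more gracefully. The position-wise version is shown only because it is the simplest reading of "token-level accuracy" for fixed-shape inputs.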
The notebook includes the complete workflow from model loading to deployment,
with support for 10-second audio chunks (static shape [1, 998, 128]).
1 parent 25f4d2e commit 93c9058
File tree
2 files changed, +1002 -0 lines changed
- notebooks/medasr-medical-asr