You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: recipes_source/torch_export_challenges_solutions.rst
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -142,12 +142,12 @@ for ``torch.export`` can be found in the export tutorial. The code shown below d
142
142
143
143
144
144
145
-
Pose Estimation
145
+
Automatic Speech Recognition
146
146
---------------
147
147
148
-
**Pose Estimation** is a Computer Vision concept that can be used to identify the location of joints of a human in a 2D image.
149
-
`Ultralytics<https://docs.ultralytics.com/tasks/pose/>`__ has published a Pose Estimation model based on `YOLO11 <https://docs.ultralytics.com/models/yolo11/>`__. This has been trained on the `COCO Dataset <https://cocodataset.org/#keypoints-2017>`__. This model can be used
150
-
for analyzing human pose for determining action or intent. The code below tries to export the YOLO11 Pose model with `batch_size=1`
148
+
**Automatic Speech Recognition**(ASR) is the use of machine learning to transcribe spoken language into text.
149
+
`Whisper <https://arxiv.org/abs/2212.04356>`__ is a Transformer based encoder-decoder model from OpenAI, which was trained on 680k hours of labelled data for ASR and speech translation.
150
+
The code below tries to export ``whisper-tiny`` model for ASR.
0 commit comments