You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: decoder_native_transforms.md
+4-10Lines changed: 4 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -22,16 +22,10 @@ We want to support this user-facing API:
22
22
What the user is asking for, in English:
23
23
24
24
1. I want to decode frames from the file `"vid.mp4".`
25
-
2. For each decoded frame, I want each frame to pass through the following
26
-
transforms:
27
-
a. Add or remove frames as necessary to ensure a constant 30 frames
28
-
per second.
29
-
b. Resize the frame to 640x480. Use the algorithm that is
30
-
TorchVision's default.
31
-
c. Inside the resized frame, crop the image to 32x32. The x and y
32
-
coordinates are chosen randomly upon the creation of the Python
33
-
VideoDecoder object. All decoded frames use the same values for x
34
-
and y.
25
+
2. For each decoded frame, I want each frame to pass through the following transforms:
26
+
a. Add or remove frames as necessary to ensure a constant 30 frames per second.
27
+
b. Resize the frame to 640x480. Use the algorithm that is TorchVision's default.
28
+
c. Inside the resized frame, crop the image to 32x32. The x and y coordinates are chosen randomly upon the creation of the Python VideoDecoder object. All decoded frames use the same values for x and y.
35
29
36
30
These three transforms are instructive, as they force us to consider:
0 commit comments