Actions: NVIDIA/TensorRT
1,140 workflow runs
trt.IStreamReader (as implemented e.g. in polygraphy) requires higher peak CPU memory and more time than naive python implementation.
Blossom-CI #6820: Issue comment #4327 (comment) created by pranavm-nvidia
F.scaled_dot_product_attention on GPU L4
Blossom-CI #6815: Issue comment #4333 (comment) created by ohadravid
F.scaled_dot_product_attention on GPU L4
Blossom-CI #6813: Issue comment #4333 (comment) created by kevinch-nv
F.scaled_dot_product_attention on GPU L4
Blossom-CI #6811: Issue comment #4333 (comment) created by kevinch-nv