Replies: 3 comments 5 replies
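@hbsun2113 Regarding question 3: the term "degree of parallelism" has no precise definition. If you mean IPC (instructions per cycle), then you are right; loop unrolling is one of the classic ways to raise IPC.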
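To make the unroll point concrete, here is a minimal sketch of the transformation written out by hand. It is plain Python, so it only shows the shape of the code and will not show any IPC gain; the function names and the unroll factor of 4 are made up for illustration. Note that splitting the sum across several accumulators is unrolling plus reassociation, which is what actually breaks the dependency chain between iterations.

```python
def dot(a, b):
    """Reference loop: one multiply-add per iteration."""
    acc = 0.0
    for i in range(len(a)):
        acc += a[i] * b[i]
    return acc


def dot_unrolled4(a, b):
    """Same computation with the loop body unrolled by a factor of 4.

    The four accumulators do not depend on each other, so compiled code
    of this shape gives a superscalar CPU independent instructions it
    can issue in the same cycle.
    """
    n = len(a)
    acc0 = acc1 = acc2 = acc3 = 0.0
    main = n - n % 4  # largest multiple of 4 that fits in n
    for i in range(0, main, 4):
        acc0 += a[i] * b[i]
        acc1 += a[i + 1] * b[i + 1]
        acc2 += a[i + 2] * b[i + 2]
        acc3 += a[i + 3] * b[i + 3]
    acc = (acc0 + acc1) + (acc2 + acc3)
    for i in range(main, n):  # leftover iterations when n % 4 != 0
        acc += a[i] * b[i]
    return acc
```

For floating-point inputs the two versions can differ in the last bits, because the additions happen in a different order.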
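One more point: I've noticed that the order of parallel, unroll, and vectorize is generally fixed (each level nested inside the previous one: parallel outermost, unroll in the middle, vectorize innermost). My guess is that this is because their granularity gets progressively finer: parallel (thread level), unroll (statement level), vectorize (instruction level).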
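The nesting described above can be sketched as an element-wise add whose single loop is split into three levels. This is a plain-Python analogy, not TVM code: the thread pool stands in for parallel, the two hand-written calls stand in for unroll, and the slice operation stands in for one vector instruction. `VEC`, `UNROLL`, and all function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

VEC = 4     # hypothetical vector width (lanes per "instruction")
UNROLL = 2  # hypothetical unroll factor


def vadd(a, b, c, base):
    """Innermost level (vectorize): handle VEC contiguous elements at once."""
    c[base:base + VEC] = [x + y for x, y in zip(a[base:base + VEC],
                                                b[base:base + VEC])]


def worker(a, b, c, outer):
    """One iteration of the outermost loop, run on its own thread."""
    base = outer * UNROLL * VEC
    # Middle level (unroll): the loop over UNROLL is written out by hand.
    vadd(a, b, c, base)
    vadd(a, b, c, base + VEC)


def add(a, b):
    n = len(a)
    assert n % (UNROLL * VEC) == 0, "sketch assumes no remainder"
    c = [0] * n
    # Outermost level (parallel): independent chunks go to separate threads.
    with ThreadPoolExecutor() as pool:
        list(pool.map(lambda o: worker(a, b, c, o),
                      range(n // (UNROLL * VEC))))
    return c
```

Reading it from the outside in matches the guess about granularity: threads get large independent chunks, the unrolled body is a handful of statements, and each statement covers a few adjacent elements.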
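I'm not sure whether I missed something, but although the three operations parallel/unroll/vectorize come up frequently in the course and the assignments, I couldn't find an explanation of them or of how they differ (e.g. when to use each). I couldn't find one in the TVM documentation either. Could you help me tell the three apart?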

My own rough understanding: