Neighbor List CPU #2697

npiroozan · 2023-07-26T02:35:39Z

I believe I have found the source of the performance issue on CPUs. It seems to be related to the neighbor list format. The conventional approach (when optimizing for GPUs) is to migrate from AoS (array of structures) to SoA (structure of arrays), but DeePMD seems to implement a novel neighbor list layout:

Migrate from AoS to SoA, then compress each element of the neighbor list into a 64-bit unsigned integer using the equation α(j) × 10^16 + (r_ij × 10^8) × 10^6 + j.
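To make the packing concrete, here is a minimal Python sketch of how I read that formula. It is not DeePMD's actual code: the function names `pack_neighbor` and `unpack_neighbor` are hypothetical, and I am assuming that α(j) is the integer type of neighbor j, that the distance is truncated to 8 decimal places, and that j is below 10^6. One consequence of this layout is that sorting the packed keys orders neighbors by type, then distance, then index.

```python
# Sketch only: field widths are read off the powers of ten in the formula above.
TYPE_SCALE = 10**16   # alpha(j) sits in the digits above 10^16
DIST_SCALE = 10**8    # distance is stored as fixed point with 8 decimals (assumed)
INDEX_SCALE = 10**6   # neighbor index j must be below 10^6 (assumed)
DIST_LIMIT = TYPE_SCALE // INDEX_SCALE  # fixed-point distance must be below 10^10


def pack_neighbor(atom_type: int, r_ij: float, j: int) -> int:
    """Pack (type, distance, index) into one key that fits in 64 unsigned bits."""
    dist_fixed = int(r_ij * DIST_SCALE)  # truncation is an assumption
    if not 0 <= j < INDEX_SCALE:
        raise ValueError("neighbor index does not fit in its field")
    if not 0 <= dist_fixed < DIST_LIMIT:
        raise ValueError("distance does not fit in its field")
    key = atom_type * TYPE_SCALE + dist_fixed * INDEX_SCALE + j
    if not 0 <= key < 2**64:
        raise ValueError("key does not fit in a 64-bit unsigned integer")
    return key


def unpack_neighbor(key: int) -> tuple[int, float, int]:
    """Recover (type, distance, index) from a packed key."""
    atom_type, rest = divmod(key, TYPE_SCALE)
    dist_fixed, j = divmod(rest, INDEX_SCALE)
    return atom_type, dist_fixed / DIST_SCALE, j


# Made-up neighbors as (type, distance, index); distances are chosen to be
# exactly representable so the round trip is exact.
neighbors = [(1, 2.5, 42), (0, 3.0, 7), (0, 1.25, 123456), (1, 1.25, 5)]
keys = sorted(pack_neighbor(*n) for n in neighbors)
decoded = [unpack_neighbor(k) for k in keys]
```

After the sort, `decoded` lists the type-0 neighbors first, nearest first, followed by the type-1 neighbors. A single integer sort therefore replaces a multi-key sort over separate arrays.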

In your paper, you state that this format is efficient on GPUs. For x86 CPU optimization, how would you recommend improving the neighbor list format?

Thank you very much.

npiroozan · 2023-07-27T00:05:56Z

Pardon me, I figured out the answer to this.
