There are a number of things to think about. The first is that the data have to get from disk to the GPU somehow. They will usually get there via RAM, though supercomputers can go directly from network to GPUs (and they have internal networks that are faster than disk access; it's usually the other way around if the data have to go through the internet).

Here's a good summary of the storage latency hierarchy: https://blog.codinghorror.com/the-infinite-space-between-words/

Whether you go from disk to RAM to GPU or directly from disk to GPU (such a thing would require special hardware), the bottleneck is going to be the disk (unless your operating system has already cached those parts of the di…
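To get a rough feel for the gap between reading from a file and moving bytes that are already in RAM, something like the sketch below may help. It is only an illustration, not a benchmark: the function name `measure_read_and_copy` and the 16 MB payload size are made up for this example, it uses only the Python standard library, and it does not touch a GPU at all. Note that the file has just been written, so the operating system has very likely cached it; the "file read" number therefore shows the cached case, and a cold read from a real disk would normally be slower.

```python
import os
import tempfile
import time


def measure_read_and_copy(n_bytes=16 * 1024 * 1024):
    """Return (seconds to read a file, seconds to copy the same bytes in RAM).

    Hypothetical helper for illustration only; the size is an arbitrary choice.
    """
    payload = os.urandom(n_bytes)

    # Write the payload to a temporary file so there is something to read back.
    with tempfile.NamedTemporaryFile(delete=False) as f:
        f.write(payload)
        path = f.name

    try:
        # Time reading the whole file into memory. Because the file was just
        # written, this read is probably served from the OS page cache.
        start = time.perf_counter()
        with open(path, "rb") as f:
            from_disk = f.read()
        read_seconds = time.perf_counter() - start

        # Time a pure in-memory copy of the same bytes.
        start = time.perf_counter()
        in_ram_copy = bytearray(from_disk)  # bytearray() forces a real copy
        copy_seconds = time.perf_counter() - start
    finally:
        os.remove(path)

    # Sanity check: the data survived the round trip.
    assert from_disk == payload and len(in_ram_copy) == n_bytes
    return read_seconds, copy_seconds


if __name__ == "__main__":
    n = 16 * 1024 * 1024
    read_s, copy_s = measure_read_and_copy(n)
    print(f"file read: {n / read_s / 1e6:.0f} MB/s")
    print(f"RAM copy:  {n / copy_s / 1e6:.0f} MB/s")
```

The numbers depend heavily on the machine and on whether the file is cached, so treat them as an order-of-magnitude check rather than a measurement of your actual pipeline.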

Answer selected by EnginEren
@jpivarski