[WIP] perf: common neural network workloads benchmarks#1026
Closed
[WIP] perf: common neural network workloads benchmarks#1026
Conversation
Member
|
@thazhemadam @staticfloat what's the easiest way to set this up so that benchmarks can choose a separate runner for GPUs? |
515068b to
3ae1755
Compare
Member
Author
|
Allowing different runners based on the jmd metadata would be a nice way. Something like ---
title: Simple Neural Networks
author: Avik Pal
backend: CUDA <--- If nothing present then use CPU. In future we can allow AMDGPU/Metal/etc with the same syntax
--- |
0a29e65 to
48aba8e
Compare
2c539db to
b88a3a0
Compare
Member
Author
|
Lux now matches SimpleChains in inference timings 😅. The ones where we fall behind are because Octavian is somewhat slow on EPYC machines so it is turned off |
3 tasks
Contributor
|
Superseded by #1530 which completes this work: rebased onto master, updated to latest package versions, added Reactant.jl (via @compile and TrainState API), JAX and PyTorch benchmarks (via PythonCall with Python-side timing), completed all benchmark sections (MLP relu/gelu, MLP+BN, LeNet, ResNet) with both inference and training, and configured GPU runner. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
currently add CPU versions. should be easy to extend to GPU once SciMLBenchmarks has GPU runners available.
TODOs
compilePR lands in Lux