Skip to content
Discussion options

You must be logged in to vote

Hey!

I would recommend reading these docs:

For the techniques you mentioned, it really depends on what you are trying to do. Maybe https://jax-ml.github.io/scaling-book/training/ can help? This doc covers the techniques you were asking about (Data parallelism, FSDP and TP)

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@Biosins
Comment options

@yashk2810
Comment options

@Biosins
Comment options

Answer selected by Biosins
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants