Skip to content
Discussion options

You must be logged in to vote

@t0278611 no, I never implemented this, there was also a simplified SAM variant that used EMA, not sure if it was of the model weights or past gradients to approx the original SAM algorithm.. I tried that one at one point but couldn't improve past training runs.

Open to adding this if there's some evidence of success with hacked timm scripts and models in the image space ... I've run across a lot of paper ideas that I couldn't replicate over the years...

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@t0278611
Comment options

@t0278611
Comment options

Answer selected by t0278611
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Ideas
Labels
None yet
2 participants