Skip to content
This repository has been archived by the owner on Oct 31, 2022. It is now read-only.

avg stays in 2.6-2.9 range #86

Open
freedmann2 opened this issue Sep 8, 2021 · 0 comments
Open

avg stays in 2.6-2.9 range #86

freedmann2 opened this issue Sep 8, 2021 · 0 comments

Comments

@freedmann2
Copy link

I'm trying to finetune 355M, 744M models, but having issue with avg. it doesn't fall at all!
have tried learning_rate 0.00001, 0.00002 , 5e-5 - result is the same.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant