learn_jax/t5_model
Richard Wong a817fe16cc Feat: increased learning rate for effective large batch size learning 2024-09-22 22:28:41 +09:00
..
.gitignore Feat: increased learning rate for effective large batch size learning 2024-09-22 22:28:41 +09:00
configuration_t5.py Feat: increased learning rate for effective large batch size learning 2024-09-22 22:28:41 +09:00
modeling_t5_flax.py Feat: increased learning rate for effective large batch size learning 2024-09-22 22:28:41 +09:00