Skip to content
Snippets Groups Projects
Commit 996a27bf authored by Riko Corwin Uphoff's avatar Riko Corwin Uphoff
Browse files

Updated pretraining scripts

parent 9538fe00
No related branches found
No related tags found
No related merge requests found
Pipeline #25320 passed
......@@ -7,6 +7,7 @@ do
--mode pretraining \
--optimizer "$optimizer" \
--model llama_60m \
--dataset c4 \
--batch_size 512 \
--num_epochs 1 \
--num_training_tokens 1310000000 \
......
......@@ -7,6 +7,7 @@ do
--mode pretraining \
--optimizer "$optimizer" \
--model llama_7b \
--dataset c4 \
--batch_size 512 \
--num_epochs 1 \
--num_training_tokens 13100000 \
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment