Commits on Source (95)
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Tommotius authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Nani authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Nani authored
- Nani authored
- Nani authored
- Nani authored
- Nani authored
- Nani authored
- Riko Corwin Uphoff authored
  Todo. See merge request !1
- Nani authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
  Conflicts: args.py, main.py, scripts/shell/pretrain_7b.sh
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Tommotius authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Nani authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Tommotius authored
- Riko Corwin Uphoff authored
- Nani authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Nani authored
- Nani authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Nani authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Riko Corwin Uphoff authored
- Riko Corwin Uphoff authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
- Konstantin Fritzsch authored
Showing
- .gitignore 7 additions, 0 deletions
- .gitlab-ci.yml 1 addition, 1 deletion
- Dockerfile 12 additions, 0 deletions
- README.md 39 additions, 86 deletions
- args.py 38 additions, 0 deletions
- config/README.md 4 additions, 0 deletions
- config/galore_config.json 6 additions, 0 deletions
- config/llama_1b.json 20 additions, 0 deletions
- config/llama_350m.json 20 additions, 0 deletions
- config/llama_3b.json 20 additions, 0 deletions
- config/llama_60m.json 20 additions, 0 deletions
- config/llama_7b.json 20 additions, 0 deletions
- config/lora_config.json 7 additions, 0 deletions
- load_data.py 102 additions, 0 deletions
- load_lr_scheduler.py 32 additions, 0 deletions
- load_models.py 61 additions, 0 deletions
- load_optimizers.py 147 additions, 0 deletions
- logger.py 55 additions, 0 deletions
- main.py 160 additions, 0 deletions
- output/cola/output_finetuning_roberta_glue_cola_batch-size-32_adamw+lora_2025-04-15_03-13-34.csv 151 additions, 0 deletions
.gitignore
0 → 100644
args.py
0 → 100644
config/README.md
0 → 100644
config/galore_config.json
0 → 100644
config/llama_1b.json
0 → 100644
config/llama_350m.json
0 → 100644
config/llama_3b.json
0 → 100644
config/llama_60m.json
0 → 100644
config/llama_7b.json
0 → 100644
config/lora_config.json
0 → 100644
load_data.py
0 → 100644
load_lr_scheduler.py
0 → 100644
load_models.py
0 → 100644
load_optimizers.py
0 → 100644
logger.py
0 → 100644
main.py
0 → 100644