jgrosjean-mathesis
/

sentence-swissbert

Sentence Similarity

Inference Endpoints

Model card Files Files and versions Community

jgrosjean commited on Dec 18, 2023

Commit

25f607a

•

1 Parent(s): d8f35d4

Update README.md

Files changed (1) hide show

README.md +3 -52

README.md CHANGED Viewed

@@ -120,26 +120,9 @@ This model has been trained on news articles only. Hence, it might not perform a
 #### Training Hyperparameters
-- **Training regime:** python3 train_simcse_multilingual.py \
-  --seed 54699 \
-  --model_name_or_path zurichNLP/swissbert \
-  --train_file /srv/scratch2/grosjean/Masterarbeit/data_subsets \
-  --output_dir /srv/scratch2/grosjean/Masterarbeit/model \
-  --overwrite_output_dir \
-  --save_strategy no \
-  --do_train \
-  --num_train_epochs 1 \
-  --learning_rate 1e-5 \
-  --per_device_train_batch_size 4 \
-  --gradient_accumulation_steps 128 \
-  --max_seq_length 512 \
-  --overwrite_cache \
-  --pooler_type avg \
-  --pad_to_max_length \
-  --temp 0.05 \
-  --fp16 <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
-[More Information Needed]
 ## Evaluation
@@ -190,35 +173,3 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 ### Model Architecture and Objective
 [More Information Needed]
-## Citation [optional]
-<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
-**BibTeX:**
-[More Information Needed]
-**APA:**
-[More Information Needed]
-## Glossary [optional]
-<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
-[More Information Needed]
-## More Information [optional]
-[More Information Needed]
-## Model Card Authors [optional]
-[More Information Needed]
-## Model Card Contact
-[More Information Needed]

 #### Training Hyperparameters
+Number of epochs: 1
+Learning rate: 1e-5
+Batch size: 512
 ## Evaluation
 ### Model Architecture and Objective
 [More Information Needed]