AmirMohseni committed
Commit 41f8c8d • 1 Parent(s): 6c63a6b
Update README.md

README.md CHANGED
@@ -1,19 +1,20 @@
 ---
 library_name: transformers
-tags:
-
-
-
-
-
-
-
-
 language: en
 license: apache-2.0
 datasets:
-
 model_name: SmolLM-360M-Instruct-finetuned-sft
 ---
 
 # Model Card for `SmolLM-360M-Instruct-finetuned-sft`
|
@@ -36,6 +37,18 @@ The `SmolLM-360M-Instruct-finetuned-sft` model is a compact language model part
 
 - **Repository:** [SmolLM-360M-Instruct-finetuned-sft on Hugging Face](https://huggingface.co/AmirMohseni/SmolLM-360M-Instruct-finetuned-sft)
 
39 |
## Uses
|
40 |
|
41 |
### Direct Use
|
 ---
 library_name: transformers
+tags:
+- language-model
+- fine-tuned
+- instruction-following
+- SmolLM
+- HelpSteer2
+- NVIDIA
+- A100
+- English
 language: en
 license: apache-2.0
 datasets:
+- nvidia/HelpSteer2
 model_name: SmolLM-360M-Instruct-finetuned-sft
+pipeline_tag: text-generation
 ---
 
 # Model Card for `SmolLM-360M-Instruct-finetuned-sft`
 
 - **Repository:** [SmolLM-360M-Instruct-finetuned-sft on Hugging Face](https://huggingface.co/AmirMohseni/SmolLM-360M-Instruct-finetuned-sft)
 
+## Performance Improvements After Fine-Tuning
+
+The fine-tuning process was evaluated using the NVIDIA Nemotron-4-340B-Reward model, which scores AI-generated responses on five attributes: helpfulness, correctness, coherence, complexity, and verbosity. According to this reward model, fine-tuning produced the following changes:
+
+- **Helpfulness:** increased from **0.413** to **0.576**.
+- **Correctness:** increased from **0.521** to **0.829**.
+- **Coherence:** slight decrease from **2.424** to **2.411**.
+- **Complexity:** decreased from **1.048** to **0.881**.
+- **Verbosity:** decreased from **1.348** to **1.040**.
+
+These results indicate that fine-tuning improved the model's ability to generate helpful and correct responses while making the outputs somewhat less complex and verbose. The decrease in coherence is minimal, suggesting that the overall logical consistency of the responses remains strong.
+
 ## Uses
 
 ### Direct Use
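As a quick sanity check on the evaluation numbers this commit adds, the sketch below (illustrative only, not part of the repository) recomputes the absolute and percentage change for each reward attribute from the before/after scores quoted in the diff:

```python
# Before/after attribute scores from the Nemotron-4-340B-Reward
# evaluation quoted in the model card.
before = {
    "helpfulness": 0.413,
    "correctness": 0.521,
    "coherence": 2.424,
    "complexity": 1.048,
    "verbosity": 1.348,
}
after = {
    "helpfulness": 0.576,
    "correctness": 0.829,
    "coherence": 2.411,
    "complexity": 0.881,
    "verbosity": 1.040,
}

def score_changes(before, after):
    """Return {attribute: (absolute_delta, percent_delta)} per score."""
    changes = {}
    for name, old in before.items():
        delta = after[name] - old
        changes[name] = (round(delta, 3), round(100 * delta / old, 1))
    return changes

for name, (delta, pct) in score_changes(before, after).items():
    direction = "up" if delta > 0 else "down"
    print(f"{name:12s} {direction} {abs(delta):.3f} ({pct:+.1f}%)")
```

Run this way, helpfulness rises about 39% and correctness about 59%, while the drops in complexity and verbosity are consistent with the card's claim of shorter, simpler outputs.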
|