concedo committed on
Commit
9d5955a
1 Parent(s): 29ffa5a

Update README.md

Files changed (1)
  1. README.md +40 -3
README.md CHANGED
@@ -1,3 +1,40 @@
- ---
- license: apache-2.0
- ---
+ ---
+ language:
+ - en
+ ---
+
+ <div align="center">
+
+ # KobbleSmall-2B
+ </div>
+
+ This is a fine-tune of Gemma2-2B trained on a subset of the Kobble Dataset.
+ Training took under 3 hours on a single NVIDIA T4 GPU using QLoRA (learning rate 1.5e-4, rank 16, alpha 16, batch size 2, gradient accumulation 4, 2048-token context).
+
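As a rough illustration of what the quoted hyperparameters imply, the sketch below derives the effective batch size, an upper bound on tokens per optimizer step, and the LoRA scaling factor. The dictionary keys are illustrative names, not taken from the author's training script, and the scaling assumes the conventional `alpha / rank` LoRA formulation.

```python
# Hyperparameters quoted in the README; the key names are illustrative only.
hparams = {
    "learning_rate": 1.5e-4,
    "lora_rank": 16,
    "lora_alpha": 16,
    "batch_size": 2,
    "gradient_accumulation_steps": 4,
    "context_length": 2048,
}

# Number of sequences contributing to each optimizer step.
effective_batch_size = hparams["batch_size"] * hparams["gradient_accumulation_steps"]

# Upper bound on tokens per optimizer step, reached only if every
# sequence fills the whole context window.
max_tokens_per_step = effective_batch_size * hparams["context_length"]

# Conventional LoRA scaling applied to the low-rank update (assumption:
# plain alpha / rank, not a rank-stabilized variant).
lora_scaling = hparams["lora_alpha"] / hparams["lora_rank"]

print(effective_batch_size, max_tokens_per_step, lora_scaling)
```

With rank and alpha both set to 16, the scaling factor comes out to 1.0, so the adapter update is applied unscaled under this assumption.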
+ You can obtain the GGUF quantization of this model here: https://huggingface.co/concedo/KobbleSmall-2B-GGUF
+
+
+ ## Dataset and Objectives
+
+ The Kobble Dataset is a semi-private aggregated dataset made from multiple online sources and web scrapes.
+ It contains content chosen and formatted specifically to work with KoboldAI software and Kobold Lite.
+
+ #### Dataset Categories:
+ - Instruct: Single-turn instruct examples presented in the Alpaca format, with an emphasis on uncensored and unrestricted responses.
+ - Chat: Two-participant roleplay conversation logs in the multi-turn raw chat format that KoboldAI uses.
+ - Story: Unstructured fiction excerpts, including literature containing various erotic and provocative content.
+
+ <!-- prompt-template start -->
+ ## Prompt template: Alpaca
+
+ ```
+ ### Instruction:
+ {prompt}
+
+ ### Response:
+ ```
+
+ <!-- prompt-template end -->
+
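A minimal sketch of filling in the Alpaca template above before sending text to the model. The helper name `build_alpaca_prompt` is hypothetical, and ending the prompt with a newline after `### Response:` is an assumption; the README does not say whether one is expected.

```python
# Template copied from the README; the trailing newline after
# "### Response:" is an assumption, not something the README specifies.
ALPACA_TEMPLATE = "### Instruction:\n{prompt}\n\n### Response:\n"


def build_alpaca_prompt(prompt: str) -> str:
    """Wrap a single-turn instruction in the Alpaca format (hypothetical helper)."""
    # Strip surrounding whitespace so the instruction sits directly
    # under the header line.
    return ALPACA_TEMPLATE.format(prompt=prompt.strip())


print(build_alpaca_prompt("Write a haiku about autumn."))
```

The returned string can be passed as the raw prompt to whichever inference frontend is in use; the model's completion is then expected to follow the `### Response:` header.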
+ **Note:** *No assurances will be provided about the **origins, safety, or copyright status** of this model, or of **any content** within the Kobble dataset.*
+ *If you belong to a country or organization that has strict AI laws or restrictions against unlabelled or unrestricted content, you are advised not to use this model.*