concedo
/

KobbleSmall-2B

Model card Files Files and versions Community

concedo commited on Aug 5

Commit

9d5955a

•

1 Parent(s): 29ffa5a

Update README.md

Files changed (1) hide show

README.md +40 -3

README.md CHANGED Viewed

@@ -1,3 +1,40 @@
----
-license: apache-2.0
----

+---
+language:
+- en
+---
+<div align="center">
+# KobbleSmall-2B
+</div>
+This is a finetune of Gemma2-2B trained on a subset of the Kobble Dataset.
+Training was done in under 3 hours on a single NVIDIA T4 GPU with qLora (LR 1.5e-4, rank 16, alpha 16, batch size 2, gradient acc. 4, 2048 ctx).
+You can obtain the GGUF quantization of this model here: https://huggingface.co/concedo/KobbleSmall-2B-GGUF
+## Dataset and Objectives
+The Kobble Dataset is a semi-private aggregated dataset made from multiple online sources and web scrapes.
+It contains content chosen and formatted specifically to work with KoboldAI software and Kobold Lite.
+#### Dataset Categories:
+- Instruct: Single turn instruct examples presented in the Alpaca format, with an emphasis on uncensored and unrestricted responses.
+- Chat: Two participant roleplay conversation logs in a multi-turn raw chat format that KoboldAI uses.
+- Story: Unstructured fiction excerpts, including literature containing various erotic and provocative content.
+<!-- prompt-template start -->
+## Prompt template: Alpaca
+```
+### Instruction:
+{prompt}
+### Response:
+```
+<!-- prompt-template end -->
+**Note:** *No assurances will be provided about the **origins, safety, or copyright status** of this model, or of **any content** within the Kobble dataset.*
+*If you belong to a country or organization that has strict AI laws or restrictions against unlabelled or unrestricted content, you are advised not to use this model.*