zijianhu commited on
Commit
a1caa5d
1 Parent(s): 40bd0d7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -3
README.md CHANGED
@@ -1,3 +1,16 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ pipeline_tag: text-generation
6
+ ---
7
+
8
+ ## Model Card for Fox-1-1.6B
9
+
10
+ > [!IMPORTANT]
11
+ > This model is a base pretrained model which requires further finetuning for most use cases. We will release the instruction-tuned version soon.
12
+
13
+ Fox-1 is a decoder-only transformer-based small language model (SLM) with 1.6B total parameters developed by [TensorOpera AI](https://tensoropera.ai/). The model was trained with a 3-stage data curriculum on 3 trillion tokens of text and code data in 8K sequence length. Fox-1 uses grouped query attention (GQA) with 4 KV heads and 16 attention heads and has a deeper architecture than other SLMs.
14
+
15
+ For the full details of this model please read our [release blog post](https://blog.tensoropera.ai/tensoropera-unveils-fox-foundation-model-a-pioneering-open-source-slm-leading-the-way-against-tech-giants).
16
+