jeiku committed on
Commit 0a92056
1 parent: 1453080

Update README.md

Files changed (1)
  1. README.md +5 -35
README.md CHANGED
@@ -4,42 +4,12 @@ library_name: transformers
  tags:
  - mergekit
  - merge
-
+ license: other
+ language:
+ - en
  ---
  # Persephone

- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the SLERP merge method.
-
- ### Models Merged
-
- The following models were included in the merge:
- * MonaTrixWestlake
- * Experiment26Krishna
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/626dfb8786671a29c715f8a9/aOnBmqHJQfOFEIgqD_JCz.jpeg)

- ```yaml
- slices:
- - sources:
-   - model: Experiment26Krishna
-     layer_range: [0, 32]
-   - model: MonaTrixWestlake
-     layer_range: [0, 32]
- merge_method: slerp
- base_model: MonaTrixWestlake
- parameters:
-   t:
-   - filter: self_attn
-     value: [0, 0.5, 0.3, 0.7, 1]
-   - filter: mlp
-     value: [1, 0.5, 0.7, 0.3, 0]
-   - value: 0.5
- dtype: bfloat16
- ```
+ After being in a bit of a rut, I decided to take a radically different approach to produce something new and exciting. It seems to have worked out. I hope you enjoy!
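
For readers who want to know what the configuration removed in this commit described: it named SLERP as the merge method and gave the interpolation factor `t` as short lists such as `[0, 0.5, 0.3, 0.7, 1]` for `self_attn`. The sketch below is not mergekit's code. It is a minimal pure-Python illustration under two assumptions: that a list of `t` values is stretched linearly across the 32 layers, and that `t = 0` keeps the base model's tensor while `t = 1` takes the other model's. The helper names `layer_t` and `slerp` are hypothetical.

```python
import math


def layer_t(gradient, layer_idx, num_layers):
    """Assumed behavior: linearly interpolate a list of anchor values
    so that each layer gets its own t."""
    if len(gradient) == 1 or num_layers == 1:
        return float(gradient[0])
    # Map the layer index onto the anchor positions 0 .. len(gradient) - 1.
    pos = layer_idx / (num_layers - 1) * (len(gradient) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(gradient) - 1)
    frac = pos - lo
    return gradient[lo] * (1 - frac) + gradient[hi] * frac


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors.
    v0 stands in for the base model's tensor (assumption: t = 0 returns v0)."""
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    if norm0 < eps or norm1 < eps:
        # Degenerate input: fall back to plain linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))
    if abs(dot) > 1 - 1e-6:
        # Nearly parallel vectors: the angle is too small for a stable slerp.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    theta = math.acos(dot)
    sin_theta = math.sin(theta)
    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


# Values taken from the removed config; the two vectors are toy stand-ins.
self_attn_gradient = [0, 0.5, 0.3, 0.7, 1]
for layer in (0, 15, 31):
    t = layer_t(self_attn_gradient, layer, 32)
    print(layer, round(t, 3), slerp(t, [1.0, 0.0], [0.0, 1.0]))
```

Under these assumptions, the `self_attn` and `mlp` lists mirror each other, so the attention weights move from one parent toward the other across depth while the MLP weights move the opposite way, and every other tensor uses the fixed `0.5`.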