--- base_model: [] library_name: transformers tags: - mergekit - merge ---

L3-SMB-Instruct-12.2B-F32

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the passthrough merge method. ### Models Merged The following models were included in the merge: * G:/7B/L3-SthenoMaidBlackroot-8B-V1 * G:/7B/Meta-Llama-3-8B-Instruct ### Configuration The following YAML configuration was used to produce this model: ```yaml slices: - sources: - model: G:/7B/Meta-Llama-3-8B-Instruct layer_range: [0, 12] - sources: - model: G:/7B/L3-SthenoMaidBlackroot-8B-V1 layer_range: [6, 19] parameters: scale: - filter: o_proj value: 1 - filter: down_proj value: 1 - value: 1 - sources: - model: G:/7B/Meta-Llama-3-8B-Instruct layer_range: [12, 18] parameters: scale: - filter: o_proj value: .5 - filter: down_proj value: .5 - value: 1 - sources: - model: G:/7B/Meta-Llama-3-8B-Instruct layer_range: [18, 25] parameters: scale: - filter: o_proj value: .75 - filter: down_proj value: .75 - value: 1 - sources: - model: G:/7B/L3-SthenoMaidBlackroot-8B-V1 layer_range: [19, 32] parameters: scale: - filter: o_proj value: 1 - filter: down_proj value: 1 - value: 1 merge_method: passthrough dtype: float32 ```