---
base_model:
- Nexesenex/Llama_3.x_70b_L3.3_Dolphin_128K_v1.02
- migtissera/Tess-3-Llama-3.1-70B
- huihui-ai/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated
- WhiteRabbitNeo/Llama-3.1-WhiteRabbitNeo-2-70B
- mlabonne/Hermes-3-Llama-3.1-70B-lorablated
- hitachi-nlp/Llama-3.1-70B-FLDx2
library_name: transformers
tags:
- mergekit
- merge
---
# about

A pretty tame merge, good for SFW stuff.

---
# benchs

* ARC-C: 60.87
* ARC-E: 83.86
* PPL-512: 3.05

---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method, with [Nexesenex/Llama_3.x_70b_L3.3_Dolphin_128K_v1.02](https://huggingface.co/Nexesenex/Llama_3.x_70b_L3.3_Dolphin_128K_v1.02) as the base.

### Models Merged

The following models were included in the merge:
* [migtissera/Tess-3-Llama-3.1-70B](https://huggingface.co/migtissera/Tess-3-Llama-3.1-70B)
* [huihui-ai/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated](https://huggingface.co/huihui-ai/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated)
* [WhiteRabbitNeo/Llama-3.1-WhiteRabbitNeo-2-70B](https://huggingface.co/WhiteRabbitNeo/Llama-3.1-WhiteRabbitNeo-2-70B)
* [mlabonne/Hermes-3-Llama-3.1-70B-lorablated](https://huggingface.co/mlabonne/Hermes-3-Llama-3.1-70B-lorablated)
* [hitachi-nlp/Llama-3.1-70B-FLDx2](https://huggingface.co/hitachi-nlp/Llama-3.1-70B-FLDx2)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
merge_method: model_stock
models:
  - model: Nexesenex/Llama_3.x_70b_L3.3_Dolphin_128K_v1.02
    parameters:
      weight: 1.0
  - model: huihui-ai/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated
    parameters:
      weight: 1.0
  - model: mlabonne/Hermes-3-Llama-3.1-70B-lorablated
    parameters:
      weight: 1.0
  - model: hitachi-nlp/Llama-3.1-70B-FLDx2
    parameters:
      weight: 1.0
  - model: WhiteRabbitNeo/Llama-3.1-WhiteRabbitNeo-2-70B
    parameters:
      weight: 1.0
  - model: migtissera/Tess-3-Llama-3.1-70B
    parameters:
      weight: 1.0
base_model: Nexesenex/Llama_3.x_70b_L3.3_Dolphin_128K_v1.02
dtype: bfloat16
out_dtype: bfloat16
parameters:
  int8_mask: true
  normalize: true
  rescale: false
  filter_wise: false
  smooth: false
  allow_negative_weights: false
chat_template: auto
tokenizer:
  source: union
```
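---
# usage

A minimal loading sketch with `transformers`, in line with the card's `library_name`. The repository id below is a placeholder (this card does not state its own repo id), and `bfloat16` is chosen only to match the merge's `out_dtype`; adjust both to your setup.

```python
# Hedged sketch: load and chat with the merged 70B model.
# "your-namespace/this-merge" is a PLACEHOLDER repo id, not the real one.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "your-namespace/this-merge"  # placeholder: substitute the actual repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # matches out_dtype in the merge config
    device_map="auto",           # shard the ~70B parameters across available GPUs
)

messages = [{"role": "user", "content": "Summarize the Model Stock merge method."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

To reproduce the merge itself, the configuration above should be runnable with mergekit's CLI, e.g. `mergekit-yaml config.yaml ./output-dir`, given enough disk and memory for six 70B checkpoints.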
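---
# note on PPL-512

PPL-512 is read here as perplexity measured at a 512-token context; the card does not state the eval corpus or exact protocol, so the following is only a sketch of that style of measurement, assuming non-overlapping windows and the `model`/`tokenizer` objects from the usage section.

```python
# Hedged sketch: fixed-window perplexity at context length 512.
# Assumes non-overlapping windows; a sliding-window protocol would
# yield a somewhat lower (better) number on the same text.
import torch

@torch.no_grad()
def perplexity_512(model, tokenizer, text: str, window: int = 512) -> float:
    ids = tokenizer(text, return_tensors="pt").input_ids[0]
    total_nll, total_tokens = 0.0, 0
    for start in range(0, len(ids) - 1, window):
        chunk = ids[start : start + window].unsqueeze(0).to(model.device)
        out = model(chunk, labels=chunk)  # loss = mean NLL over predicted tokens
        n = chunk.shape[1] - 1            # the model predicts len(chunk) - 1 tokens
        total_nll += out.loss.item() * n
        total_tokens += n
    return float(torch.exp(torch.tensor(total_nll / total_tokens)))
```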