gss1147 committed on
Commit
3c0868e
·
verified ·
1 Parent(s): 6c24a9c

Update README.md

Files changed (1)
  1. README.md +51 -35
README.md CHANGED
@@ -1,54 +1,70 @@
1
  ---
 
2
  base_model:
3
- - artificialguybr/LLAMA-3.2-1B-OpenHermes2.5
4
- - dphn/Dolphin3.0-Llama3.2-1B
5
- - meta-llama/Llama-3.2-1B-Instruct
6
  library_name: transformers
 
7
  tags:
8
- - mergekit
9
- - merge
 
 
 
 
 
10
  datasets:
11
- - OpenCoder-LLM/opc-sft-stage1
12
- - OpenCoder-LLM/opc-sft-stage2
13
- - microsoft/orca-agentinstruct-1M-v1
14
- - microsoft/orca-math-word-problems-200k
15
- - NousResearch/hermes-function-calling-v1
16
- - AI-MO/NuminaMath-CoT
17
- - AI-MO/NuminaMath-TIR
18
- - allenai/tulu-3-sft-mixture
19
- - HuggingFaceTB/smoltalk
20
- - m-a-p/CodeFeedback-Filtered-Instruction
21
- - m-a-p/Code-Feedback
22
- - teknium/OpenHermes-2.5
23
  ---
24
- # merged_20260322_234544
25
 
26
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
27
 
28
- ## Merge Details
29
- ### Merge Method
 
30
 
31
- This model was merged using the [SLERP](https://en.wikipedia.org/wiki/Slerp) merge method.
32
 
33
- ### Models Merged
 
 
 
 
 
34
 
35
- The following models were included in the merge:
36
- * dphn-Dolphin3.0-Llama3.2-1B
37
- * artificialguybr-LLAMA-3.2-1B-OpenHermes2.5
38
 
39
- ### Configuration
40
 
41
- The following YAML configuration was used to produce this model:
 
 
 
 
 
 
 
 
42
 
43
  ```yaml
44
  merge_method: slerp
45
- base_model: X:\Amalgamation AI Universal GUI\_prepared_models\artificialguybr-LLAMA-3.2-1B-OpenHermes2.5_1a7ec6f29c
46
  models:
47
- - model: X:\Amalgamation AI Universal GUI\_prepared_models\dphn-Dolphin3.0-Llama3.2-1B_a8e1ced6b4
48
-   parameters:
49
-     weight: 1.0
50
  dtype: float32
51
  parameters:
52
-   t: 0.5
53
-
54
- ```
 
1
  ---
2
+ license: other
3
  base_model:
4
+ - artificialguybr/LLAMA-3.2-1B-OpenHermes2.5
5
+ - dphn/Dolphin3.0-Llama3.2-1B
6
+ - meta-llama/Llama-3.2-1B-Instruct
7
  library_name: transformers
8
+ pipeline_tag: text-generation
9
  tags:
10
+ - llama
11
+ - mergekit
12
+ - merge
13
+ - slerp
14
+ - text-generation
15
+ - code
16
+ - instruct
17
  datasets:
18
+ - OpenCoder-LLM/opc-sft-stage1
19
+ - OpenCoder-LLM/opc-sft-stage2
20
+ - microsoft/orca-agentinstruct-1M-v1
21
+ - microsoft/orca-math-word-problems-200k
22
+ - NousResearch/hermes-function-calling-v1
23
+ - AI-MO/NuminaMath-CoT
24
+ - AI-MO/NuminaMath-TIR
25
+ - allenai/tulu-3-sft-mixture
26
+ - HuggingFaceTB/smoltalk
27
+ - m-a-p/CodeFeedback-Filtered-Instruction
28
+ - m-a-p/Code-Feedback
29
+ - teknium/OpenHermes-2.5
30
  ---
 
31
 
32
+ # Llama-3.2-HermesDolphin-Coder-1B
33
 
34
+ Llama-3.2-HermesDolphin-Coder-1B is a compact merged language model designed for general instruction following, coding assistance, and lightweight conversational use. It combines Hermes-style instruction tuning and Dolphin-style helpfulness into a small Llama 3.2 class model intended for experimentation, local workflows, and developer-oriented prompting.
35
+
36
+ This repository is a **merged model** created with **mergekit** using the **SLERP** merge method.
37
 
38
+ ## Model Summary
39
 
40
+ - **Model type:** Causal language model
41
+ - **Architecture:** LlamaForCausalLM
42
+ - **Primary use:** Text generation, instruction following, code-oriented prompting
43
+ - **Library:** Transformers
44
+ - **Merge method:** SLERP
45
+ - **Format:** Safetensors
46
 
47
+ ## Base Models
 
 
48
 
49
+ This merged model is based on:
50
 
51
+ - `artificialguybr/LLAMA-3.2-1B-OpenHermes2.5`
52
+ - `dphn/Dolphin3.0-Llama3.2-1B`
53
+ - `meta-llama/Llama-3.2-1B-Instruct`
54
+
55
+ ## Merge Details
56
+
57
+ The merge was produced with `mergekit` using SLERP with the interpolation parameter `t: 0.5`, the midpoint between the two merged models.
58
+
59
+ ### Merge configuration
60
 
61
  ```yaml
62
  merge_method: slerp
63
+ base_model: artificialguybr/LLAMA-3.2-1B-OpenHermes2.5
64
  models:
65
+ - model: dphn/Dolphin3.0-Llama3.2-1B
66
+   parameters:
67
+     weight: 1.0
68
  dtype: float32
69
  parameters:
70
+   t: 0.5
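
For readers unfamiliar with the `t: 0.5` setting in the configuration, the following is a minimal sketch of spherical linear interpolation (SLERP) on two flattened weight vectors. It is an illustration only, not mergekit's actual implementation: the function name `slerp`, the `eps` threshold, and the fallback to linear interpolation for near-parallel vectors are assumptions made for this example.

```python
import numpy as np


def slerp(t, v0, v1, eps=1e-8):
    """Illustrative SLERP between two 1-D weight vectors (hypothetical helper)."""
    v0 = np.asarray(v0, dtype=np.float64)
    v1 = np.asarray(v1, dtype=np.float64)

    n0 = np.linalg.norm(v0)
    n1 = np.linalg.norm(v1)
    if n0 < eps or n1 < eps:
        # A zero vector has no direction, so fall back to linear interpolation.
        return (1.0 - t) * v0 + t * v1

    # Angle between the two directions, computed on the normalized vectors.
    dot = np.clip(np.dot(v0 / n0, v1 / n1), -1.0, 1.0)
    omega = np.arccos(dot)
    sin_omega = np.sin(omega)
    if abs(sin_omega) < eps:
        # Nearly parallel or antiparallel: SLERP is ill-conditioned, use lerp.
        return (1.0 - t) * v0 + t * v1

    # Weights follow the arc between the two directions rather than the chord.
    w0 = np.sin((1.0 - t) * omega) / sin_omega
    w1 = np.sin(t * omega) / sin_omega
    return w0 * v0 + w1 * v1


a = np.array([1.0, 0.0])
b = np.array([0.0, 1.0])
mid = slerp(0.5, a, b)  # midpoint, analogous to t: 0.5 in the config
print(mid)  # approximately [0.7071 0.7071]
```

With `t = 0.5` both inputs contribute equally, and for the orthogonal unit vectors above the result keeps unit length, whereas a plain average would shrink it to about 0.71. A real merge applies this kind of interpolation per tensor across the two checkpoints.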