tngtech
/

DeepSeek-R1T-Chimera

Text Generation

text-generation-inference

Model card Files Files and versions Community

rbrt commited on 5 days ago

Commit

a31f288

·

verified ·

1 Parent(s): 3df2c0b

Add benchmark plot to README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -20,11 +20,19 @@ pipeline_tag: text-generation
     <img alt="License" src="https://img.shields.io/badge/License-MIT-f5de53?&color=f5de53" style="display: inline-block; vertical-align: middle;"/>
   </a>
 </div>
 **Model merge of DeepSeek-R1 and DeepSeek-V3 (0324)**
 An open weights model combining the intelligence of R1 with the token efficiency of V3.
 ## Model Details
 - **Architecture**: DeepSeek-MoE Transformer-based language model

     <img alt="License" src="https://img.shields.io/badge/License-MIT-f5de53?&color=f5de53" style="display: inline-block; vertical-align: middle;"/>
   </a>
 </div>
+<br>
+<div align="center">
+  <a href="LICENSE" style="margin: 2px;">
+    <img alt="License" src="R1T-Chimera_Benchmarks_20250427_V1.jpg" style="display: inline-block; vertical-align: middle;"/>
+  </a>
+</div>
 **Model merge of DeepSeek-R1 and DeepSeek-V3 (0324)**
 An open weights model combining the intelligence of R1 with the token efficiency of V3.
 ## Model Details
 - **Architecture**: DeepSeek-MoE Transformer-based language model