---
library_name: transformers
license: apache-2.0
base_model: Qwen/Qwen2.5-32B-Instruct
tags:
- llama-factory
- full
- generated_from_trainer
model-index:
- name: DCFT-Stratos-Unverified-114k-32B
results: []
---
<p align="center">
<img src="https://huggingface.co/datasets/open-thoughts/open-thoughts-114k/resolve/main/open_thoughts.png" width="50%">
</p>
# OpenThinker-32B-Unverified
This model is a fine-tuned version of [Qwen/Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) on the
[OpenThoughts-Unverified-173k](https://huggingface.co/datasets/open-thoughts/OpenThoughts-Unverified-173k) dataset.
Please see [OpenThinker-32B](https://huggingface.co/open-thoughts/OpenThinker-32B) for more information.
|Model Name|Dataset Size|AIME24 I/II|AIME25 I|MATH500|GPQA Diamond|LCBv2|
|---|---|---|---|---|---|---|
|[OpenThinker-7B](https://huggingface.co/open-thoughts/OpenThinker-7B)|114k|31.3|30.7|84.4|38.9|41.8|
|[OpenThinker-7B-Unverified](https://huggingface.co/open-thoughts/OpenThinker-7B-Unverified)|173k|34.0|29.3|83.0|39.4|43.8|
|[OpenThinker-32B](https://huggingface.co/open-thoughts/OpenThinker-32B)|114k|66.7|53.3|90.6|61.6|68.9|
|[OpenThinker-32B-Unverified](https://huggingface.co/open-thoughts/OpenThinker-32B-Unverified)|173k|60.7|44.0|90.0|60.6|69.2|
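Below is a minimal inference sketch using the `transformers` library. The model id comes from the link above; the dtype, device map, and prompt are illustrative assumptions, not settings documented on this card.

```python
# Minimal inference sketch (illustrative, not from the original card).
# Assumes enough GPU memory to shard a 32B model across available devices.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "open-thoughts/OpenThinker-32B-Unverified"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 to keep memory manageable
    device_map="auto",           # shard across available GPUs
)

# Qwen2.5-based models ship a chat template, so format the prompt with it.
messages = [{"role": "user", "content": "What is 7 * 8 + 12?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

Note that `max_new_tokens=512` is only a placeholder; long reasoning traces typically need a much larger generation budget.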
## Intended uses & limitations
This model is released under the Apache 2.0 license.
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training (see the sketch after this list):
- learning_rate: 1e-05
- train_batch_size: 1
- eval_batch_size: 8
- seed: 42
- distributed_type: multi-GPU
- num_devices: 384
- total_train_batch_size: 384
- total_eval_batch_size: 3072
- optimizer: adamw_torch (betas=(0.9, 0.999), epsilon=1e-08)
- lr_scheduler_type: cosine
- lr_scheduler_warmup_ratio: 0.1
- num_epochs: 3.0
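For reference, the list above maps onto `transformers.TrainingArguments` roughly as sketched below. This is an illustration of the listed values, not the actual LLaMA-Factory configuration used for this run; the output path and bf16 setting are assumptions.

```python
# Illustrative mapping of the listed hyperparameters onto TrainingArguments.
# With 384 devices and per-device batch size 1, the effective train batch
# size is 384; per-device eval batch size 8 gives the total of 3072.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="openthinker-32b-unverified",  # hypothetical path
    learning_rate=1e-5,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=8,
    seed=42,
    optim="adamw_torch",
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    num_train_epochs=3.0,
    bf16=True,  # assumption: mixed precision is typical for 32B full fine-tuning
)
```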
### Training results

See the evaluation table above for benchmark results.
### Framework versions
- Transformers 4.46.1
- PyTorch 2.5.0a0+b465a5843b.nv24.09
- Datasets 3.0.2
- Tokenizers 0.20.3