HachiML
/

Llama-2-13b-hf-qlora-dolly-ja-2ep

Model card Files Files and versions Community

Llama-2-13b-hf-qlora-dolly-ja-2ep / README.md

HachiML's picture

Update README.md

9f2d2ae almost 2 years ago

|

1.03 kB

	---
	library_name: peft
	datasets:
	- HachiML/databricks-dolly-15k-ja-for-peft
	---
	## JGLUE Score
	We evaluated our model using the following JGLUE tasks. Here are the scores:

	\| Task \| Score \|
	\|----------------\|---------:\|
	\| JSQUAD(exact_match) \| 62.83 \|
	\| JCOMMONSENSEQA(acc) \| 75.78 \|
	\| JNLI(acc) \| 50.69 \|
	\| MARC_JA(acc) \| - \|
	\| \| \|
	\|----------------\|----------:\|
	\| Average \| average_score \|

	The JGLUE scores were measured using the following script:
	[Stability-AI/lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness/tree/jp-stable)

	## Training procedure


	The following `bitsandbytes` quantization config was used during training:
	- load_in_8bit: False
	- load_in_4bit: True
	- llm_int8_threshold: 6.0
	- llm_int8_skip_modules: None
	- llm_int8_enable_fp32_cpu_offload: False
	- llm_int8_has_fp16_weight: False
	- bnb_4bit_quant_type: nf4
	- bnb_4bit_use_double_quant: True
	- bnb_4bit_compute_dtype: float16
	### Framework versions


	- PEFT 0.4.0