README.md · Ramikan-BR/tinyllama-coder-py-4bit-v10 at 6aa01b941768334dde401bf5212993336e061dca


	---
	language:
	- en
	license: apache-2.0
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- llama
	- trl
	- sft
	- code
	- lora
	- peft
	base_model: unsloth/tinyllama-chat-bnb-4bit
	pipeline_tag: text-generation
	datasets: Ramikan-BR/data-oss_instruct-decontaminated_python.jsonl
	---

	# Uploaded model

	- Developed by: Ramikan-BR
	- Model type: [text-generation/Python Coder]
	- Language(s) (NLP): [en]
	- License: apache-2.0
	- Finetuned from model: unsloth/tinyllama-chat-bnb-4bit

	### Model Description

	<!-- Provide a longer summary of what this model is. -->

	### Training Data

	datasets: [Ramikan-BR/data-oss_instruct-decontaminated_python.jsonl](https://huggingface.co/datasets/Ramikan-BR/data-oss_instruct-decontaminated_python.jsonl)

	### Training Procedure

	The model was fine-tuned using [Unsloth](https://github.com/unslothai/unsloth). The dataset [ise-uiuc/Magicoder-OSS-Instruct-75K](https://huggingface.co/datasets/ise-uiuc/Magicoder-OSS-Instruct-75K/blob/main/data-oss_instruct-decontaminated.jsonl) was filtered to keep only the Python examples and then divided into 10 parts. Each part was used for one fine-tuning round of 2 epochs, with either the adafactor or the adamw_8bit optimizer (adafactor seemed to give lower loss).
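The data preparation described above can be sketched as follows. This is a minimal illustration, not the author's actual script: the `lang` field name, the contiguous in-order split, and the helper names `filter_python`, `split_into_parts`, and `training_rounds` are all assumptions, and the records are synthetic stand-ins for the real JSONL rows.

```python
# Sketch only: field names, helper names, and the split strategy are assumptions.

def filter_python(records, lang_key="lang"):
    """Keep only records whose language field equals 'python'."""
    return [r for r in records if r.get(lang_key) == "python"]


def split_into_parts(records, n_parts=10):
    """Split records into n_parts contiguous chunks whose sizes differ by at most one."""
    base, extra = divmod(len(records), n_parts)
    parts, start = [], 0
    for i in range(n_parts):
        size = base + (1 if i < extra else 0)
        parts.append(records[start:start + size])
        start += size
    return parts


def training_rounds(parts, epochs=2, optim="adafactor"):
    """Describe one fine-tuning round per part (2 epochs each, per the text above)."""
    return [
        {"part": i + 1, "examples": len(p), "epochs": epochs, "optim": optim}
        for i, p in enumerate(parts)
    ]


# Synthetic stand-in for the JSONL rows: half Python, half another language.
records = [
    {"lang": "python" if i % 2 == 0 else "java", "problem": f"p{i}", "solution": f"s{i}"}
    for i in range(50)
]

python_only = filter_python(records)
parts = split_into_parts(python_only)
rounds = training_rounds(parts)

print(len(python_only), [len(p) for p in parts])
```

Passing `optim="adamw_8bit"` to `training_rounds` would describe the alternative optimizer mentioned above. The actual training call (Unsloth with TRL) is not shown here.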

	### Model Sources

	- base_model: [unsloth/tinyllama-chat-bnb-4bit](https://huggingface.co/unsloth/tinyllama-chat-bnb-4bit)
	- model: [Ramikan-BR/tinyllama-coder-py-4bit-v10](https://huggingface.co/Ramikan-BR/tinyllama-coder-py-4bit-v10)
	- gguf_f16: [tinyllama-coder-py-4bit-v10-unsloth.F16.gguf](https://huggingface.co/Ramikan-BR/tinyllama-coder-py-4bit-v10/blob/main/tinyllama-coder-py-4bit-v10-unsloth.F16.gguf)
	- gguf_Q4_K_M: [tinyllama-coder-py-4bit-v10-unsloth.Q4_K_M.gguf](https://huggingface.co/Ramikan-BR/tinyllama-coder-py-4bit-v10/blob/main/tinyllama-coder-py-4bit-v10-unsloth.Q4_K_M.gguf)
	- gguf_Q8_0: [tinyllama-coder-py-4bit-v10-unsloth.Q8_0.gguf](https://huggingface.co/Ramikan-BR/tinyllama-coder-py-4bit-v10/blob/main/tinyllama-coder-py-4bit-v10-unsloth.Q8_0.gguf)
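As a rough sketch of how the GGUF files listed above might be fetched, the snippet below builds direct-download URLs and optionally downloads a file with `huggingface_hub`. The helper names `gguf_url` and `download_gguf` are hypothetical. The snippet assumes the files still live on the `main` branch and that the `huggingface_hub` package is installed if `download_gguf` is called.

```python
# Sketch only: helper names are hypothetical; file names are taken from the list above.

REPO_ID = "Ramikan-BR/tinyllama-coder-py-4bit-v10"

GGUF_FILES = {
    "F16": "tinyllama-coder-py-4bit-v10-unsloth.F16.gguf",
    "Q4_K_M": "tinyllama-coder-py-4bit-v10-unsloth.Q4_K_M.gguf",
    "Q8_0": "tinyllama-coder-py-4bit-v10-unsloth.Q8_0.gguf",
}


def gguf_url(quant, repo_id=REPO_ID):
    """Return the direct-download ('resolve') URL for one quantization."""
    filename = GGUF_FILES[quant]
    return f"https://huggingface.co/{repo_id}/resolve/main/{filename}"


def download_gguf(quant="Q4_K_M", repo_id=REPO_ID):
    """Download one GGUF file into the local Hub cache and return its path.

    Requires network access and the huggingface_hub package (imported lazily).
    """
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=repo_id, filename=GGUF_FILES[quant])


print(gguf_url("Q4_K_M"))
```

The downloaded file can then be passed to any GGUF-compatible runtime. Which quantization to pick depends on the available memory and the accepted quality trade-off.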

	This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

	This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)