metadata
license: mit
language:
  - en
  - vi
base_model:
  - BlossomsAI/BloomVN-8B-chat
pipeline_tag: text-generation
library_name: transformers
tags:
  - conversational

🌟 BloomVN-8B-Chat-Reasoning

A fine-tuned multilingual model for Vietnamese reasoning

NOTE

  • This model is a test version for our new training pipeline. It is NOT suitable for production use.
  • The full model will be released soon.

📋 Overview

  • A language model with reasoning capability that provides step-by-step reasoning in Vietnamese before delivering answers.
  • The model follows a structured XML format with explicit reasoning tags.
  • It's designed for educational applications and complex problem-solving tasks in Vietnamese.
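The tagged output described above can be split into its reasoning and answer parts with a small parser. The following is a minimal sketch only: this card does not list the actual tag names, so `reasoning` and `answer` are hypothetical placeholders, and `split_response` is an illustrative helper, not part of the model's tooling.

```python
import re

# Hypothetical tag names: the model card does not state the real ones.
REASONING_TAG = "reasoning"
ANSWER_TAG = "answer"


def split_response(text, reasoning_tag=REASONING_TAG, answer_tag=ANSWER_TAG):
    """Return (reasoning, answer) from a tagged response, or None if malformed."""
    r_tag = re.escape(reasoning_tag)
    a_tag = re.escape(answer_tag)
    pattern = re.compile(
        rf"<{r_tag}>(.*?)</{r_tag}>\s*<{a_tag}>(.*?)</{a_tag}>",
        re.DOTALL,  # let the reasoning span multiple lines
    )
    match = pattern.search(text)
    if match is None:
        return None
    return match.group(1).strip(), match.group(2).strip()


# Made-up sample response used only to exercise the parser.
sample = "<reasoning>2 + 3 = 5, sau đó 5 * 2 = 10.</reasoning>\n<answer>10</answer>"
parsed = split_response(sample)
```

If the model uses different (for example Vietnamese) tag names, pass them through the `reasoning_tag` and `answer_tag` arguments.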

🔧 Method

  • Fine-tuned using Group Relative Policy Optimization (GRPO) with Unsloth for hardware efficiency.
  • Employs rule-based reward functions to encourage adherence to the Vietnamese XML reasoning format.
  • Uses LoRA adaptation on a Vietnamese dataset spanning various task types.
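To illustrate how a rule-based format reward can feed GRPO, here is a rough sketch under stated assumptions. The tag names, the 0/1 scoring, and the helper names `format_reward` and `group_relative_advantages` are hypothetical. The actual reward functions used for this model are not published in this card. The normalization shown is the commonly described GRPO step of standardizing rewards within a group of completions sampled for the same prompt.

```python
import re
from statistics import mean, pstdev

# Hypothetical tag layout: the tags actually used in training are not given here.
FORMAT_PATTERN = re.compile(
    r"^\s*<reasoning>.+?</reasoning>\s*<answer>.+?</answer>\s*$",
    re.DOTALL,
)


def format_reward(completion: str) -> float:
    """Rule-based reward: 1.0 if the completion follows the tag layout, else 0.0."""
    return 1.0 if FORMAT_PATTERN.match(completion) else 0.0


def group_relative_advantages(rewards, eps=1e-6):
    """Standardize rewards within one group of completions for the same prompt."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps avoids division by zero when every completion gets the same reward.
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up group of completions sampled for a single prompt.
group = [
    "<reasoning>...</reasoning><answer>10</answer>",
    "10",
    "<reasoning>bước 1</reasoning>\n<answer>10</answer>",
    "<answer>10</answer>",
]
rewards = [format_reward(c) for c in group]
advantages = group_relative_advantages(rewards)
```

Completions that follow the format receive a positive advantage relative to the rest of their group, which is the signal that pushes the policy toward the tagged layout.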

💫 Quantization

Coming Soon!

🤝 Contributors

Developed with ❤️ by BlossomAI


Star ⭐️ this repo if you find it valuable!