Kushtrim
/

whisper-large-v3-turbo-shqip

Automatic Speech Recognition

Model card Files Files and versions Community

whisper-large-v3-turbo-shqip / README.md

Kushtrim's picture

Upload Whisper Large V3 Turbo

69f3fe8 verified 3 months ago

|

history blame contribute delete

1.53 kB

	---
	library_name: transformers
	language:
	- sq
	license: mit
	base_model: openai/whisper-large-v3-turbo
	datasets:
	- Kushtrim/audioshqip-200h
	metrics:
	- wer
	model-index:
	- name: Whisper Large v3 Turbo Shqip
	results:
	- task:
	type: automatic-speech-recognition
	name: Automatic Speech Recognition
	dataset:
	name: Audio Shqip 200 orë
	type: Kushtrim/audioshqip-200h
	args: 'config: sq, split: test'
	metrics:
	- type: wer
	value: 19.891368436098556
	name: Wer
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# Whisper Large V3 Turbo Shqip

	This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) specifically for the Albanian language, including the Gheg dialect. It was trained on a meticulously curated dataset comprising 200 hours of high-quality Albanian audio.

	## Key Features
	- Language Coverage: Supports standard Albanian as well as the Gheg dialect, ensuring robust transcription performance across regional variations.
	- Dataset: Fine-tuned on 200 hours of diverse and well-annotated Albanian audio data, capturing a wide range of accents, speech contexts, and domains.

	This model is optimized for automatic speech recognition (ASR) tasks in Albanian and can be used in applications such as transcription, subtitling, and real-time speech processing.