File size: 628 Bytes
de0254f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
---
language: en
tags:
  - transcription
  - T5
  - huggingface
license: apache-2.0
datasets: custom
model_type: t5
---

# T5-based Audio Transcription Fusion Model

This model combines transcriptions from multiple sources separated by '/' to generate an optimal transcription. It is fine-tuned on a dataset where each sample has three candidate transcriptions and a reference transcription.

### Training Details
Model trained on 21000 samples for 10 epochs with T5-small as the base model.

Training Loss: 0.005756139289587736

### Evaluation Details
Test Loss: 0.011949276849159604
Word Error Rate (WER): 0.10040761999833625