whisper-webui-translate

SoybeanMilk committed 7 days ago

Commit f6806bf (verified)

1 Parent(s): 45ab1d3

Add support for the Whisper model MediaTek-Research/Breeze-ASR-25. (#8)

- Add your change (8892fec9e2b48b629d96f455846ae299d86a74eb)

Co-authored-by: SoybeanMilkGood <SoybeanMilk@users.noreply.huggingface.co>

Files changed (3)

config.json5 CHANGED

@@ -38,6 +38,11 @@
       {
         "name": "large-v3-turbo",
         "url": "large-v3-turbo"
+      },
+      {
+        "name": "Breeze-ASR-25",
+        "url": "SoybeanMilk/faster-whisper-Breeze-ASR-25",
+        "type": "huggingface"
       }
       // Uncomment to add custom Japanese models
       //{
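
Review note: the new entry carries a display name, a Hugging Face repository id, and a `type` field that the neighbouring `large-v3-turbo` entry omits. A minimal sketch of an exact-name lookup over such entries follows. It assumes the entries have already been parsed into Python dicts; `MODELS` and `find_model` are hypothetical names for illustration, and the repository's actual config loader is not shown in this diff.

```python
from typing import Optional

# Assumed shape: mirrors the two objects visible in the hunk above.
# "type" is treated as optional because the large-v3-turbo entry omits it.
MODELS = [
    {"name": "large-v3-turbo", "url": "large-v3-turbo"},
    {
        "name": "Breeze-ASR-25",
        "url": "SoybeanMilk/faster-whisper-Breeze-ASR-25",
        "type": "huggingface",
    },
]


def find_model(models: list[dict], name: str) -> Optional[dict]:
    """Return the first entry whose "name" matches exactly, else None."""
    return next((m for m in models if m["name"] == name), None)


entry = find_model(MODELS, "Breeze-ASR-25")
print(entry["url"] if entry else "not found")
```

Under this sketch the match is case-sensitive, so the name has to be spelled exactly as it appears in the config entry.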

docs/options.md CHANGED

@@ -1,4 +1,4 @@
-# Standard Options
+# Standard Options
 To transcribe or translate an audio file, you can either copy an URL from a website (all [websites](https://github.com/yt-dlp/yt-dlp/blob/master/supportedsites.md)
 supported by YT-DLP will work, including YouTube). Otherwise, upload an audio file (choose "All Files (*.*)"
 in the file selector to select any file type, including video files) or use the microphone.
@@ -18,6 +18,7 @@ Select the model that Whisper will use to transcribe the audio:
 | large-v2  | 1550 M     | N/A                | large              | ~10 GB        | 1x             |
 | large-v3  | 1550 M     | N/A                | large              | ~10 GB        | 1x             |
 | turbo     | 809 M      | N/A                | turbo              | ~6 GB         | 8x             |
+| breeze-asr-25 | 1550 M | N/A                | breeze-asr-25      | ~10 GB        | 1x             |
 ## Language

src/whisper/fasterWhisperContainer.py CHANGED

@@ -47,8 +47,10 @@ class FasterWhisperContainer(AbstractWhisperContainer):
             if model_url == "large":
                 # large is an alias for large-v1
                 model_url = "large-v1"
-            elif model_url == "large-v3-turbo":
+            if model_url == "large-v3-turbo":
                 model_url = "deepdml/faster-whisper-large-v3-turbo-ct2"
+            elif model_url == "Breeze-ASR-25":
+                model_url = "SoybeanMilk/faster-whisper-Breeze-ASR-25"
         device = self.device
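
Review note: the branch chain in this hunk rewrites three aliases to concrete model identifiers. The same mapping can be sketched as a lookup table, which makes the matching rules easier to see. This is an illustrative alternative only, not the code in the commit; `MODEL_ALIASES` and `resolve_model_url` are hypothetical names.

```python
# Hypothetical helper, not part of the repository: it mirrors the alias
# rewrites visible in the hunk above as a single dictionary lookup.
MODEL_ALIASES = {
    "large": "large-v1",
    "large-v3-turbo": "deepdml/faster-whisper-large-v3-turbo-ct2",
    "Breeze-ASR-25": "SoybeanMilk/faster-whisper-Breeze-ASR-25",
}


def resolve_model_url(model_url: str) -> str:
    """Return the target for a known alias, or the input unchanged."""
    return MODEL_ALIASES.get(model_url, model_url)


print(resolve_model_url("Breeze-ASR-25"))
print(resolve_model_url("breeze-asr-25"))  # no alias matches; returned as-is
```

As in the committed comparison, the match is an exact, case-sensitive string comparison: the lowercase spelling `breeze-asr-25` used in the docs/options.md table row would not hit the `Breeze-ASR-25` alias.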