Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
1.38k
Follow
Microsoft
12.3k
Automatic Speech Recognition
Transformers
Safetensors
24 languages
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2503.01743
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
Community
74
Train
Use this model
Update README.md
#2
by
fasdfgaer
- opened
Feb 27
base:
refs/heads/main
←
from:
refs/pr/2
Discussion
Files changed
+1
-1
fasdfgaer
Feb 27
•
edited Feb 27
Corrected the typo "Audio Uniderstanding" to "Audio Understanding".
See translation
❤️
1
1
+
Update README.md
faf353bc
nguyenbh
changed pull request status to
merged
Feb 28
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment