--- license: mit language: - en base_model: - ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition --- Speech Emotion Recognition - 6-Class Classifier This model is a fine-tuned version of ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition, specifically designed to classify emotions in English speech. ๐Ÿง  Emotion Classes The model predicts one of the following six emotions: Happy Angry Disgust Fearful Neutral Sad ๐Ÿ“Š Dataset The model was trained on the Speech Emotion Recognition dataset from Kaggle: ๐Ÿ”— https://www.kaggle.com/datasets/kevinignatiuswijaya/speech-emotion-recognition-dl ๐ŸŽฏ Accuracy Achieved an accuracy of 84% on the test set. ๐Ÿ”ง Base Model Fine-tuned from the pretrained model: ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition # Load model and feature extractor model = Wav2Vec2ForSequenceClassification.from_pretrained("your-username/your-model-name") extractor = Wav2Vec2FeatureExtractor.from_pretrained("your-username/your-model-name") # Create pipeline classifier = pipeline("audio-classification", model=model, feature_extractor=extractor) # Predict emotion result = classifier("path/to/audio.wav") print(result) ๐Ÿงช Applications This model can be used for: Emotion-aware virtual assistants Mental health monitoring tools Human-computer interaction research Call center emotion analytics ๐Ÿ“ License Ensure compliance with the licenses for both the Kaggle dataset and the pretrained model used.