PMAA: A Progressive Multi-scale Attention Autoencoder Model for High-Performance Cloud Removal from Multi-temporal Satellite Imagery Paper • 2303.16565 • Published Mar 29, 2023 • 1
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models Paper • 2505.16211 • Published May 22, 2025 • 18
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline Paper • 2505.19314 • Published May 25, 2025 • 4
SepPrune: Structured Pruning for Efficient Deep Speech Separation Paper • 2505.12079 • Published May 17, 2025
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix Paper • 2505.13032 • Published May 19, 2025 • 2
Advances in Speech Separation: Techniques, Challenges, and Future Trends Paper • 2508.10830 • Published Aug 2025 • 13
Apollo: Band-sequence Modeling for High-Quality Audio Restoration Paper • 2409.08514 • Published Sep 13, 2024 • 12