-
FLAME: Factuality-Aware Alignment for Large Language Models
Paper • 2405.01525 • Published • 29 -
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
Paper • 2405.14333 • Published • 42 -
Transformers Can Do Arithmetic with the Right Embeddings
Paper • 2405.17399 • Published • 55 -
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture
Paper • 2405.18991 • Published • 12
Collections
Discover the best community collections!
Collections including paper arxiv:2508.01242
-
Tesslate/UIGEN-X-8B
Text Generation • 8B • Updated • 87 • • 58 -
Intelligent-Internet/II-Search-4B
Text Generation • 4B • Updated • 913 • 99 -
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh
Paper • 2508.01242 • Published • 9 -
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens
Paper • 2508.05305 • Published • 45
-
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes
Paper • 2411.00771 • Published • 9 -
SynCity: Training-Free Generation of 3D Worlds
Paper • 2503.16420 • Published • 26 -
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities
Paper • 2501.08983 • Published • 20 -
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
Paper • 2407.13759 • Published • 18
-
345
Qwen2.5 Omni 7B Demo
🏆Generate text and speech from audio, video, and text inputs
-
2.59k
F5-TTS
🗣F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
309
Kokoro TTS Zero
🎴✨[With v1.0.0] Accelerated TTS on Kokoro-82M
-
fixie-ai/ultravox-v0_5-llama-3_2-1b
Audio-Text-to-Text • 0.7B • Updated • 377k • 56
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 275 • 95 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 35 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 98 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 89
-
ReLearn: Unlearning via Learning for Large Language Models
Paper • 2502.11190 • Published • 30 -
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
Paper • 2502.11089 • Published • 165 -
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
Paper • 2502.11357 • Published • 10 -
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
Paper • 2503.12797 • Published • 32
-
FLAME: Factuality-Aware Alignment for Large Language Models
Paper • 2405.01525 • Published • 29 -
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
Paper • 2405.14333 • Published • 42 -
Transformers Can Do Arithmetic with the Right Embeddings
Paper • 2405.17399 • Published • 55 -
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture
Paper • 2405.18991 • Published • 12
-
Tesslate/UIGEN-X-8B
Text Generation • 8B • Updated • 87 • • 58 -
Intelligent-Internet/II-Search-4B
Text Generation • 4B • Updated • 913 • 99 -
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh
Paper • 2508.01242 • Published • 9 -
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens
Paper • 2508.05305 • Published • 45
-
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes
Paper • 2411.00771 • Published • 9 -
SynCity: Training-Free Generation of 3D Worlds
Paper • 2503.16420 • Published • 26 -
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities
Paper • 2501.08983 • Published • 20 -
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
Paper • 2407.13759 • Published • 18
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 275 • 95 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 35 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 98 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 89
-
345
Qwen2.5 Omni 7B Demo
🏆Generate text and speech from audio, video, and text inputs
-
2.59k
F5-TTS
🗣F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
309
Kokoro TTS Zero
🎴✨[With v1.0.0] Accelerated TTS on Kokoro-82M
-
fixie-ai/ultravox-v0_5-llama-3_2-1b
Audio-Text-to-Text • 0.7B • Updated • 377k • 56
-
ReLearn: Unlearning via Learning for Large Language Models
Paper • 2502.11190 • Published • 30 -
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
Paper • 2502.11089 • Published • 165 -
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
Paper • 2502.11357 • Published • 10 -
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
Paper • 2503.12797 • Published • 32