- Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios • By quotientai and 3 others • 3 days ago • 18
- Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time • By rbrt and 4 others • Feb 18 • 31
- Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs • By Omartificial-Intelligence-Space • 4 days ago • 6
- Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment • By NormalUhr • Feb 11 • 27
- What is MoE 2.0? Update Your Knowledge about Mixture-of-experts • By Kseniase and 1 other • 8 days ago • 5
- Agent2Agent and MCP: An End-to-End Tutorial for a complete Agentic Pipeline • By tsadoq • 6 days ago • 4