Real-Time On-Device AI Agent with Polaris-4B: Run It Yourself, No Cloud, No Cost
We just deployed a real-time on-device AI agent using the Polaris-4B-Preview model, one of the top-performing open LLMs under 6B parameters on Hugging Face.
What's remarkable? This model runs entirely on a mobile device, with no cloud and no manual optimization. It was built using ZETIC.MLange, and the best part?
It's fully automated, free to use, and anyone can do it. You don't need to write deployment code, tweak backends, or touch device-specific SDKs. Just upload your model, and ZETIC.MLange handles the rest.
About the Model
- Model: Polaris-4B-Preview
- Size: ~4B parameters
- Ranking: top 3 on the Hugging Face LLM Leaderboard (<6B)
- Inference: token-incremental (streaming) generation supported
- Modifications: none; stock weights, just optimized for mobile
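Here, "token-incremental inference" means streaming generation: tokens are emitted one at a time rather than after the full completion. A minimal desktop sketch of that pattern with the stock weights via transformers; the Hub id POLARIS-Project/Polaris-4B-Preview is an assumption, so adjust it if the org name differs:

```python
# Minimal streaming-generation sketch with stock weights (desktop reference,
# not the on-device runtime). The Hub id below is an assumption.
from threading import Thread
from transformers import AutoModelForCausalLM, AutoTokenizer, TextIteratorStreamer

model_id = "POLARIS-Project/Polaris-4B-Preview"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tok("What can a 4B model do on a phone?", return_tensors="pt").to(model.device)
streamer = TextIteratorStreamer(tok, skip_prompt=True, skip_special_tokens=True)
Thread(target=model.generate, kwargs=dict(**inputs, streamer=streamer, max_new_tokens=64)).start()

for piece in streamer:
    print(piece, end="", flush=True)  # tokens arrive incrementally, as on-device
```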
What ZETIC.MLange Does
ZETIC.MLange is a fully automated deployment framework for on-device AI, built for AI engineers who want to focus on models, not infrastructure.
Here's what it does in minutes:
- Analyzes model structure
- Converts to a mobile-optimized format (e.g., GGUF, ONNX); see the sketch after this list
- Generates a runnable runtime environment with pre/post-processing
- Targets real mobile hardware (CPU, GPU, NPU, including Qualcomm, MediaTek, and Apple)
- Gives you a downloadable SDK or mobile app component, ready to run
And yes, this is available now, for free, at https://mlange.zetic.ai
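As a rough stand-in for what the generated runtime does, a GGUF export can be exercised on a workstation with llama-cpp-python before shipping to a device. The file name below is hypothetical, and this is not the ZETIC.MLange SDK itself:

```python
# Load a (hypothetical) GGUF export and stream tokens, mimicking the
# token-incremental loop a mobile runtime runs on-device.
from llama_cpp import Llama  # pip install llama-cpp-python

llm = Llama(model_path="polaris-4b-preview-q4_k_m.gguf", n_ctx=2048, verbose=False)
for chunk in llm("Summarize on-device AI in one line.", max_tokens=64, stream=True):
    print(chunk["choices"][0]["text"], end="", flush=True)
```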
For AI Engineers Like You
If you want to:
- Test LLMs directly on-device
- Run models offline with no network latency
- Avoid cloud GPU costs
- Deploy to mobile without writing app-side inference code
Then this is your moment. You can do exactly what we did, using your own models, all in a few clicks.
We've merged the LightGlue keypoint matcher into Hugging Face transformers! It allows commercial use when paired with an open-source keypoint detector.
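A minimal usage sketch, assuming the ETH-CVG/lightglue_superpoint checkpoint and that the processor exposes the same post_process_keypoint_matching helper transformers already uses for SuperGlue:

```python
# Match keypoints between two views with LightGlue via transformers.
# Checkpoint id and post-processing call follow the SuperGlue pattern.
import torch
from PIL import Image
from transformers import AutoImageProcessor, AutoModel

processor = AutoImageProcessor.from_pretrained("ETH-CVG/lightglue_superpoint")
model = AutoModel.from_pretrained("ETH-CVG/lightglue_superpoint")

images = [Image.open("view_a.jpg"), Image.open("view_b.jpg")]  # any image pair
inputs = processor(images, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Keep only confident matches between the two views.
sizes = [[(im.height, im.width) for im in images]]
matches = processor.post_process_keypoint_matching(outputs, sizes, threshold=0.2)
print(len(matches[0]["keypoints0"]), "matched keypoints")
```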
SmolAgents v1.19.0 is live! This release brings major improvements to agent flexibility, UI usability, streaming architecture, and developer experience, making it easier than ever to build smart, interactive AI agents. Here's what's new:
Agent Upgrades
- Support for managed agents in ToolCallingAgent (see the sketch below)
- Context manager support for cleaner agent lifecycle handling
- Output formatting now uses XML tags for consistency
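A sketch of the first two items, assuming the documented smolagents constructor arguments (managed agents need a name and description so the manager can route to them):

```python
# Managed agents inside a ToolCallingAgent, run via the new context manager.
from smolagents import ToolCallingAgent, InferenceClientModel, WebSearchTool

model = InferenceClientModel()
web_agent = ToolCallingAgent(
    tools=[WebSearchTool()], model=model,
    name="web_search", description="Searches the web on the manager's behalf.",
)
manager = ToolCallingAgent(tools=[], model=model, managed_agents=[web_agent])

with manager:  # lifecycle handling: resources are cleaned up on exit
    print(manager.run("Find the latest smolagents release notes."))
```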
UI Enhancements
- GradioUI now supports reset_agent_memory: perfect for fresh starts in dev and demos (one-liner below)
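In practice that flag is a one-liner; the argument name is taken straight from these release notes:

```python
# Serve an agent with Gradio; memory resets for a fresh start each session.
from smolagents import CodeAgent, GradioUI, InferenceClientModel

agent = CodeAgent(tools=[], model=InferenceClientModel())
GradioUI(agent, reset_agent_memory=True).launch()
```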
Streaming Refactor
- Streaming event aggregation moved off the Model class, for better architecture and maintainability
Output Tracking
- CodeAgent outputs are now stored in ActionStep, adding more visibility and structure to agent decisions (inspection sketch below)
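A sketch of inspecting those stored outputs after a run; the memory.steps and action_output attribute names follow smolagents' memory docs and are assumptions beyond what the release notes state:

```python
# After a run, each ActionStep in agent memory carries that step's output.
from smolagents import CodeAgent, InferenceClientModel

agent = CodeAgent(tools=[], model=InferenceClientModel())
agent.run("What is 7 * 6?")

for step in agent.memory.steps:
    output = getattr(step, "action_output", None)  # assumed attribute name
    if output is not None:
        print(type(step).__name__, "->", output)
```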
Bug Fixes
- Smarter planning logic
- Cleaner Docker logs
- Better prompt formatting for additional_args
- Safer internal functions and final answer matching
Docs Improvements
- Added quickstart examples with tool usage
- One-click Colab launch buttons
- Expanded reference docs (AgentMemory, GradioUI docstrings)
- Fixed broken links and migrated to .md format
Hugging Face just wrapped 4 months of deep work with AMD to push kernel-level optimization on their MI300X GPUs. Now, it's time to share everything we learned.
Join us in Paris at STATION F for a hands-on weekend of workshops and a hackathon focused on making open-source LLMs faster and more efficient on AMD.
Prizes, amazing speakers, and more. For the full details, head to https://lu.ma/fmvdjmur!
I've been running small language models (SLLMs) directly on smartphones: completely offline, with no cloud backend or server API calls.
I wanted to share:
1. Tokens/sec performance across several SLLMs
2. Observations on hardware utilization (where the workload actually runs)
3. Trade-offs between model size, latency, and feasibility for mobile apps
There are reports for the following models (a sketch of how tokens/sec can be measured follows the list):
- Qwen3 0.6B
- NVIDIA Nemotron Qwen 1.5B
- SimpleScaling S1
- TinyLlama
- Unsloth-tuned Llama 3.2 1B
- Naver HyperCLOVA 0.5B
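For transparency, a tokens/sec figure can be taken by timing a streamed generation and dividing. A desktop stand-in with llama-cpp-python; the reported numbers were measured on phones, and the GGUF file name here is hypothetical:

```python
# Time a streamed generation and report tokens per second.
import time
from llama_cpp import Llama  # pip install llama-cpp-python

llm = Llama(model_path="qwen3-0.6b-q4_k_m.gguf", n_ctx=1024, verbose=False)

count, start = 0, time.perf_counter()
for _ in llm("Describe edge AI in two sentences.", max_tokens=128, stream=True):
    count += 1  # one streamed chunk per generated token
elapsed = time.perf_counter() - start
print(f"{count} tokens in {elapsed:.2f}s = {count / elapsed:.1f} tok/s")
```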
Comparable benchmark reports (no cloud, all on-device). I'd really value your thoughts on:
- Creative ideas to further optimize inference under these hardware constraints
- Other compact LLMs worth testing on-device
- Experiences you've had trying to deploy LLMs at the edge
If there's interest, I'm happy to share more details on the test setup, hardware specs, or the tooling we used for these comparisons.
Thanks for taking a look, and you can build your own at https://mlange.zetic.ai!