Over the past few months, I've built a distillation toolkit that supports cross-tokenizer distillation (e.g., distilling a LLaMA teacher into the Qwen vocabulary, among other pairings). The approach has worked well on reasoning datasets like AIME, and we've validated it on models such as Phi and Qwen.
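To give a feel for the core problem, here's a minimal, self-contained sketch (not the toolkit's actual implementation; the vocabularies and probabilities are made up for illustration). When teacher and student use different tokenizers, their output distributions can't be compared index-by-index, so one simple approach is to project both onto shared token strings before computing a KL loss:

```python
import math

# Toy teacher/student next-token distributions over DIFFERENT vocabularies.
# These tokens and probabilities are hypothetical, chosen only to illustrate
# why cross-tokenizer distillation needs an alignment step.
teacher_probs = {"hello": 0.6, "hel": 0.1, "lo": 0.1, "world": 0.2}
student_probs = {"hello": 0.5, "wor": 0.2, "ld": 0.1, "world": 0.2}

def align(p, q):
    """Restrict two distributions to their shared token strings, renormalize."""
    shared = set(p) & set(q)
    ps = {t: p[t] for t in shared}
    qs = {t: q[t] for t in shared}
    zp, zq = sum(ps.values()), sum(qs.values())
    return ({t: v / zp for t, v in ps.items()},
            {t: v / zq for t, v in qs.items()})

def kl(p, q):
    """KL(p || q) over a shared support."""
    return sum(pv * math.log(pv / q[t]) for t, pv in p.items())

p, q = align(teacher_probs, student_probs)
loss = kl(p, q)  # small positive number: distributions mostly agree
print(round(loss, 4))
```

In practice the alignment is subtler than intersecting vocabularies (tokens can split differently across sequences), but the sketch shows why a plain same-index KL loss doesn't apply across tokenizers.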
We've also integrated Modal for quick deployment (it comes with $30/month in credits to try it out).
Would love any feedback!
GitHub: https://github.com/agokrani/distillKitPlus
Docs: https://distillkitplus.mintlify.app/