- Completely skips samples the model has mastered
- Gives up to 5× more compute to hard or confidently-wrong samples
- Dynamically adjusts sample weights using a "Mountain Curriculum"

Just dropped v0.3.0 with native LoRA/PEFT, BF16, gradient checkpointing, torch.compile, and 8-bit optimizer support. I'm currently building a clean UI for it.

I'm a 17-year-old indie dev working on this. Would love honest feedback, especially from people who do a lot of fine-tuning.
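For anyone curious what loss-based curriculum weighting like this can look like, here's a minimal sketch. All names (`mountain_weights`, `skip_below`, `max_boost`) are hypothetical illustrations, not the library's actual API: samples below a loss threshold are treated as mastered and get weight 0, and the remaining samples are mapped linearly onto weights from 1× up to 5×.

```python
def mountain_weights(losses, skip_below=0.05, max_boost=5.0):
    """Hypothetical sketch of loss-based curriculum weighting.

    Samples with per-sample loss below `skip_below` are treated as
    mastered and skipped (weight 0). The remaining "hard" samples are
    mapped linearly onto [1.0, max_boost], so the hardest sample in
    the batch gets up to max_boost times the effective compute.
    """
    hard = [l for l in losses if l >= skip_below]
    if not hard:
        return [0.0] * len(losses)  # whole batch mastered: skip it
    lo, hi = min(hard), max(hard)
    span = max(hi - lo, 1e-8)  # avoid divide-by-zero on uniform losses
    weights = []
    for l in losses:
        if l < skip_below:
            weights.append(0.0)  # mastered sample: contributes nothing
        else:
            # linear map from [lo, hi] onto [1.0, max_boost]
            weights.append(1.0 + (max_boost - 1.0) * (l - lo) / span)
    return weights


# Example: one mastered sample, one average, one very hard
print(mountain_weights([0.01, 0.5, 2.0]))  # → [0.0, 1.0, 5.0]
```

In training, these weights would multiply the per-sample losses before reduction, so the gradient signal concentrates on the samples the model is still getting wrong.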
jappleseed987•1h ago
One thing you might want to consider as you build out the UI: having good observability into your actual cost savings across different scenarios. When I've worked with teams doing LLM optimization, they often struggle to quantify their improvements across different providers or track cost trends over time.
Have you thought about how you'll measure and display the real-world cost impact of your optimizations? It could be powerful for users to see not just the compute reduction percentages, but actual dollar savings and trends.
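To make the suggestion concrete, here's a rough sketch of turning a token-count reduction into a dollar figure. The function name and the per-1K-token price are illustrative assumptions, not real provider pricing:

```python
def dollar_savings(baseline_tokens, optimized_tokens, price_per_1k):
    """Hypothetical sketch: convert a token reduction into dollars saved.

    `price_per_1k` is an assumed provider price per 1K tokens; real
    dashboards would pull current per-provider pricing.
    """
    baseline_cost = baseline_tokens / 1000 * price_per_1k
    optimized_cost = optimized_tokens / 1000 * price_per_1k
    saved = baseline_cost - optimized_cost
    pct = 100 * saved / baseline_cost if baseline_cost else 0.0
    return saved, pct


# Example with made-up numbers: 1M tokens cut to 400K at $0.002/1K
saved, pct = dollar_savings(1_000_000, 400_000, 0.002)
print(f"${saved:.2f} saved ({pct:.0f}% reduction)")
```

Showing both the absolute dollar figure and the percentage side by side tends to land better with users than a compute-reduction percentage alone.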
Speaking of cost observability: I recently came across zenllm.io, which is doing some interesting work in this space, focused on tracking LLM costs across different providers. It might be worth checking out for inspiration on which metrics and visualizations work well for users trying to optimize their LLM spend.
Keep up the great work - this kind of innovation is exactly what the community needs!