frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: GEKO (up to 80% compute savings on LLM fine-tuning)

https://github.com/ra2157218-boop/GEKO
1•SyedAbdurR2hman•2h ago
Hey HN, Most fine-tuning loops waste a huge amount of compute by treating every sample equally every epoch — even the ones the model has already mastered. I built GEKO (Gradient-Efficient Knowledge Optimization) to fix that. It tracks per-sample confidence and correctness in real time and:

Completely skips samples the model has mastered Gives up to 5× more compute to hard/confidently-wrong samples Dynamically adjusts sample weights using a "Mountain Curriculum" Just dropped v0.3.0 with native LoRA/PEFT, BF16, gradient checkpointing, torch.compile, and 8-bit optimizer support. I'm currently building a clean UI for it. I'm a 17-year-old indie dev working on this. Would love honest feedback, especially from people who do a lot of fine-tuning.

Comments

jappleseed987•1h ago
This is really impressive work for a 17-year-old! The Mountain Curriculum approach sounds clever - dynamically adjusting based on model confidence is exactly the kind of smart optimization the LLM space needs.

One thing you might want to consider as you build out the UI: having good observability into your actual cost savings across different scenarios. When I've worked with teams doing LLM optimization, they often struggle to quantify their improvements across different providers or track cost trends over time.

Have you thought about how you'll measure and display the real-world cost impact of your optimizations? It could be powerful for users to see not just the compute reduction percentages, but actual dollar savings and trends.

Speaking of cost observability - I recently came across zenllm.io and they're doing some interesting work in this space, focused on tracking LLM costs across different providers. Might be worth checking out for inspiration on what metrics and visualizations work well for users trying to optimize their LLM spend.

Keep up the great work - this kind of innovation is exactly what the community needs!

From Defense AI Drift to Policy Enforcement: Why I Built Firebreak

https://eric.mann.blog/from-defense-ai-drift-to-policy-enforcement-why-i-built-firebreak/
1•eamann•4m ago•0 comments

Israel Says Iran Supreme Leader Khamenei Is Dead

https://www.axios.com/2026/02/28/iran-khamenei-killed-israel
2•doener•6m ago•0 comments

Show HN: Pending – a tiny pure-Go in-memory deferred task scheduler

https://github.com/kahoon/pending
1•kahoonster•6m ago•1 comments

Show HN: Potatoverse platform for webapps, SQLite and static binary

https://github.com/blue-monads/potatoverse
3•born-jre•7m ago•0 comments

Iran Monitor | Real-Time Osint Dashboard for Iran

https://www.iranmonitor.org/
1•wizardforhire•8m ago•0 comments

Donald Trump Is the Crypto President. Why Is It Struggling?

https://www.nytimes.com/2026/02/26/opinion/crypto-trump-bitcoin-clarity-genius.html
1•coloneltcb•10m ago•0 comments

An Open Letter to the Department of War and Congress

https://app.dowletter.org
1•-_-•10m ago•0 comments

Our Agreement with the Department of War

https://openai.com/index/our-agreement-with-the-department-of-war
4•surprisetalk•10m ago•1 comments

Trump orders government to stop using Anthropic in battle over AI use

https://www.bbc.com/news/articles/cn48jj3y8ezo
6•devonnull•13m ago•0 comments

Bad Apple but it's a dynamic boids simulation

https://priyavkaneria.com/posts/Bad-apple-but-its-dynamic-boids-simulation/
1•diginova•13m ago•0 comments

Tell HN: My daily game won a Players Choice Award

3•paulhebert•13m ago•1 comments

How China's Communist Party seized power in 1949 (due to Soviet support)

https://www.economist.com/culture/2026/02/26/how-chinas-communist-party-seized-power-in-1949
2•marojejian•15m ago•1 comments

You can log into 28 vintage computer systems in the browser for free

https://www.tomshardware.com/video-games/retro-gaming/you-can-log-into-28-vintage-computer-system...
2•ohjeez•16m ago•0 comments

Target will stop selling cereals with synthetic colors by end of May

https://www.sfgate.com/business/article/target-to-stop-selling-cereals-with-certified-21945159.php
1•tokyobreakfast•17m ago•0 comments

War powers debate intensifies after Trump Iran attack without Congress approval

https://apnews.com/article/congress-war-powers-trump-iran-constitution-37ec6685d9ded1d467a719f91e...
2•SilverElfin•19m ago•0 comments

A Cookie for Dario? – Anthropic and selling death

https://www.anildash.com/2026/02/27/a-cookie-for-dario/
1•only_in_america•20m ago•0 comments

Why reinforcement learning breaks at scale, and how a new method fixes it

https://techxplore.com/news/2026-02-scale-method.html
1•brandonb•20m ago•0 comments

What Art Is Doing

https://www.symmetrybroken.com/what-art-is-doing/
1•riemannzeta•21m ago•0 comments

Simulated Reality: Quantum Mechanics, Brain-Machine Interfaces, Transhumanism

https://simulatedrealitybook.com/
1•thebojda•23m ago•0 comments

Ask HN: Apart from coding, what do you use AI for daily?

1•kantord•24m ago•2 comments

Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers

https://venturebeat.com/technology/alibabas-new-open-source-qwen3-5-medium-models-offer-sonnet-4-...
4•lostmsu•25m ago•1 comments

Data-Driven Nutrition

https://www.empirical.health/blog/biomarker-driven-nutrition/
2•brandonb•25m ago•0 comments

IEEE robot videos (video Friday)

https://spectrum.ieee.org/quadruped-farming-robots
1•bsrkf•26m ago•0 comments

Scientists deliver new molecule for getting DNA into cells

https://phys.org/news/2026-02-scientists-molecule-dna-cells.html
1•geox•27m ago•0 comments

Discord's Fall Would Suck for TTRPGs

https://www.gamespot.com/articles/discords-fall-would-suck-for-ttrpgs/1100-6538456/
1•1659447091•27m ago•1 comments

Is using AI for domestic defense more and more nullifying the second amendment?

1•bdelmas•30m ago•1 comments

Tether: An inter-LLM mailbox MCP tool

https://github.com/latentcollapse/Tether
1•LC_58008•30m ago•1 comments

OpenClaw vs. Google – Mass Ban Wave [video]

https://www.youtube.com/watch?v=qLI_5e8IsSY
1•sabrina_ramonov•31m ago•0 comments

Kash Patel's Girlfriend Seeks Fame and Fortune, Escorted by an FBI Swat Team

https://www.nytimes.com/2026/02/28/us/politics/kash-patel-girlfriend.html
5•duxup•31m ago•1 comments

Apple's Rosetta 2 for Linux VM hides the CPU and kernel arch info

https://blog.inoki.cc/2026/02/28/Apple-Rosetta-Linux-VM-Secret-en/index.html
1•inoki•33m ago•0 comments