frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Loft CLI – Fine-tune and run LLMs (1–3B) on 8 GB MacBook Air, no GPUs

1•dips2umar•5h ago
GitHub: https://github.com/diptanshu1991/LoFT

I built *LoFT*, a lightweight CLI that turns any 8 GB laptop into a tiny LLM training and inference rig — no GPU, no cloud.

5 Commands: 1. `loft finetune` → Train LoRA adapters on CPU 2. `loft merge` → Merge adapters into model 3. `loft export` → Convert to GGUF (FP16) 4. `loft quantize` → Apply Q4_0 (4-bit) quantization 5. `loft chat` → llama.cpp CPU chat @ ~7 tok/s

Benchmarks on 8 GB MacBook Air: | Step | Time | Peak RAM | |-------------|--------|----------| | Finetune | 23 min (sample run) | 308 MB | | Merge | 4.7 min | 322 MB | | Quantize | 21 sec | 322 MB | | Inference | 6.9 tok/s | 322 MB |

Also ran a full 300-row Dolly finetune (2 epochs) in *~1.5 hours*, achieving *sub-1 loss* on CPU-only setup. No crashes, swap kills, or GPU needed.

Why this matters: - Makes local LLM customization accessible to devs without GPU access - Enables domain-specific agents (summarizer, support bot, Q&A) on commodity laptops - Everything runs via CPU (no CUDA, no cloud)

Would love feedback on: - UX improvements or edge cases - Adapter recipes you’d want (legal, summarization, customer support, etc.) - Cool things you’d build with low-RAM LLMs

MIT-licensed, 100% local. Feedback is very welcome.

– Diptanshu \

Comments

dips2umar•4h ago
Author here — happy to answer any questions!

One thing that surprised us: on an 8 GB M2 Air, peak RAM never exceeded 330 MB during a full 300-sample finetune (2 epochs) — thanks to gradient checkpointing, which reduces memory usage by recomputing activations instead of storing them.

If anyone tries LoFT on Windows or Linux, I’d love to hear your first-token latency with `loft chat`. On macOS we see ~145 ms/token with TinyLlama + GGUF.

A Founder's 20-Year Journey Through Compound Stupidity

https://www.valleyofdoubt.com/p/a-founders-20-year-journey-through
1•shandsaker_au•38s ago•0 comments

PBS Passport Streaming Service

https://help.pbs.org/support/solutions/articles/5000692392-what-is-pbs-passport-
1•rez10191•3m ago•0 comments

Experimenting with Flox's new build and publish

https://thefridaydeploy.substack.com/p/experimenting-with-floxs-new-build
1•tomberek•6m ago•1 comments

Elizabeth Holmes's Partner Has a New Blood-Testing Startup

https://www.nytimes.com/2025/05/10/business/elizabeth-holmes-partner-blood-testing-startup.html
2•wslh•10m ago•3 comments

Investment Promised a 25% Yield, Then Collapsed 98% (2020)

https://www.nasdaq.com/articles/this-cant-miss-investment-promised-a-25-yield-then-collapsed-98-2020-06-22
2•paulpauper•13m ago•0 comments

AI Is the Answer to Everything

https://banagale.com/ai.htm
3•bredren•20m ago•1 comments

How China Became the World's Biggest Shipbuilder

https://www.construction-physics.com/p/how-china-became-the-worlds-biggest
3•throw0101b•24m ago•0 comments

Why It Feels Like Every Company Suddenly Wants to Sell You Protein [video]

https://www.youtube.com/watch?v=7XcU3cN-EeE
2•mgh2•32m ago•1 comments

OpenAI Ignored IMO Request, Announced Math Results Before Closing Ceremony

https://twitter.com/mihonarium/status/1946880931723194389
8•py4•32m ago•0 comments

Thawing vacuum-packed fish correctly

https://www.canr.msu.edu/news/open_your_vacuum_packed_fish_before_thawing
2•js2•38m ago•1 comments

Rethinking "Progress": A Hard Look at Sustainability

1•upwardbound2•38m ago•0 comments

WordPecker: Open-Source Personalized Duolingo

4•arbayi•38m ago•0 comments

The Physical Turing Test: Jim Fan on Nvidia's Roadmap for Embodied AI

https://www.youtube.com/watch?v=_2NijXqBESI
2•mgh2•40m ago•0 comments

Crayola CEO's how-to-succeed guide: Lose the tie pretend you don't know anything

https://www.aol.com/crayola-ceo-succeed-guide-hires-124500427.html
2•Bluestein•41m ago•0 comments

Assessing interstellar comet 3I/ATLAS with the 10.4M Gran Telescopio Canarias

https://arxiv.org/abs/2507.12922
2•bikenaga•43m ago•0 comments

AI Coding Tools Underperform in Field Study with Experienced Developers

https://www.infoq.com/news/2025/07/ai-productivity/
6•mikece•45m ago•0 comments

IPv6 Based Canvas

https://canvas.openbased.org/
3•tylermarques•46m ago•0 comments

Think Toggles Are Dumb

https://www.paritybits.me/think-toggles-are-dumb/
3•LorenDB•53m ago•0 comments

Trump threatens stadium deal unless NFL team readopts Redskins name

https://www.reuters.com/sports/trump-threatens-washington-stadium-deal-unless-nfl-team-readopts-redskins-name-2025-07-20/
11•geox•59m ago•4 comments

U.S.-Based Wells Fargo Banker Blocked from Leaving China

https://www.wsj.com/world/china/wells-fargo-banker-china-89824413
3•impish9208•59m ago•1 comments

Delta Air Lines is using AI to set the maximum price you're willing to pay

https://www.theverge.com/news/709556/delta-air-lines-ai-ticket-price-rollout
9•pseudolus•1h ago•1 comments

How Distillation Makes AI Models Smaller and Cheaper

https://www.quantamagazine.org/how-distillation-makes-ai-models-smaller-and-cheaper-20250718/
3•pseudolus•1h ago•0 comments

'Flutter': The song that saved raves from a government ban

https://faroutmagazine.co.uk/flutter-the-song-that-saved-raves-from-a-government-ban/
3•joelanman•1h ago•0 comments

Cuban Experiences on Computing and Education

https://link.springer.com/content/pdf/10.1007/978-0-387-09657-5_4
2•marcodiego•1h ago•0 comments

The Last SS Guard

https://www.zeit.de/gesellschaft/2025-06/concentration-camp-guard-gregor-formanek-ss-national-socialism-sachsenhausen-court-trial-english
2•slow_typist•1h ago•0 comments

Nvidia Bringing CUDA to RISC-V

https://www.phoronix.com/news/NVIDIA-CUDA-Coming-To-RISC-V
2•michaelkrem•1h ago•0 comments

Mathematical Foundations for Finance

https://metaphor.ethz.ch/x/2021/hs/401-3913-01L/
2•ibobev•1h ago•0 comments

Cloudflare Learning Center

https://www.cloudflare.com/en-ca/learning/
3•vaughands•1h ago•0 comments

Global hack on Microsoft product hits U.S., state agencies, researchers say

https://www.washingtonpost.com/technology/2025/07/20/microsoft-sharepoint-hack/
5•spenvo•1h ago•1 comments

Longevity Expert Breaks Down the Science and Hype of Biological Aging Tests

https://www.scientificamerican.com/article/what-new-biological-age-clocks-say-about-longevity-according-to-eric-topol/
3•Bluestein•1h ago•0 comments