Show HN: OfflineLLM: Live Voice Chat with DeepSeek, Llama on iOS and VisionOS

https://offlinellm.bilaal.co.uk/
4•bilaal_dc5631•1d ago
Hi, this is something I've been working on for the past 18 months. There is an abundance of tools for running LLMs locally on desktops (e.g. ollama, LM Studio), but other devices have been left out. This project brings these models to iOS and visionOS, and it has turned out to work really well. Even an iPhone 14 Pro can quite easily run the 3B parameter version of Llama 3.2. CLIP models work well too!
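
As a rough back-of-the-envelope check on why a 3B model can fit on a phone, the sketch below estimates the size of the weights alone. The 4.5 bits per weight figure is an assumption (roughly what a 4-bit quantization with per-block scales might average), not a number taken from the app, and the estimate ignores the KV cache and runtime overhead.

```python
def estimate_weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Approximate storage needed for the model weights alone."""
    return n_params * bits_per_weight / 8


# Assumed values, for illustration only.
N_PARAMS = 3e9          # nominal "3B" parameter count
BITS_PER_WEIGHT = 4.5   # hypothetical average for a 4-bit quantized file

size_gib = estimate_weight_bytes(N_PARAMS, BITS_PER_WEIGHT) / 2**30
print(f"~{size_gib:.2f} GiB of weights")
```

Changing `BITS_PER_WEIGHT` to 16 shows how much larger the same model would be without quantization.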

It also has a Live Voice Chat which gives a 2-way conversation experience, functionality similar to the cloud-based Gemini Live feature that Google offers.

Under the hood it can run most GGUF models, using a heavily forked and diverged version of llama.cpp, which has helped performance on mobile devices.
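
For readers unfamiliar with GGUF, here is a minimal sketch of reading the fixed-size header at the start of a GGUF file. It is not code from the app or from llama.cpp. It assumes the little-endian, version 2 or later layout (4-byte magic, `uint32` version, `uint64` tensor count, `uint64` metadata key/value count). The function name and the sample values are made up for illustration.

```python
import io
import struct
from typing import BinaryIO

GGUF_MAGIC = b"GGUF"


def read_gguf_header(f: BinaryIO) -> dict:
    """Read the fixed-size GGUF header (assumes little-endian, v2+ layout)."""
    magic = f.read(4)
    if magic != GGUF_MAGIC:
        raise ValueError(f"not a GGUF file: magic={magic!r}")
    # uint32 version, uint64 tensor count, uint64 metadata key/value count.
    raw = f.read(20)
    if len(raw) != 20:
        raise ValueError("truncated GGUF header")
    version, tensor_count, kv_count = struct.unpack("<IQQ", raw)
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }


# Synthetic in-memory header with made-up counts, standing in for a real file.
sample = io.BytesIO(GGUF_MAGIC + struct.pack("<IQQ", 3, 2, 5))
header = read_gguf_header(sample)
print(header)
```

With a real model you would pass a file opened in binary mode, e.g. `open(path, "rb")`. The metadata key/value pairs and tensor descriptions follow this header and are not parsed here.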

The next step is to integrate Apple's on-device 3B model, which they will hopefully open up access to at WWDC in a week's time. I'm also in the middle of adding support for Gemma 3 and Qwen 3.

Let me know what you think!

Comments

35jelly35•1d ago
> Even an iPhone 14 Pro can quite easily run the 3B parameter version of Llama 3.2

Wow. I never thought a non-Apple Intelligence phone would be able to run this. Does the phone get hot at all?

Also, how long did it take you to build this and how easy is it to test this in Xcode?

bilaal_dc5631•1d ago
Thanks for the questions.

> Does the phone get hot at all?

It's pretty reasonable and similar to the heat you'll get when playing an intensive game. If you're sensible it's pretty usable.

> how long did it take you to build this

I first started in 2023 and managed to get an MVP out the same year. That was pretty basic and a lot of work has been done since. I don't have an accurate measure of how much time has been spent, but it's had a lot of my attention since I released the first MVP.

> how easy is it to test this in Xcode?

This is pretty nice actually. It runs absolutely fine in the simulator, which is where I do most of my testing. The only time I have to move to a physical device is for performance testing, which isn't a huge drain on productivity.
