
Fastvlm: Efficient vision encoding for vision language models

https://github.com/apple/ml-fastvlm
185•nhod•4h ago

Comments

BryanLegend•3h ago
Seems like the main thing holding these new minds back is being able to see well. Breakthroughs like this will fix that.
efnx•3h ago
That and the ability to hold on to knowledge.
static_void•1h ago
... or say they don't know.
kamranjon•3h ago
Apple out here playing 5d chess, installing neural cores in their hardware and writing crazy efficient vision models to run on em. Cool stuff.
wmf•3h ago
I thought they turned sycophancy off...
kamranjon•3h ago
Aw yes, I admit it: I think the new Apple hardware is real cool
vFunct•3h ago
Can it fill a wine glass to the rim?
mkl•2h ago
It's for interpreting images, not generating them.
turnsout•3h ago
Apple has gotten a slow start in the LLM world, but they have the only long term strategy that makes sense. They’re going to dominate the 2030s.
boroboro4•3h ago
What exactly is the strategy?
generalizations•3h ago
They can run locally on-device: a win for cost, latency and privacy (privacy is pragmatic: it means you can use all the user's data as context without qualms). There's a reason Microsoft tried so hard to push for the neural processors a year or two ago. Avoiding the cost of the datacenter while offering good-enough inference (emphasis on good) is a massive win.
turnsout•2h ago
Yes, thank you; this is the strategy I was referring to. It will take some time for the models and chips to get there, but on-device inference will have massive advantages for privacy, speed and cost. Plus it will drive demand for hardware—at first, iPhones, but soon AirPods and glasses.
xnx•2h ago
Google already has some of the best on device models (Gemma) and chips (Tensor).
AceJohnny2•1h ago
> and chips (Tensor)

Is there actually any hard data out there comparing the NPU on the Google Tensor G4 vs the Apple A18? I wasn't able to quickly find anything concrete.

I mean, Apple has been shipping mobile NPUs for longer than Google (Apple: since the A11 in 2017, Google: since 2021), and its chips are built on (ostensibly) a smaller silicon node than Google's (G4: Samsung SF4P vs A18: TSMC N3E). However, the G4 appears to have more RAM bandwidth (68.26 GB/s vs 60 GB/s on the A18).
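
To see why the RAM bandwidth figures matter, here is a rough back-of-the-envelope sketch. It assumes token generation is purely memory-bound, meaning every generated token reads all model weights from RAM once. The bandwidth numbers are the ones quoted above; the 1.0 GB weight size is a made-up placeholder, not a measured value. Real throughput also depends on caches, the KV cache, and how much of the bus the NPU can actually use, so treat the output as an upper bound only.

```python
def max_decode_tokens_per_s(bandwidth_gb_s: float, weight_gb: float) -> float:
    """Upper bound on decode speed if each token streams all weights once."""
    return bandwidth_gb_s / weight_gb


# Bandwidth figures quoted in the comment above.
CHIP_BANDWIDTH_GB_S = {"Tensor G4": 68.26, "A18": 60.0}

# Assumption for illustration only: 1.0 GB of weights resident in RAM.
WEIGHT_GB = 1.0

for chip, bandwidth in CHIP_BANDWIDTH_GB_S.items():
    bound = max_decode_tokens_per_s(bandwidth, WEIGHT_GB)
    print(f"{chip}: at most {bound:.1f} tokens/s")
```

Under this simplification the bound scales linearly with bandwidth, so the roughly 14% bandwidth gap would translate into at most a roughly 14% gap in decode speed, all else being equal.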

weikju•2h ago
They are running data centers and offloading some things to ChatGPT though, not just running on device.

In fact, there's no clear indication of whether Apple Intelligence is running on-device or in their Private Cloud Compute.

jfarina•3h ago
What strategy is that?
ryanmcgarvey•3h ago
I presume they mean that distribution is king and they make all the devices.
insane_dreamer•3h ago
As the father of a young child whose optic nerves are highly deteriorated (compression) and is expected to lose his sight (when exactly is unknown; based on original projections he should be blind by now, but an experimental treatment run in a trial at the NIH (KEEP FUNDING SCIENCE) has stabilized his sight), I'm overjoyed with the advances being made in VLMs. I can now envision a future where even if he loses his sight he'll be able to interact with the world around him, go to college, have a fulfilling career (he loves science and engineering, and is talented for his young age), etc.
lynx97•30m ago
I grew up in the 80s as a 100% blind child. Technology was nowhere near as advanced as it is today. Computers were just coming up when I was around 12. I learnt to type on an old-school typewriter, and I also learnt to write braille with a pretty heavy full-metal embossing device. OCR was still quite bad. When I switched to what you call high school, I used a laptop with an integrated Braille display to follow classes. I used good old DOS as the OS and Word 5.5 as my "notepad". Except for PC Lingua for Latin, I basically had no tools specialized for learning. An electronic notepad and my brain were all I had to follow school. And I still made it. I have a great job I love, my own apartment, a sweet girlfriend and I am basically completely independent. To the point where I had to forcefully send away my mother, since her continued attempts to "help" me were basically detrimental to my own development. I cannot emphasize enough how important it is how you deal with it as a parent. Since parents are indeed the biggest hindrance to development, we have a saying around here amongst disabled people: "additional disability due to parental overprotection" (Zusatzbehinderung Eltern). Please take a moment to understand what this means, without feeling personally attacked. It's important. Your child can leave home around 18, just like every other kid. I did. Don't slow that process down artificially. The more this is prolonged, the harder it gets for the individual to actually obtain independence.

I am telling you this because I read between the lines that you believe current technology is a reason for you to be hopeful. Sure, it should be. But never forget, your child can do much more than you as a sighted person will ever be able to understand. Don't let them drown in your own misery. Let them discover what they can do. You will be surprised what they come up with. And don't fall for Gear Acquisition Syndrome. Sure, tools are nice, and they do get better, which is also nice. I LOVE vision models, to stay on topic somehow. However, I still leave my house with only a cane and my phone in my pocket. I do occasionally ask Siri "Where am I" to get an address if I happen to have forgotten exactly where I am. But at the end of the day, my cane is what shows me the way. Most tech is hype; plain old hearing and your sense of touch get you much farther than you might think.

Wish you all the best for your own journey, and the development of your child.

liamwire•2h ago
It feels like this is the level of speed-up in time-to-first-token needed to make continuous vision useful for on-device applications, like an assistant that can see and take action on your screen, à la the original Apple Intelligence demos. It's very impressive seeing the app in the repo and I'm excited to build it tonight and play around.
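
For anyone who wants to check time-to-first-token on their own setup, a minimal sketch of the measurement follows. It assumes the model exposes its output as a Python iterable of token strings; `fake_stream` is a hypothetical stand-in for that and is not part of the FastVLM repo. The timer starts before the first token is requested, so image encoding and prompt prefill both count toward the result.

```python
import time
from typing import Iterable, Iterator, Tuple


def time_to_first_token(stream: Iterable[str]) -> Tuple[float, Iterator[str]]:
    """Return (seconds until the first token, iterator over all tokens)."""
    start = time.perf_counter()
    it = iter(stream)
    first = next(it)  # blocks until the model emits its first token
    ttft = time.perf_counter() - start

    def replay() -> Iterator[str]:
        # Hand the first token back so the caller still sees the full output.
        yield first
        yield from it

    return ttft, replay()


def fake_stream(prefill_s: float = 0.05) -> Iterator[str]:
    """Hypothetical stand-in for a model: sleeps to mimic encoding + prefill."""
    time.sleep(prefill_s)
    yield "A"
    yield " cat"


ttft, tokens = time_to_first_token(fake_stream())
print(f"TTFT: {ttft * 1000:.0f} ms, output: {''.join(tokens)!r}")
```

Swapping `fake_stream()` for a real streaming generator gives a like-for-like comparison between models, as long as the same image and prompt are used.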
nine_k•2h ago
With that, a really helpful aid for blind people can be made, running just on their phone, fed from a camera in their eyeglasses. Somebody who could not move around without an assistant could become autonomous in daily life.
adamsiem•2h ago
Anyone using vision to parse screenshots? QVQ was too slow. Will give this a shot.
abrichr•2h ago
You might be interested in https://github.com/OpenAdaptAI/OpenAdapt
logankeenan•2h ago
I used molmo to parse screenshots in order to detect locations of UI elements. See the repo below. I think Omni parser from Microsoft would also work well.

https://github.com/logankeenan/george

https://github.com/microsoft/OmniParser

nprateem•1h ago
OMG Apple finally managed to hire an AI researcher.
Aeroi•1h ago
I'm building a realtime voice+vision app called Sen. It's currently live in beta and streams frames over WebRTC. It's fast and smart, but I'm super curious to see how these models do as we get closer to the metal. I can see these running on-device in the future with super fast TTFB.
keyle•1h ago
Do you have a write up of the tech stack and setup? Or willing to give the gist here?

I'd like to make a private Qwen or similar for my kids to prompt with a button and voice control. It doesn't need vision... Although eventually that'd be very cool.

Siri just sucks.

We might not be there yet...

Aeroi•1h ago
yeah i made a post on here, but the algo sent it to the gulag abyss.

https://news.ycombinator.com/item?id=43926673

keyle•30m ago
That's a good product site but it doesn't help me in any way...
Aeroi•1h ago
I also ran across an interesting robot toy demo today that had voice built in. It was whimsical and seemed like it was aimed at primary education and kids. Someone here might know the name.
nikolayasdf123•1h ago
2 GB for the smallest (0.5B) model. It does not make sense for each app to download this. Apple must have plans to preload these models at the OS level and expose an SDK so all apps can call them locally. Exciting times!

I opened an issue asking them to confirm this: https://github.com/apple/ml-fastvlm/issues/7
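
For context on the size figure, a quick sketch of how parameter count maps to weight size at common precisions. This is plain arithmetic (parameters times bytes per parameter) and says nothing about how the actual download is packaged. The parameter count is taken from the "0.5B" in the model name, and a real checkpoint would also include the vision encoder and other files, so the total will be larger than the language-model weights alone.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_size_gb(n_params: float, dtype: str) -> float:
    """Size of the raw weights in GB (10^9 bytes) at the given precision."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


N_PARAMS = 0.5e9  # assumption: taken from the "0.5B" in the model name

for dtype in BYTES_PER_PARAM:
    print(f"0.5B params @ {dtype}: {weight_size_gb(N_PARAMS, dtype):.2f} GB")
```

By this arithmetic, 0.5B parameters come to 2 GB at fp32 and 1 GB at fp16, so quantization alone could shrink the per-app footprint considerably.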

nikolayasdf123•1h ago
Google and cloud LLM providers must be gnashing their teeth now! Haha
nikolayasdf123•1h ago
Distributing this heavy compute and moving it onto the device, where (1) the data originates and (2) the decision based on the analysis is made, is the way to go. Super low latency, no network traffic, privacy, less overhead in the cloud. This is amazing.
porphyra•57m ago
It seems that the future of robotics is VLA models. Even Tesla FSD is an end-to-end VLA model. Efficient vision encoding will be a huge part of making robots safe and responsive.
lynx97•20m ago
I wonder, can I convert/run this with llama.cpp? It being LLaVA based seems promising.