Anyone melding GPT-level intelligence with physical world?

2•iamnnk•2h ago
The current state of LLMs (ChatGPT, Gemini) gives the impression of having 'solved digital experience' completely. They are self-contained to the extent that the 2023 technique of building wrappers on top of them to customise experiences seems redundant.

I intuitively sense scope for a meld of such intelligence with the physical world.

Are there startups that are building anything cool in this space?

Comments

ai_critic•2h ago
What on earth ever gave you that impression?
gtirloni•2h ago
That's an interesting question, but the "AI wrappers" aren't going away because 1) the LLMs aren't totally deterministic and 2) feeding them the correct prompts and context is still very valuable. In other words, one-shotting doesn't work for every use case (which is essentially what you're saying when you say they are "self-contained", right? Unfortunately, they aren't/can't be).

Regarding the physical world, that's a deeper question. You have people who say LLMs "understand", that they are "intelligent", and that this is an "emergent behavior" of all their weights. You also have people who say they are nothing more than stochastic parrots or auto-complete on steroids.

I'm in neither camp, but let's do a thought exercise. Multi-modal LLMs are trained on text, video, and sound. They can know what a chair looks like, what sound it makes if you drag it over a wooden floor, and what it would look like when you do that (from this mysterious PoV somewhere). Now take that "knowledge" and ask it to give you 3D coordinates to move a chair right now in the room you're standing in: it simply can't. It's lacking a lot of information about the actual measurements of the room, its own movement capabilities (or those of the human who would carry out the task), etc.

There are AI systems that can do this, but they aren't good at text. We have self-driving cars and factory robots doing things constrained to those domains.

If you mean "meld" as in "let's combine a bunch of different AI technologies together, with each one doing what it does best", I'm sure people are working on this already. But LLMs are only a small part of solving that problem.

EDIT: if you still can, please add "Ask HN: " to your title here.
