We accidentally solved robotics by watching 1M hours of YouTube

https://ksagar.bearblog.dev/vjepa/
38•alexcos•6h ago

Comments

okdood64•6h ago
Does YouTube allow massive scraping like this in their ToS?
dangoodmanUT•5h ago
What ToS
bobmcnamara•5h ago
https://www.youtube.com/static?template=terms ?
mouse_•5h ago
Probably not.

Who cares at this point? No one is stopping ML sets from being primarily pirated. The current power is effectively dismantling copyright for AI related work.

perching_aix•5h ago
> The current power is effectively dismantling copyright for AI related work.

I'm out of the loop, apparently. Could you elaborate? By "the current power" I take it you mean the current US administration?

bgwalter•5h ago
Trump fired the head of the copyright office:

https://www.heise.de/en/news/After-criticism-of-AI-training-...

The "Big Beautiful Bill" contains a clause that prohibits state "AI" legislation.

Trump has a "Crypto and AI czar" who is very active in promoting "AI" on his YouTube propaganda outlet. The same czar also promoted, pre-election of course, accelerated peace with Russia and then stopped talking about the subject altogether.

perching_aix•5h ago
Oh wow okay, genuinely missed these. Thanks.
snickerdoodle12•5h ago
> Who cares at this point

Anyone who has a shred of integrity. I'm not a fan of overreaching copyright laws, but they've been strictly enforced for years now. Decades, even. They've ruined many lives, like how they killed Aaron Swartz.

But now, suddenly, violating copyright is totally okay and carries no consequences whatsoever because the billionaires decided that's how they can get richer now?

If you want to even try to pretend you don't live in a plutocracy and that the rule of law matters at all, these developments should concern you.

MaxPock•5h ago
They don't, and neither do I allow my site, whose content I found on Gemini, to be scraped.
klysm•5h ago
I don't think they can legally prevent it
perching_aix•5h ago
My "lawyer" (gpt4o) claims that since YouTube is merely a non-exclusive licensee of the user content uploaded to their service, even if they have such restrictions in their ToS (they do), those restrictions likely would not hold up in court, citing [0]. Something about that non-exclusivity meaning they cannot constrain the copyright further on their own terms. Which I guess makes sense?

And since scraping of publicly available data is not illegal (in the US, according to the aforementioned "lawyer"), it seems like it's okay?

Not legal advice.

[0] https://www.skadden.com/insights/publications/2024/05/distri...

rzzzt•5h ago
Friendly unit conversion man at your service: 114 years.
isoprophlex•5h ago
How much is that in football fields?
forks•5h ago
If you accept 30 years as the average lifespan of an nfl stadium, 3.8
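
The two conversions above are easy to check. A minimal sketch, assuming a 365.25-day year and taking the 30-year stadium lifespan from the comment as given rather than as a verified figure:

```python
HOURS_WATCHED = 1_000_000      # from the article title
HOURS_PER_YEAR = 24 * 365.25   # assumption: average year length including leap days
STADIUM_LIFESPAN_YEARS = 30    # figure proposed in the comment above, not verified


def hours_to_years(hours: float) -> float:
    """Convert a duration in hours to years of continuous viewing."""
    return hours / HOURS_PER_YEAR


years = hours_to_years(HOURS_WATCHED)
stadiums = years / STADIUM_LIFESPAN_YEARS

print(f"{years:.0f} years, {stadiums:.1f} stadium lifespans")
```

Both figures quoted in the thread hold up under these assumptions.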
ReptileMan•5h ago
So a half zoom meeting... or 1/3 Teams one.
perching_aix•5h ago
I genuinely wish there was a cost estimation feature built into them. It doesn't have to be even remotely close to the true cost: if it's anything like the meetings I attend, there will be enough people and it will go on for long enough to make up for it.
ReptileMan•5h ago
I worked as a consultant and started billing at normal hourly rates for meetings. You would be surprised how fast the company's desire for my participation in them decreased.
hobs•5h ago
Why would you do anything but that? If you want to just chat with me forever, the rate is the rate.
contingencies•5h ago
This is interesting for generalized problems ("make me a sandwich") but not useful for most real world functions ("perform x within y space at z cost/speed"). I think the number of people on the humanoid bandwagon trying to implement generalized applications is staggering right now. The physics tells you they will never be as fast as purpose-built devices, nor as small, nor as cheap. That's not to say there's zero value there, but really we're - uh - grasping at straws...
foobarian•5h ago
I wonder if a generalized machine would have an advantage from scale, and then putting all the specialized stuff into software. We have seen this play out before.
ahmedbaracat•5h ago
Well, there’s a middle ground, kinda: using more specialized hardware (e.g. cobots) but deploying state-of-the-art Physical AI (ML/Computer Vision) on it. We’re building one such startup at ko-br (https://ko-br.com/) :))
contingencies•5h ago
Quite a few startups in your space. Many deployed with customers. Good luck finding a USP!
jjangkke•5h ago
Very good point! This area faces a similar misalignment of goals, in that it tries to be a generic one-size-fits-all solution, the same problem that is rampant with today's LLMs.

We made a sandwich, but it cost you 10x more than a human would and it was slower. It might slowly become faster and more efficient, but by the time it gets really good at it, the skill is simply not transferable unless the model is genuinely able to make the leap into other domains, as humans naturally do.

I'm afraid this is where the barrier between general intelligence and human intelligence lies. With enough of these geospatial motor skill databases, we might get something that mimics humans very well but still runs into problems at the edge, and this last-mile problem really is a hindrance to so many domains where we come close but never complete.

I wonder if this will change with some shift in computing, as well as in how we interface with digital systems (without mouse or keyboard). That might be able to close the 'last mile gap'.

esjeon•5h ago
Note that the username here is a Korean derogatory term for Chinese people.
jes5199•4h ago
analogy: a CPU is more expensive, more complicated, more energy demanding than custom made circuitry, in most cases.
imranq•5h ago
This was a bit hard to read. It would be good to have a narrative structure and a clearer explanation of concepts.
signal-intel•5h ago
Very intentional. Their response would be: “if you need narrative structure and clear explanation of concepts, yngmi”.
richard___•5h ago
Solved??? Where?
pr337h4m•5h ago
IMO, VideoMimic is a better proof-of-concept

https://www.videomimic.net/

https://www.videomimic.net/page1.html

Keyframe•5h ago
Looks like it was trained on Shaolin Drunken Fist videos. Does it look drunk because of the videos or because there's a discrepancy between videos and it not accounting for gravity and physics in general?
throwaway198846•5h ago
I wonder how much language this model understands. If we pan across text, will it fill in a sensible next word? How good will it be?
ErrorNoBrain•5h ago
Someone watched 'Devs'?

If you haven't, highly recommended.

andruby•5h ago
Do you have a link or a less generic search term?
hshshshshsh•5h ago
Bro, ChatGPT exists.
conception•5h ago
Do we have a “let me ChatGPT that for you..” site yet?
VladVladikoff•5h ago
It’s a TV show made by Alex Garland: https://m.imdb.com/title/tt8134186/ It’s pretty good sci-fi IMHO.
hahaxdxd123•5h ago
Extremely oversold article.

> the core insight: predict in representation space, not pixels

We've been doing this since 2014? Not only that, others have been doing it at a similar scale. e.g. Nvidia's world foundation models (although those are generative).

> zero-shot generalization (aka the money shot)

This is easily beaten by flow-matching imitation learning models like what Pi has.

> accidentally solved robotics

They're doing 65% success on very simple tasks.

The research is good. This article however misses a lot of other work in the literature. I would recommend you don't read it as an authoritative source.

accidentallfact•5h ago
https://news.ycombinator.com/item?id=44073183
Voloskaya•5h ago
This article contains so many falsehoods and history rewrites that it's pretty painful to read.
rozab•5h ago
I just wrote a reply to a comment talking about the AI tells this writing has, but it got flagged so my comment disappeared when I hit post. I'll rephrase out of spite:

My first thought upon reading this was that an LLM had been instructed to add a pithy meme joke to each paragraph. They don't make sense in context, and while some terminally online people do speak in memes, those people aren't quoting doge in 2025.

There's also a sense of incoherence in the whole piece. For instance, this section:

"- after: 22 million videos + 1 million images (now we're talking)

they basically hoovered up everything: something-something v2, kinetics, howto100m, and a billion youtube videos"

Was it a billion vids or 22m? It turns out the latter sentence is just rephrasing the list of sources in a cool casual way, and the last one is called YT-Temporal-1B. That's a billion frames of video, not a billion videos.
