frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Project Fetch: Phase Two

https://www.anthropic.com/research/project-fetch-phase-two
31•stopachka•2h ago

Comments

bob778•1h ago
> Preliminary trials with Claude Mythos Preview showed that it would not provide an apples-to-apples comparison with other models because of how we had set up the experiment and how the model was served.

What does this mean? My guess is they couldn’t co-locate Mythos close enough to reduce latency?

(I’m assuming this experiment pre-dates the export controls)

georgemcbay•1h ago
> My guess is they couldn’t co-locate Mythos close enough to reduce latency?

I doubt network latency is the reason. Even when connecting from literally across the world network latency is lost in the noise of overall response latency of even fast models.

The overall response latency of the model very well could have been the difference, though. AFAIK Mythos is structured to do relatively slow "deep thinking".

bannable•1h ago
Depending on the timeline, it could be that they're not allowed to access Mythos because of something like non-US citizens on the team or the lack of some way for them to meet the constraint DOD has them under.
georgemcbay•53m ago
I strongly suspect if that was the case they would have just directly mentioned that Mythos couldn't be used because of that reason, it would be less confusing and less suspect messaging than saying it wasn't an "apples-to-apples comparsion".
joshu•1h ago
stop trying to make fetch happen
jascha_eng•1h ago
This mostly reads as a comparison between Opus 4.7 and 4.1 it would be more interesting if they reran the experiment against a team of humans with 4.7 and see how much the humans still improve the results today.
etchalon•1h ago
Do you want Terminators? Because this is how you get Terminators.
didibus•46m ago
I'm getting a bit tired of these disguised adverts.

Here's how non robotics engineers used AI to do a short robot integration task faster than other non robotics engineers without AI.

Where "better" mostly means faster, and who knows what happens on longer horizons, with actual robotics experts, robustness requirements, or tasks where the hard part is control rather than API spelunking.

dragonwriter•36m ago
> I'm getting a bit tired of these disguised adverts.

Its not disguised. Corporate blogs exist overtly to promote the company and its work.

Disguised promotions where notionally independent media publish promotional pieces as news concealing that they were fed to them by party whose products they promote area thing, but this is just the most overt undisguised promotion.

rvz•8m ago
> Its not disguised. Corporate blogs exist overtly to promote the company and its work.

It is. That makes the "research" heavily biased. If xAI did the same thing, with Elon Musk screaming about that it is "AGI", you would not believe them at all.

Given that the work is not independent, such articles of this "research" can easily be manipulated or the results being massaged to promote the company positively.

But when others outside of the company try out the work or reproduce it, they get different results. So of course we continue to hear unverified research especially in AI when the frontier labs do not release their architecture, weights at all.

So I would not straight up believe results from the first party source unless multiple sources outside of the company have verified it.

Renting a sewing machine from the library

https://www.bbc.com/future/article/20260618-the-weird-and-wonderful-libraries-of-finland
112•sohkamyung•3h ago•52 comments

Epoll vs. io_uring in Linux

https://sibexi.co/posts/epoll-vs-io_uring/
62•Sibexico•3h ago•19 comments

Show HN: TownSquare, a tiny presence layer for websites

https://townsquare.cauenapier.com/
80•cauenapier•14h ago•26 comments

15-minute at-home Lyme disease tick test

https://www.bostonglobe.com/2026/06/17/business/lyme-disease-tick-test/
44•bookofjoe•2d ago•13 comments

Slow breathing modulates brain function and risk behavior

https://www.cell.com/neuron/fulltext/S0896-6273(26)00339-9
68•croes•4h ago•8 comments

Loupe – A iOS app that raises awareness about what native apps can see

https://github.com/mysk-research/loupe
78•Cider9986•14h ago•18 comments

'We had to get out of the way': The backlash over delivery robots

https://www.bbc.com/news/articles/c0rygp005wjo
35•higginsniggins•2h ago•28 comments

SMPTE Makes Its Standards Freely Accessible

https://www.smpte.org/blog/smpte-makes-its-standards-freely-accessible-openingstandards-library-t...
235•zdw•9h ago•65 comments

Alice is impatient

https://brooker.co.za/blog/2026/06/19/waiting.html
62•birdculture•6h ago•17 comments

Unauthorized alert sent to cell phones across Brazil

https://www.cnn.com/2026/06/20/americas/brazil-hackers-unauthorized-alert-latam
94•zdw•6h ago•67 comments

UHF X11: X11 Built for VisionOS and Apple Vision Pro

https://www.lispm.net/apps/uhf-x11/
173•zdw•9h ago•30 comments

DOS Game "F-15 Strike Eagle II" reversing project needs DOS test pilots

https://neuviemeporte.github.io/f15-se2/2026/06/20/needyou.html
212•LowLevelMahn•11h ago•58 comments

When I reject AI code even if it works

https://vinibrasil.com/when-i-reject-ai-code-even-if-it-works/
34•vnbrs•1h ago•13 comments

CSSQuake

https://cssquake.com/
466•msalsas•15h ago•101 comments

Project Fetch: Phase Two

https://www.anthropic.com/research/project-fetch-phase-two
31•stopachka•2h ago•10 comments

Developers don't understand CORS (2019)

https://fosterelli.co/developers-dont-understand-cors
7•toilet•1h ago•1 comments

Semiconductor Lifeline Keeps Fighter Jets in the Air

https://spectrum.ieee.org/phoenix-semiconductors-legacychips-oems
41•rbanffy•4d ago•11 comments

Moving Beyond Fork() + Exec()

https://lwn.net/Articles/1076018/
7•signa11•2d ago•1 comments

PostgresBench: A Reproducible Benchmark for Postgres Services

https://clickhouse.com/blog/postgresbench
83•saisrirampur•7h ago•22 comments

Whole cross-sectional human ultrasound tomography

https://www.nature.com/articles/s41551-026-01660-4
30•lnyan•2d ago•4 comments

Show HN: Make PDFs look scanned (CLI or in the browser via WASM)

https://github.com/overflowy/make-look-scanned
95•overflowy•8h ago•47 comments

Linux eliminates the strncpy API after six years of work, 360 patches

https://www.phoronix.com/news/Linux-7.2-Drops-strncpy
104•simonpure•5h ago•78 comments

Inference cost at scale with napkin math

https://injuly.in/blog/napkin-inference-cost/index.html
63•gmays•4d ago•14 comments

Show HN: StartupWiki – A Free Alternative to Crunchbase

https://startupwiki.tech/
162•shpran•10h ago•55 comments

Temporary Cloudflare accounts for AI agents

https://blog.cloudflare.com/temporary-accounts/
178•farhadhf•15h ago•97 comments

The Wholesale Plagiarism of Obscure Sorrows

https://waxy.org/2026/06/the-wholesale-plagiarism-of-obscure-sorrows/
325•ridesisapis•8h ago•136 comments

White House delays US voting-machine vulnerability report

https://www.reuters.com/world/white-house-delays-release-us-voting-machine-study-midterms-near-20...
39•logickkk1•1h ago•26 comments

The rise of South Korea’s weapons business

https://www.politico.com/news/magazine/2026/06/20/south-korea-weapons-dealer-trump-00959559
118•JumpCrisscross•15h ago•42 comments

Bun has an open PR adding shared-memory threads to JavaScriptCore

https://github.com/oven-sh/WebKit/pull/249
116•gr4vityWall•9h ago•219 comments

Supermarket giant Tesco sues VMware for breach of contract (2025)

https://www.theregister.com/software/2025/09/03/supermarket-giant-tesco-sues-vmware-for-breach-of...
96•wglb•5h ago•26 comments