frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ornith-1.0: Self-scaffolding LLMs for agentic coding

https://deep-reinforce.com/ornith_1_0.html
47•kordlessagain•1d ago

Comments

SwellJoe•1d ago
I added this to a benchmark I've been doing of how well agents find security bugs, specifically security bugs originally found by Mythos. It performs poorly with only read/grep/ls tools, but in a follow-up test with a full shell and Python, it doubled its findings (still a poor showing, but it does at least indicate it is doing what it says on the tin: making tools to help it solve problems). It also did worse than Qwen AgentWorld, another recent post-train of Qwen 3.6 MoE intended for agentic use.

https://swelljoe.com/post/will-it-mythos/

kordlessagain•18h ago
Good to know. Thanks for the research!
Balinares•10h ago
I'd have expected this to get more HN attention. Qwen 3.6 35B capability in a 9B model is a bonkers claim.
chid•9h ago
I thought so too when I read the headline but I expect it's basically Qwen3.5-9B
juliangoldsmith•5h ago
It looks like they're comparing Orinth 9B to Qwen 3.5 35B, not Qwen 3.6. I guess it kind of makes sense since it's a finetune of 3.5, but I totally missed until I looked closely.

In my brief tests, Ornith 35B performed quite well. It won't replace DeepSeek V4 Flash for me, but if it was fast and cheap enough it might.

I don't remember being super impressed with Ornith 9B, but I could see it being on par with Qwen 3.5 35B.

nzach•10h ago
Instead of training the model to directly answer questions we trained the model to always write and execute the code that would solve the question ?

If that is the case, this isn't just a fancy way to perform prompt optimization?

.self: A new top-level domain designed to support self-hosting

https://hccf.onmy.cloud/2026/06/21/reclaiming-our-digital-selves-hccfs-vision-for-a-human-centere...
200•HumanCCF•3h ago•130 comments

Qwen 3.6 27B is the sweet spot for local development

https://quesma.com/blog/qwen-36-is-awesome/
509•stared•5h ago•444 comments

Free the Icons

https://weblog.rogueamoeba.com/2026/06/26/free-the-icons/
74•zdw•2d ago•11 comments

Is It Out Yet?

https://outyet.ai
26•partsch•1h ago•10 comments

Rocketlab acquires Iridium

https://investors.rocketlabcorp.com/news-releases/news-release-details/rocket-lab-acquire-iridium...
332•everfrustrated•8h ago•203 comments

Ornith-1.0: self-improving open-source models for agentic coding

https://github.com/deepreinforce-ai/Ornith-1
124•danboarder•5h ago•27 comments

Scientists find molecular-level evidence for two structures in liquid water

https://phys.org/news/2026-06-scientists-molecular-evidence-liquid.html
9•wglb•40m ago•1 comments

A native graphical shell for SSH

https://probablymarcus.com/blocks/2026/06/28/native-graphical-shell-for-SSH.html
211•mrcslws•7h ago•96 comments

WATaBoy: JIT-Ing Game Boy Instructions to WASM Beats a Native Interpreter

https://humphri.es/blog/WATaBoy/
163•energeticbark•7h ago•24 comments

Wallace the 6 inch f/2.8 telescope, building it, and hiking with it

https://lucassifoni.info/blog/hiking-with-wallace/
89•chantepierre•3d ago•13 comments

JumpServer: Open-Source Privileged Access Management

https://github.com/jumpserver/jumpserver
44•neitsab•3h ago•11 comments

US Supreme Court rules geofence warrants require constitutional protections

https://www.theguardian.com/us-news/2026/jun/29/supreme-court-geofence-warrants-case-decision
373•cdrnsf•7h ago•174 comments

Micro-Agent: Beat Frontier Models with Collaboration Inside Model API

https://vllm.ai/blog/2026-06-29-micro-agent-frontier-models
40•matt_d•4h ago•11 comments

What happens when you run a CUDA kernel?

https://fergusfinn.com/blog/what-happens-when-you-run-a-gpu-kernel/
190•mezark•9h ago•24 comments

Apple Neural Engine: Architecture, Programming, and Performance

https://arxiv.org/abs/2606.22283
77•Jimmc414•1d ago•9 comments

Working With AI: A concrete example

https://htmx.org/essays/working-with-ai/
61•comma_at•8h ago•23 comments

South Korea to spend $1T on more memory chip production and humanoid robots

https://arstechnica.com/ai/2026/06/south-korea-to-spend-1t-on-more-memory-chip-production-and-hum...
15•jnord•38m ago•0 comments

30-year sentence for transporting zines is a five-alarm fire for free speech

https://theintercept.com/2026/06/26/daniel-sanchez-estrada-zines-prairieland-free-speech/
159•xrd•1d ago•63 comments

Ornith-1.0: Self-scaffolding LLMs for agentic coding

https://deep-reinforce.com/ornith_1_0.html
47•kordlessagain•1d ago•6 comments

European ISPs Want Rightsholders Held Accountable for Overblocking Damage

https://torrentfreak.com/european-isps-want-rightsholders-held-accountable-for-overblocking-damage/
319•Brajeshwar•6h ago•83 comments

Dark Sky Lighting

https://www.savingourstars.org/darkskylighting#whatisdarkskylighting
118•alexandrehtrb•4d ago•16 comments

One million passports leaked online

https://cambridgeanalytica.org/data-breaches-scandals/passports-driver-licenses-exposed-public-in...
81•jruohonen•1d ago•54 comments

Sandia National Labs SA3000 8085 CPU

https://www.cpushack.com/2026/06/03/sandia-national-labs-sa3000-8085-cpu/
151•rbanffy•12h ago•38 comments

You Don't Know Jack About Formal Verification

https://queue.acm.org/detail.cfm?id=3819084
84•eatonphil•8h ago•36 comments

Font-Family Recommendations

https://chrismorgan.info/font-family
41•birdculture•3d ago•12 comments

Venetian Bridge Brawls in 17th and 18th Century Art

https://publicdomainreview.org/collection/venice-bridge-fights/
50•pepys•3d ago•28 comments

Rebuilding the Computer Room

https://alexwlchan.net/2026/computer-room/
87•ingve•11h ago•45 comments

Is sunscreen the new margarine? (2019)

https://www.outsideonline.com/health/wellness/sunscreen-sun-exposure-skin-cancer-science/
57•markgavalda•17h ago•56 comments

Samsung, SK Hynix, Micron Sued in US over Memory Price Fixing

https://en.sedaily.com/international/2026/06/29/samsung-sk-hynix-micron-sued-in-us-over-memory-pr...
322•donohoe•11h ago•156 comments

Instagram is incorporating users' photos in ads for Meta Glasses

https://twitter.com/i/status/2071277885646868536
311•notRobot•9h ago•135 comments