Leanstral 1.5: Proof Abundance for All

https://mistral.ai/news/leanstral-1-5/
34•programLyrique•1h ago

Comments

boulos•44m ago
This is nice work, but I found the bug finding example to be weird:

> One such bug was in the sign function for zigzag decoding of the datrs/varinteger library. On input Std.U64.MAX, the expression (value + 1) overflowed, causing crashes in debug mode and silent corruption in release mode—an edge case that testing and fuzzing would typically miss.

In what way would this boundary condition case be considered something that "testing [...] would typically miss"? It's certainly something that bad tests would miss or not think about, but I find that (a) careful people and (b) ML coding systems are actually really good at "oh, I should test the extreme values". Especially for things that parse user input.

I'm curious if they found other bugs that were more interesting, but found them too hard to explain quickly.
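For anyone who wants to see the edge case concretely, here is a minimal sketch. It assumes the decoder used the common `(value + 1) / 2` form for odd inputs. The function names are hypothetical and not taken from datrs/varinteger. The naive version uses `checked_add` so the overflow shows up as `None` rather than a debug panic or a release-mode wraparound. The second version is the usual shift-and-XOR zigzag decode, which has no addition to overflow.

```rust
/// Hypothetical naive zigzag decode (NOT the library's actual code).
/// Odd inputs map to negatives via (value + 1) / 2, and `value + 1`
/// cannot be represented when value == u64::MAX.
fn zigzag_decode_naive(value: u64) -> Option<i64> {
    if value & 1 == 1 {
        // checked_add returns None on overflow instead of panicking
        // (debug) or wrapping to 0 (release).
        let half = value.checked_add(1)? / 2;
        Some(-(half as i64))
    } else {
        Some((value / 2) as i64)
    }
}

/// Overflow-free zigzag decode: shift out the sign bit, then XOR with
/// an all-ones mask when the sign bit was set.
fn zigzag_decode(value: u64) -> i64 {
    ((value >> 1) as i64) ^ -((value & 1) as i64)
}
```

A boundary test over `0`, `1`, and `u64::MAX` is enough to tell the two apart: they agree everywhere except `u64::MAX`, where only the second returns `i64::MIN`.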

pierrefermat1•28m ago
When you dogfood your AI slop all day, suddenly everything becomes impressive

satvikpendem•34m ago
I also submitted the HuggingFace link itself here: https://news.ycombinator.com/item?id=48779902

nullc•24m ago
It would be nice if special-purpose models provided some diverse examples of exactly the input required to get their expected performance on a mix of problem types. Maybe also a document intended for LLMs to read that advises on prompt construction.

I've found that you can get wildly different quality results from these sorts of models due to seemingly insignificant differences in prompt construction. It would be much easier to guess at what it wants if I could just see some RL transcripts -- and so the model author is in a much better position to provide initial advice.

Giant trees have no trouble pumping water to top branches

https://news.exeter.ac.uk/faculty-of-environment-science-and-economy/giant-trees-have-no-trouble-...
44•hhs•1h ago•19 comments

Odin, Wikipedia and Engagement Farming

https://katamari64.se/posts/2026/odin-wikipedia/
19•stock_toaster•43m ago•2 comments

SearXNG: A free internet metasearch engine

https://github.com/searxng/searxng
109•theanonymousone•3h ago•26 comments

The circuit that lets your brain think and see

https://www.engineering.columbia.edu/about/news/circuit-lets-your-brain-think-and-see
16•hhs•1h ago•1 comment

Steam Controller Auto-Charge – pilot to magnetic charging puck using CV

https://github.com/FossPrime/Steam-Controller-Auto-Charge
31•zdw•1h ago•5 comments

Amsterdam invented the fire department

https://worksinprogress.co/issue/how-amsterdam-invented-the-fire-department/
25•zdw•1h ago•6 comments

Dispersion loss counteracts embedding condensation in small language models

https://chenliu-1996.github.io/projects/LM-Dispersion/
15•E-Reverance•1h ago•3 comments

Jamesob's guide to running SOTA LLMs locally

https://github.com/jamesob/local-llm
256•livestyle•9h ago•121 comments

Espionage Against the European Parliament

https://citizenlab.ca/research/member-of-committee-investigating-spyware-hacked-with-pegasus/
248•ledoge•3h ago•62 comments

GLM5.2 on AMD MI355X at 2626 tok/s/node at over 2x lower cost than Blackwell

https://www.wafer.ai/blog/glm52-amd
38•latchkey•2h ago•9 comments

Infracost (YC W21) Is Hiring a Marketing Lead to Shift FinOps Left

https://www.ycombinator.com/companies/infracost/jobs/YTJcFwr-marketing-lead
1•akh•3h ago

Applied Category Theory Course (2018)

https://math.ucr.edu/home/baez/act_course/index.html
38•measurablefunc•3h ago•5 comments

We put a Redis server inside our runtime

https://encore.dev/blog/redis-runtime
13•eandre•2d ago•5 comments

New serious vulnerabilities spiked around release of Claude Mythos Preview

https://epoch.ai/data-insights/cve-severity-spike
21•cubefox•2h ago•6 comments

FreeBSD ate my RAM

https://crocidb.com/post/freebsd-ate-my-ram/
80•theanonymousone•4h ago•32 comments

Africans Are Turning to Starlink

https://www.economist.com/middle-east-and-africa/2026/07/02/africans-are-turning-to-starlink
79•bookofjoe•2h ago•71 comments

International chess federation sanctions Kramnik

https://www.fide.com/fide-ethics-disciplinary-commission-issues-a-decision-in-case-involving-gm-v...
110•DarkContinent•7h ago•58 comments

Costco is the anti-Amazon

https://phenomenalworld.org/analysis/the-anti-amazon/
261•bookofjoe•8h ago•247 comments

Factories are just rooms

https://interconnected.org/home/2026/07/03/factories
179•arbesman•8h ago•73 comments

Software, from First Principles

https://fazamhd.com/mental-models/software/
16•faza•2h ago•6 comments

Hunting a 16-year-old SQLite WAL bug with TLA+

https://ubuntu.com/blog/hunting-a-16-year-old-sqlite-bug-with-tla-is-dqlite-affected
163•peterparker204•3d ago•12 comments

GitFut – Your GitHub stats turned into a World-Cup-style player card

https://gitfut.com
6•redbell•1h ago•4 comments

Show HN: Mcpsnoop – Wireshark for MCP (transparent proxy and live TUI)

https://github.com/kerlenton/mcpsnoop
45•kerlenton•7h ago•13 comments

Wordgard: In-browser rich-text editor from the creator of ProseMirror

https://wordgard.net/
255•indy•15h ago•90 comments

PostgreSQL and the OOM killer: Why we use strict memory overcommit

https://www.ubicloud.com/blog/postgresql-and-the-oom-killer-why-we-use-strict-memory-overcommit
150•furkansahin•11h ago•85 comments

I Wasn't Allowed Prompting ChatGPT During My Chalk Talk: This Is Discrimination (2025)

https://inpreparation.substack.com/p/opinion-i-was-not-allowed-to-type
134•theanonymousone•6h ago•71 comments

A peek into Reddit's anti-spam internals

https://lyra.horse/blog/2026/06/reddit-spam-internals/
154•OuterVale•6d ago•56 comments

Valve open-source the Steam Machine e-ink screen so you can make your own

https://www.gamingonlinux.com/2026/07/valve-open-source-the-steam-machine-e-ink-screen-so-you-can...
527•ahlCVA•11h ago•97 comments

Ask HN: Is anyone experimenting with different ways of using LLMs for coding?

123•yehiaabdelm•17h ago•149 comments