frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Why do LLMs attend to the first token?

https://arxiv.org/abs/2504.02732
1•adhi01•2m ago•0 comments

Lead Has Turned into Gold: Breakthrough At The Large Hadron Collider

https://abcnews.go.com/Technology/scientists-turn-lead-gold-1st-time-split/story
1•handfuloflight•6m ago•1 comments

Northeastern student demanded tuition refund after professor used ChatGPT

https://fortune.com/2025/05/15/chatgpt-openai-northeastern-college-student-tuition-fees-back-catching-professor/
1•bsoles•7m ago•0 comments

A Brief History of MySQL Replication

https://altmannmarcelo.medium.com/a-brief-history-of-mysql-replication-85f057922800
1•marceloaltmann•7m ago•1 comments

Etcd v3.6.0

https://etcd.io/blog/2025/announcing-etcd-3.6/
1•programd•17m ago•0 comments

Microagents accelerated their AI agent platform launch by embedding Pipedream's

https://pipedream.com/blog/microagents-accelerated-their-ai-agent-platform-launch-by-embedding-pipedreams-integrations/
1•todsacerdoti•20m ago•0 comments

Software engineer lost his $150K-a-year job to AI – living in a trailer

https://www.yahoo.com/news/software-engineer-lost-150k-job-090000839.html
1•msolujic•20m ago•1 comments

Echidna mothers change their pouch microbiome to protect tiny puggles

https://www.theguardian.com/environment/2025/may/16/echidna-mothers-change-their-pouch-microbiome-to-protect-tiny-pink-jelly-bean-puggles-new-research-finds
1•anotherevan•22m ago•0 comments

Trump's sanctions on ICC prosecutor have halted tribunal's work

https://abcnews.go.com/International/wireStory/trumps-sanctions-icc-prosecutor-halted-tribunals-work-121824057
8•oever•23m ago•2 comments

Show HN: Hyprsqrl – open-source revolut you can self-host (live iban/sepa/ACH)

https://github.com/different-ai/hyprsqrl
1•ben_talent•23m ago•0 comments

Can We Create Real Consciousness?

https://lopanpaol.substack.com/p/can-we-create-real-consciousness
1•lopanapol•28m ago•2 comments

Canon Cat Emulation

https://archive.org/details/canoncat
1•fzzzy•30m ago•0 comments

Cutting the Cord: Addressing the movement's dependence on Big Tech [pdf]

https://mayfirst.coop/files/cutting-the-cord.en.pdf
1•SamWhited•32m ago•0 comments

The hidden consequences of being banned from dating apps

https://mashable.com/article/what-happens-when-youre-banned-from-dating-apps
1•avonmach•33m ago•0 comments

Show HN: Switchpoint AI – Cut LLM Cost and Improve Performance with Auto Routing

https://www.switchpoint.dev/
3•hmichaelson20•35m ago•0 comments

Runbear helps teams onboard AI like a new hire–with help from Pipedream Connect

https://pipedream.com/blog/runbear/
1•todsacerdoti•37m ago•0 comments

Show HN: SpeedSketch – Practice figure drawing with timed sketches

https://speedsketch.co.uk
1•fdf515•38m ago•0 comments

Progress Report – Linux 6.15

https://asahilinux.org/2025/05/progress-report-6-15/
1•Conan_Kudo•38m ago•0 comments

Cold War's Greatest Spy Weapons and Gadgets: Covert Tools of Espionage Masters

https://rarehistoricalphotos.com/spy-weapons-gadgets-from-cold-war/
1•bookofjoe•39m ago•0 comments

Learning to Learn

https://www.charliehb.com/blog/01-learning-to-learn
1•cbrinicombe•43m ago•0 comments

'We Are the Most Rejected Generation'

https://www.nytimes.com/2025/05/15/opinion/rejection-college-youth.html
3•tkgally•44m ago•0 comments

Europe's costly military pensions complicate defense buildup

https://www.japantimes.co.jp/news/2025/05/08/world/politics/europe-military-pensions-defense-complicate/
2•PaulHoule•46m ago•0 comments

Malicious 'Checker' Packages on PyPI Probe TikTok and Instagram for Valid

https://socket.dev/blog/malicious-checker-packages-on-pypi-probe-tiktok-and-instagram
2•feross•46m ago•0 comments

Decomposition of Redundant Flows Through Compound AI Systems

https://pandyamarut.github.io/res/posts/1/
1•mwiki02•46m ago•0 comments

Understanding task types in the Gemini Embedding API

https://technicalwriting.dev/ml/embeddings/tasks/index.html
1•kaycebasques•48m ago•0 comments

Mathematical Beauty, Truth and Proof in the Age of AI

https://www.quantamagazine.org/mathematical-beauty-truth-and-proof-in-the-age-of-ai-20250430/
1•ngrislain•49m ago•0 comments

GPT 4.1 Prompting Guide

https://cookbook.openai.com/examples/gpt4-1_prompting_guide
2•sanj•51m ago•0 comments

Kids Online Safety Act Will Make the Internet Worse for Everyone

https://www.eff.org/deeplinks/2025/05/kids-online-safety-act-will-make-internet-worse-everyone
9•mdp2021•51m ago•4 comments

We built an AI that calls you to keep you accountable and reach your goals

https://www.commitify.me/
2•MustafaYenler•51m ago•1 comments

Quantifying the trade-offs between renewable energy visibility and system costs

https://www.nature.com/articles/s41467-025-59029-1
1•gnabgib•52m ago•0 comments
Open in hackernews

The current state of TLA⁺ development

https://ahelwer.ca/post/2025-05-15-tla-dev-status/
70•todsacerdoti•4h ago

Comments

femto2151•3h ago
I think the major contribution that can be made to TLA is in the field of divulgation. Sofware engineers don't know modern formal methods because for years they were pidgeonholed into safety critical systems. Universities are not teaching them anymore despite having being successfully applied since the 2010s to the development of so many cloud systems that other companies can rely on.
amenghra•2h ago
"I think we can increase the TLC model checker throughput to 1 billion states per minute (1000x speedup) by writing a bytecode interpreter. C"

I never used TLC with a large model, but I bet making the tool faster would make it more useful to lots of people.

I wonder what the speedup would be if the code targeted a GPU?

PessimalDecimal•20m ago
Agreed.

But I wonder if the logic of model checking is actually amenable to vectorization. I suspect not really, even for something basic like checking safety properties where you could try to shard the state space across cores. There is still likely to be some synchronization that is needed that eliminates the benefits. A cheaper way to test it would be to look to vectorize on the CPU first.

For a pure hardware based speedup, if there is effort to transcompile TLA+ specs to C++, there could then be a further step to transcompile that to say Verilog and try to run the model checking on an FPGA. That _might_ pay off.

alfalfasprout•2h ago
TBH, as cool as TLA+ is, the biggest issue I generally see with trying to use formal methods in practice is that you need to keep the specification matching the actual implementation. Otherwise whatever you've formally verified doesn't match what actually got built.

So formal methods may be used for extremely critical parts of systems (eg; safety critical systems or in embedded where later fixes cannot be readily rolled out) but they fail to make inroads in most other development because it's a lot of extra work.

ahelwer•2h ago
Hillel Wayne wrote a post[0] about this issue recently, but on a practical level I think I want to address it by writing a "how-to" on trace validation & model-based testing. There are a lot of projects out there that have tried this, where you either get your formal model to generate events that push your system around the state space or you collect traces from your system and validate that they're a correct behavior of your specification. Unfortunately, there isn't a good guide out there on how to do this; everybody kind of rolls their own, presents the conference talk, rinse repeat.

But yeah, that's basically the answer to the conformance problem for these sort of lightweight formal methods. Trace validation or model-based testing.

[0] https://buttondown.com/hillelwayne/archive/requirements-chan...)

sterlind•2h ago
Plus my Kayfabe system [0], which was partly inspired by Ron Pressler's article on trace validation:

0. https://conf.tlapl.us/2020/11-Star_Dorminey-Kayfabe_Model_ba...

lovich•17m ago
why are the lower case L's in that document bolded? a different weight? Not sure what the right technical change term is for the visual difference but it was extremely noticeable immediately upon opening the document
yuppiemephisto•2h ago
What do you think of embedding it in a formal system like Lean as a frontend?
jazzyjackson•47m ago
On the other hand, how many million man hours are spent re inventing the wheel that could instead be spent contributing to a library of extremely well-specified wheels?
cmrdporcupine•2h ago
"The 2025 TLA⁺ Community Event was held last week on May 4th at McMaster University in Hamilton, Ontario, Canada. "

Damn. Happened a 10 minute drive from my house and I didn't even know about it.

TLA+ is on the infinite bucket list for me. I'm sure like many others, I know the value of learning and applying formal verification, but it feels impenetrable knowing really how to jump in.

layer8•2h ago
The article doesn’t mention PlusCal. What is the future of that, will it co-evolve with TLA⁺?
dgan•1h ago
about a year ago, at my job, i wrote a spec for authentication in TLA+, and while writing it, i discovered a bug/attack vector, which would allow an attacker to basically bypass the double-authentication.

It surely did produce a fancy, mathy PDF which I proudly shown and explained to my team, but honestly, a little duck-talking would have permitted to find the same bug without TLA+

For aome context, it's a CRUD api + web interface for external clients, nothing too complicated, and I really wanted to try TLA+ in real life

pron•29m ago
> I think we can increase the TLC model checker throughput to 1 billion states per minute (1000x speedup) by writing a bytecode interpreter.

Truffle [1] can convert an interpreter to a JIT compiler -- there's no need to invent bytecode, and instead of an interpreter you get compilation to native, and it's easy to add intrinsics in Java; optimisations can be added gradually over time. This would probably be the most effective strategy, even in general, and certainly compared to cost.

[1]: https://www.graalvm.org/latest/graalvm-as-a-platform/languag...

PessimalDecimal•28m ago
I'm very happy to see this!

A few years ago I tried making some contributions to the TLC codebase. It was definitely "academic code," lacking tests, reinvesting basic structured instead of using them from libraries, and largely the work of a single contributor with seemingly no code reviews for commits. I was motivated to try to help improve things and wanted to get a sense for what that would be like by sending a small PR to get a feeling for working with the code owners. They basically stonewalled me. It was odd.

geertj•6m ago
My current thinking on model checking (still evolving):

Modeling languages are useful to check the correctness of an algorithm during development. During development, a model can also serve as a specification for the actual implementation. This requires that your the modeling language is readable to a broad range of developers, which TLA+ is not. We have been experimenting with FizzBee (fizzbee.io) which looks promising in this regards.

When you go to prod, you really want to test your actual implementation, not a model of it. For this you want something like https://github.com/awslabs/shuttle (for Rust), or https://github.com/cmu-pasta/fray (for Java). Or use something custom.