LLMs Are Bad Judges. So Use Our Classifier Instead

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5331811
41•lordgrenville•4mo ago

Comments

adlumal•4mo ago
Having done some work in the legal AI field, I wonder how this classifier deals with issues of transparency, explainability, and ultimately trust? It’s valuable to have some idea of how proceedings might unfold, but in my experience most competent lawyers set a high bar when it comes to trusting any AI/ML output.
Taikonerd•4mo ago
I was worried about explainability, too. If the classifier just spat out "INNOCENT" or "GUILTY," it would be useless -- the legal reasoning has to be part of the output.

Looking at the paper, the classifier definitely does output its reasoning:

"The legal issue at hand is whether the 50/50 royalty split in the 1961 contract binds only pre-existing affiliates or if it also includes affiliates that come into being after the agreement..."

leobg•4mo ago
This reads like an ad for Arbitrus.ai. It’s copywriting lingo:

> We built one called Arbitrus. We put it through a mini-Choi test and it mopped the floor with the competition

lordgrenville•4mo ago
True, but they own that:

> Declaration of Interest: [Authors] have financial interests in...Arbitrus.ai. As the title would suggest, the authors are making no effort to obfuscate this fact.

causal•4mo ago
I'd argue that dressing an ad up as an academic paper is obfuscation.
Taikonerd•4mo ago
I had thought: what's the business model for Arbitrus? Is it going to be a sort of "suggested finding" tool for judges? Or are law firms going to use it to screen cases, so they can pick winners?

It seems like the answer is neither: on their website, Arbitrus.ai says it's for private arbitration. "Arbitrus is a private court system with an AI judge. Why use the public court system or expensive AAA arbitration to settle your disputes, when you can do it faster, cheaper, and better with Arbitrus?"

nextaccountic•4mo ago
What kind of classifier is this? I mean is it k-NN (for example), or something else?

Even LLMs can be viewed as classifiers, as the paper (ad?) itself admits.

esafak•4mo ago
p. 36: "This is proprietary and part of Fortuna’s moat, so we explain it to the extent appropriate."
opwieurposiu•4mo ago
I love the idea of Arbitrus.ai, but they want $2500 a go to test it. I wish they had a demo version to play with.
barbazoo•4mo ago
The margins and line spacing make this hard to read. Is this how you're supposed to typeset a paper? Some pages have three, maybe four sentences on them.
LawKek•3mo ago
Related to this post: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5587611