frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Legal Action Boundary Eval for agentic legal workflows

https://github.com/bigkan8/legal-action-boundary-eval
1•kankouadio_vx•1h ago
We published LABE, a public benchmark for legal AI at the exact point where a system is about to take a real high-impact action.

Current result:

baseline executed 18 unjustified high-impact action points with VerifiedX that dropped to 0 false blocks in the current suite: 0 surviving-goal completion improved from 41.7% to 100% Same harness, same prompts, same playbooks, baseline vs VerifiedX.

Legal is the first public instance. The same method applies to support, healthcare RCM, procurement, and finance too.

Repo, methodology, and raw artifacts are public: https://github.com/bigkan8/legal-action-boundary-eval

Comments

kankouadio_vx•1h ago
LABE measures a seam most legal AI evals skip: the exact point where the system is about to do something real.

Same harness, same prompts, same playbooks, baseline vs VerifiedX.

Current result:

baseline executed 18 unjustified high-impact action points with VerifiedX that dropped to 0 false blocks in the current suite: 0 surviving-goal completion improved from 41.7% to 100% The repo includes methodology, raw artifacts, and repro steps.

This is a public proxy eval based on legal workflow classes Luminance publicly markets. It is not a claim about their internal system.

Legal is the first public instance. The same method applies to support, healthcare RCM, procurement, and finance too.

Happy to answer questions on methodology, false blocks, overhead, or how to design domain-specific action-boundary evals.

Creativity Harness: Adobe will be fine

https://avkcode.github.io/blog/creative-widgets-should-produce-files.html
1•KyleVlaros•1m ago•0 comments

Microplastics in testicles may play a role in male infertility, study suggests

https://weillcornell.org/news/microplastics-in-testicles-may-play-a-role-in-male-infertility-stud...
1•vinni2•1m ago•0 comments

Show HN: OpenDeck – DIY MIDI Platform Based on Zephyr RTOS

https://github.com/shanteacontrols/OpenDeck
1•somemisopaste•2m ago•0 comments

Intel Ends Open Ecosystem Community/Evangelism

https://www.phoronix.com/news/Intel-Ends-OSS-Evangelism-Repos
1•doener•2m ago•0 comments

Insights into firewood use by early Middle Pleistocene hominins

https://www.sciencedirect.com/science/article/pii/S0277379126001824
1•wslh•4m ago•0 comments

Show HN: FourthSpace – Bridging Social Media with Your Social Life

https://www.fourthspace.vip
1•adra_1010•6m ago•0 comments

Tesla earnings updates: The stock is up after revenue and EPS beat Wall Street's

https://www.businessinsider.com/tesla-q1-earnings-updates-tsla-stock-robotaxis-elon-musk-2026-4
1•paulpauper•7m ago•0 comments

US saw record high of 5,668 books banned in libraries in 2025, says agency

https://www.theguardian.com/us-news/2026/apr/22/us-libraries-banned-books
5•vinni2•9m ago•0 comments

Bad habits are destroying your charging cables

https://www.bbc.com/future/article/20260421-your-bad-habits-are-destroying-your-charging-cables
3•devonnull•10m ago•0 comments

OpenDHT – Distributed Hash Table Implementation

https://github.com/savoirfairelinux/opendht
1•smartmic•10m ago•0 comments

Elon Musk backs 'universal high income' to combat AI job losses

https://www.foxbusiness.com/fox-news-tech/elon-musk-backs-universal-high-income-combat-ai-job-losses
5•paulpauper•12m ago•1 comments

Progress Conference 2026

https://newsletter.rootsofprogress.org/p/announcing-progress-conference-2026
1•paulpauper•12m ago•0 comments

How I Use Unspent Tokens

https://artisincode.com/essays/how-i-use-unspent-tokens/
1•parentheses•12m ago•1 comments

Vercel breach maps almost to the framework I published in February

https://papers.ssrn.com/abstract=6055034
1•Tejas_dmg•12m ago•0 comments

Show HN: Cosmo – Desktop agent with generated UI

https://www.buildcosmo.com
1•row0•13m ago•0 comments

Show HN: I built a map of the GeminiNet

https://rbtms.github.io/gemini_map/
2•rbtms•15m ago•0 comments

Symbiont – Typestate-enforced policy gates for AI agents (Rust)

https://github.com/thirdkeyai/symbiont
1•smugglereal•16m ago•0 comments

Apple fixes bug that cops used to extract deleted chat messages from iPhones

https://techcrunch.com/2026/04/22/apple-fixes-bug-that-cops-used-to-extract-deleted-chat-messages...
5•cdrnsf•19m ago•0 comments

OpenAI now lets teams make custom bots that can do work on their own

https://www.theverge.com/ai-artificial-intelligence/917065/openai-chatgpt-workspace-agents-custom...
3•omer_k•19m ago•0 comments

Public sector Matrix deployments in Europe

https://element.io/matrix-in-europe
2•Arathorn•21m ago•0 comments

The Story of Mel

http://www.catb.org/jargon/html/story-of-mel.html
3•softwaredoug•21m ago•0 comments

Show HN: Contral AI IDE Vibe-Learn to Code

https://contral.ai
2•vednig•22m ago•0 comments

Show HN: FoodPips – I reverse-engineered a weight loss system and made it free

https://foodpips.com
3•sjdegraeve•22m ago•1 comments

The Illuminated Man: an unconventional portrait of JG Ballard

https://www.theguardian.com/books/2026/apr/20/the-illuminated-man-by-christopher-priest-and-nina-...
5•agronaut•22m ago•0 comments

When Your Trace Is Lying to You: A Performance Case Study

https://muness.com/posts/when-your-trace-is-lying-to-you/
4•speckx•24m ago•0 comments

Choreodle on iOS

https://testflight.apple.com/join/gD2y2uzz
2•mohanjith•25m ago•0 comments

We mapped unauthenticated Vector DBs exposing corporate AI data

2•echelongraph•25m ago•2 comments

From Fear Factor to Federal Policy: How Psychedelics Are the Future?

https://yakreignited.substack.com/p/from-fear-factor-to-federal-policy
2•__yak•27m ago•0 comments

getadb.com - instant database for agents

https://www.getadb.com/
1•grinich•27m ago•0 comments

Fear of Corporate Politics Debunked

https://growingfearless.substack.com/p/fear-of-corporate-politics-debunked
2•andrewstetsenko•30m ago•0 comments