frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How do I use LLMs to generate test cases for groundedness benchmarks?

https://devblogs.microsoft.com/ise/intuitive-evaluation-framework-for-agentic-chatbots/
1•this_steve_j•2h ago

Comments

this_steve_j•2h ago
What are some ways to avoid common methological pitfalls when generating test cases for "groundedness" benchmarks with automation?

Confirmation bias is one obvious pitfall that comes to mind, but also I wonder how it is possible to achieve reproducibility when the input is stochastic.

Ace Frehley RIP [video]

https://www.youtube.com/watch?v=DXeeY9D9u94
1•1vuio0pswjnm7•1m ago•0 comments

AI's Effect on the US Economy Is Exaggerated

https://www.bloomberg.com/opinion/articles/2025-10-15/ai-effect-on-us-economy-is-exaggerated
1•alephnerd•2m ago•0 comments

Pkij: Single-file, zero-dependency CLI tool designed for managing monorepos

https://github.com/iyioio/pkij
1•handfuloflight•2m ago•0 comments

ICE, Border Patrol agents to receive pay during government shutdown

https://www.reuters.com/world/us/some-federal-law-enforcement-receive-pay-during-government-shutd...
4•clanky•6m ago•0 comments

How bacterial quorum sensing slows wound healing

https://today.ucsd.edu/story/bacterial-chatter-slows-wound-healing
2•hhs•8m ago•0 comments

Weird font rendering in Chrome on the SWR site

https://imgur.com/a/AN6UCod
1•razodactyl•8m ago•1 comments

H-1B Visa Holders Disappear from US Housing Market

https://www.newsweek.com/h-1b-visa-holders-disappear-from-us-housing-market-10882216
1•thelastgallon•10m ago•0 comments

Code Canvas App

https://marketplace.visualstudio.com/items?itemName=alex-c.code-canvas-app
1•handfuloflight•11m ago•0 comments

Discord-Markdown-Previewer.pro

https://discord-markdown-previewer.pro/
1•jiduhe•20m ago•0 comments

Palmer Luckey on Joe Rogan [video]

https://www.youtube.com/watch?v=-9LFj6YOK2U
1•neko_ranger•24m ago•0 comments

Renaming the default branch of Rust-lang/rust

https://blog.rust-lang.org/inside-rust/2025/10/16/renaming-the-default-branch-of-rust-langrust/
4•sergiotapia•26m ago•0 comments

Welcome to Central Air

https://www.centralairpodcast.com/p/welcome-to-central-air
1•paulpauper•26m ago•0 comments

Observations on AI and the Capital Markets in 2025

https://vinaysridhar.com/ai-capital-markets-reflections.html
1•paulpauper•29m ago•0 comments

What does a Nobel Prize on 'innovation-driven economic growth' reward?

https://beatricecherrier.wordpress.com/2025/10/13/what-does-a-nobel-prize-on-innovation-driven-ec...
1•paulpauper•29m ago•0 comments

Huel Is Fine

https://thebsdetector.substack.com/p/huel-is-fine
1•mirabilis•35m ago•0 comments

Art Must Act

https://aeon.co/essays/harold-rosenberg-exhorted-artists-to-take-action-and-resist-cliche
1•tintinnabula•41m ago•0 comments

The Great Butterfly Heist

https://www.theguardian.com/global/2025/oct/04/great-butterfly-heist-how-collector-stole-thousand...
1•lermontov•42m ago•0 comments

E-Waste Recycling with Deep Eutectic Solvents

https://www.descycle.com/technology
1•dillonshook•45m ago•0 comments

Intellectualism Has Hampered Generative Art (2018)

https://www.tylerxhobbs.com/words/intellectualism-has-hampered-generative-art
1•aaronbrethorst•46m ago•0 comments

Vault of Horror – The Inside Mac cover you never saw

https://folklore.org/Vault_of_Horror.html
2•stmw•46m ago•0 comments

Kanchha Sherpa, last surviving member of first team to scale Everest, dies 92

https://www.pbs.org/newshour/world/kanchha-sherpa-last-surviving-member-of-history-making-mount-e...
1•1659447091•49m ago•1 comments

Death by a Thousand Reports

https://datamethods.substack.com/p/death-by-a-thousand-reports
1•zekrom•50m ago•0 comments

The Status Trap in Tech

https://datamethods.substack.com/p/the-status-trap-in-tech
1•zekrom•50m ago•0 comments

Book review: The game that never ends: How lawyers shape the videogame industry

https://networks.h-net.org/group/reviews/20126596/xiao-mailland-game-never-ends-how-lawyers-shape...
1•hhs•52m ago•0 comments

Where AI Coding Agents Go to Die

https://chatbotkit.com/reflections/where-ai-coding-agents-go-to-die
2•_pdp_•55m ago•0 comments

Too many scientific 'discoveries' get discredited

https://www.japantimes.co.jp/commentary/2025/10/01/world/scientific-discoveries-get-discredited/
1•PaulHoule•55m ago•0 comments

Ask HN: Should I still not upgrade to tahoe (macOS 26)?

1•nidnogg•57m ago•1 comments

From Web2 to Web3: Building Decentralized Front Ends with Wagmi

https://jsdev.space/wagmi-react/
1•javatuts•1h ago•0 comments

Is creative destruction on the decline?

https://www.ft.com/content/93f0fc4a-eab5-4364-835a-15530379436f
1•hhs•1h ago•0 comments

End of the World?

1•Toby1VC•1h ago•0 comments