Why is it hard to evaluate GenAI applications?

https://andreagao.com/posts/genai-evaluation-challenge/
3•gytrcrt•9h ago

Comments

PaulHoule•9h ago
What I noticed in the 2010s was that there was very little enthusiasm for doing evaluation in information retrieval or classical ML, even though it was often straightforward to do.

Interest in eval has skyrocketed in the LLM age, just as interest in vector databases has. People finally see enough value in an ML system to make eval work worthwhile, but... it's much harder!

gytrcrt•8h ago
I think the difference is: 1. there was no hallucination from information retrieval or classic ML back in the 2010s; 2. there was way lower engagement from the general public, or even regulators, with classic ML systems. In other words, people were not able to directly "talk" to an ML system the way they can with ChatGPT.

These two points combined drive way more scrutiny of GenAI models and apps.

Ask HN: Do we need a language designed specifically for AI code generation?

1•baijum•2m ago•0 comments

Good pixel art can be one-shotted by AI now

https://gametorch.app/collections/7
2•gametorch•10m ago•1 comment

I dream of roombas: 1000s of automated AI robots that autonomously maintain code

https://ghuntley.com/ktlo/
2•ghuntley•16m ago•0 comments

China Kicks Off Human Testing of Implantable Brain-Computer Interface Devices

https://www.yicaiglobal.com/news/china-kicks-off-human-testing-of-implantable-brain-computer-interface-devices
1•gametorch•23m ago•0 comments

Why are front end dev demand so high if front end development is easier? (2012)

https://simonwillison.net/2012/Feb/13/why-are-front-end/
8•thunderbong•24m ago•0 comments

A Novel "Reasoning"-Enhancing Technique for Large Language Models

https://marqcodes.com
1•N3Xxus_6•30m ago•2 comments

Astonishing discovery by computer scientist: how to squeeze space into time [video]

https://www.youtube.com/watch?v=p_AW6fomKPI
1•drhodes•32m ago•0 comments

Show HN: Resumable Web Streams

https://github.com/vercel/resumable-stream
2•cramforce•37m ago•0 comments

AMC Says It Will Show More Ads Before Movies

https://www.nytimes.com/2025/06/06/business/movies-theaters-ads-amc.html
3•cebert•45m ago•3 comments

Getting C++ Hello World working on Windows (a comedy & tragedy)

https://sdegutis.github.io/blog/creating-cpp-hello-world.html
2•90s_dev•47m ago•2 comments

NASA delays next flight of Boeing's alternative to SpaceX Dragon

https://theedgemalaysia.com/node/758199
2•bookmtn•49m ago•0 comments

Can Schrodinger's Cat Factor Numbers?

https://mathpages.com/home/kmath013/kmath013.htm
2•gametorch•49m ago•0 comments

NASA Delays Next Flight of Boeing's Alternative to SpaceX Dragon

https://www.bloomberg.com/news/articles/2025-06-06/nasa-delays-next-flight-of-boeing-s-alternative-to-spacex-dragon
2•bookmtn•51m ago•0 comments

California AG vows crack down on copper wire thefts in the state

https://abc7.com/post/california-ag-rob-bonta-vows-crack-down-copper-wire-thefts-state/16678391/
2•lxm•51m ago•0 comments

Show HN: A photo backup idea – to your own storage, not iCloud/Google

https://myphoto-vault.netlify.app/
3•Nainiket•57m ago•0 comments

Trump administration races to fix a big mistake: DOGE fired too many people

https://www.washingtonpost.com/business/2025/06/06/doge-staff-cuts-rehiring-federal-workers/
11•MilnerRoute•58m ago•1 comment

Getting Past Procastination

https://spectrum.ieee.org/getting-past-procastination
4•WaitWaitWha•59m ago•2 comments

Reverse Engineering Cursor's LLM Client

https://www.tensorzero.com/blog/reverse-engineering-cursors-llm-client/
3•paulwarren•1h ago•0 comments

Show HN: Cpdown – Copy any webpage/YouTube subtitle as clean Markdown(LLM-ready)

https://github.com/ysm-dev/cpdown
2•ysm0622•1h ago•0 comments

Pentagon Disinformation Fueled America's UFO Mythology

https://www.wsj.com/politics/national-security/ufo-us-disinformation-45376f7e
3•doener•1h ago•0 comments

Open-source code repos open to supply chain attacks, researchers warn

https://www.scworld.com/news/open-source-code-repos-open-to-supply-chain-attacks-researchers-warn
3•ricecat•1h ago•0 comments

Ask HN: What non-AI projects are you working on?

4•kikki•1h ago•3 comments

Nintendo Switch 2 Teardown [video]

https://www.youtube.com/watch?v=RvD1OCHhhS0
3•Lwrless•1h ago•0 comments

TSA urges people to stop trying to use a Costco card as a sufficient Real ID

https://www.wsfa.com/2025/06/06/tsa-urges-people-stop-trying-use-costco-card-sufficient-real-id/
8•sharkweek•1h ago•0 comments

The reason Indians are lost

https://www.economist.com/asia/2025/06/05/the-real-reason-indians-are-lost
2•RestlessMind•1h ago•1 comment

Ask HN: Why are job descriptions and resumes so bad?

1•throwaway123198•1h ago•0 comments

Show HN: Pcrassist.com – AI powered report assistant for EMTs

https://pcrassist.com/
1•josdijkstra•1h ago•0 comments

Error Monads the Hard Way

https://articles.pragdave.me/p/error-monads-the-hard-way
1•thunderbong•1h ago•0 comments

Show HN: C++ SFML Game Engine for Nintendo Switch, Web (HTML5), PC and Mobile

https://github.com/Is-Daouda/is-Engine
1•Is_Daouda•1h ago•0 comments

Musk's XAI Is Trying to Borrow $5B While His Relationship with Trump Blows Up

https://www.wsj.com/finance/musks-xai-is-trying-to-borrow-5-billion-while-his-relationship-with-trump-blows-up-4b963361
3•TheAlchemist•2h ago•0 comments