frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

OpenAI Realizes It Made a Terrible Mistake

https://www.msn.com/en-us/news/technology/openai-realizes-it-made-a-terrible-mistake/ar-AA1MwydF
4•galaxyLogic•2h ago

Comments

galaxyLogic•1h ago
I was once working with an E-Learning company and proposed that our multiple-choice tests should give -1 for the wrong choice, 1 for correct choice and 0 for no answer.

Instead they wanted to only give poinhts for correct answers, not penalize wrong answers. That obviously leads to and promotes guessing. Why did they want it that way? I think they wanted to show that with our product people actually learned the stuff and thus was worth paying for. You could pass the test by making "good guesses".

Something similar seems to be going on here. AI companies want their LMS to get good scores even when they don't know the answer, in which case they guess. That is bad because they don't tell us when they're guessing.

I think it should be OK for LMS to guess but only if it clearly tells the user it's answer is just a guess, when it is.

ItsBob•23m ago
> I think it should be OK for LMS to guess but only if it clearly tells the user it's answer is just a guess, when it is.

Or, alternatively, it shows us the confidence level of the answer, e.g. a value between 0 and 1, with 1 being 100% confident.

That would work for me.

We sped up code search for Graphite Chat

https://graphite.dev/blog/how-we-sped-up-code-search-graphite-chat
1•kiyanwang•36s ago•0 comments

Vendor by Default (2021)

https://macwright.com/2021/03/11/vendor-by-default
1•moebrowne•6m ago•0 comments

Trump Is Shutting Down the War on Cancer

https://www.nytimes.com/2025/09/14/magazine/cancer-research-grants-funds-trump.html
2•Teever•9m ago•0 comments

Apache Foundation Unveils Its Branding Overhaul with New Logo and "The ASF" Name

https://www.phoronix.com/news/New-Apache-Software-Logo
1•maxloh•10m ago•0 comments

Three random words to create a password that's 'long enough and strong enough'

https://www.ncsc.gov.uk/collection/top-tips-for-staying-secure-online/three-random-words
1•tombot•15m ago•0 comments

Show HN: AI SEO in WordPress with your own OpenAI key

https://sgeowp.com/
1•glennhv•16m ago•0 comments

Show HN: Instantly turn your LinkedIn into a personal website with AI

https://paje.ai
1•FlorinDobinciuc•17m ago•0 comments

Protecting Rust against supply chain attacks

https://kerkour.com/rust-supply-chain-attacks
2•todsacerdoti•17m ago•0 comments

The EU Data Act Just Killed Long SaaS Contracts

https://revenuewizards.com/blog/the-eu-data-act-just-killed-long-saas-contracts
1•adzicg•18m ago•0 comments

Hacktoberfest 2025

https://hacktoberfest.com
1•samcurryo•18m ago•0 comments

My "Show HN" Follow-Up for "Swimming in Tech Debt"

https://loufranco.com/blog/show-hn-follow-up-for-swimming-in-tech-debt
1•furkansahin•20m ago•0 comments

Samsung 870 QVO 4TB SATA SSD-s: how are they doing after 4 years of use?

https://ounapuu.ee/posts/2025/09/15/samsung-870-qvo/
1•furkansahin•21m ago•0 comments

[shown hn]: A long meeting? See where you could've travelled instead

https://www.meetingmiles.app/
1•mikeborozdin•28m ago•1 comments

Show HN: Celestial Fortunes – AI Blends Eastern and Western Astrology

https://celestialfortunes.net
1•Waffle2180•30m ago•0 comments

Ask HN: Voluntary ID verification for better services. Would you use it?

1•kisamoto•30m ago•0 comments

The OpenRouter for OCR

https://parserstudio.com/
1•ajarellanod•33m ago•1 comments

Network Breakthrough: GNN-Pomdp Enables Robust Policies in Dynamic Systems

https://quantumzeitgeist.com/network-systems-routing-breakthrough-gnn-pomdp-framework-enables-sca...
1•Fake4d•43m ago•0 comments

Redis too slow for ML? Try Blackbird an RDMA multitier distributed cache

https://github.com/blackbird-io/blackbird
6•hackercat010•51m ago•1 comments

Physics-Defying Marketing: Misleading Vendor Article, and VNA Calibration Primer

https://niconiconi.neocities.org/tech-notes/review-of-a-misleading-vendor-article-on-vna-calibrat...
1•ignaloidas•1h ago•0 comments

A list of software and other offerings with free developer tiers

https://free-for.dev/#/
2•EyeRunnMan•1h ago•0 comments

Leaving KDE after 25 years

https://jriddell.org/2025/09/14/adios-chicos-25-years-of-kde/
4•HieronymusBosch•1h ago•0 comments

The Mac App Flea Market

https://blog.jim-nielsen.com/2025/mac-app-flea-market/
2•ingve•1h ago•0 comments

Folks, we have the best π

https://lcamtuf.substack.com/p/folks-we-have-the-best
3•fratellobigio•1h ago•0 comments

Will Macs get Apple's new memory protection?

https://eclecticlight.co/2025/09/15/will-macs-get-apples-new-memory-protection/
2•ingve•1h ago•0 comments

I Spent Weeks Writing My Own Scripting Language for My Game – Was It Worth It? [video]

https://www.youtube.com/watch?v=i5-LqmgytDw
2•skibz•1h ago•0 comments

Trump: Lot of People on the Left "Are Already Under Investigation"

https://www.realclearpolitics.com/video/2025/09/14/trump_a_lot_of_people_you_would_traditionally_...
5•KnuthIsGod•1h ago•1 comments

Prek – a faster, drop-in alternative to pre-commit (written in Rust)

https://github.com/j178/prek
3•joshxa•1h ago•0 comments

Can you make it to the end of this column?

https://www.economist.com/finance-and-economics/2025/09/11/can-you-make-it-to-the-end-of-this-column
2•helsinkiandrew•1h ago•1 comments

Linking to Text Fragments with a Bookmarklet

https://alexwlchan.net/2025/text-fragments-bookmarklet/
2•Bogdanp•1h ago•0 comments

The Anthropic 'Red Team' tasked with breaking its AI models

https://fortune.com/2025/09/04/anthropic-red-team-pushes-ai-models-into-the-danger-zone-and-burni...
1•jaredwiener•1h ago•0 comments