news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

OpenAI Realizes It Made a Terrible Mistake

https://www.msn.com/en-us/news/technology/openai-realizes-it-made-a-terrible-mistake/ar-AA1MwydF

5•galaxyLogic•4mo ago

Comments

galaxyLogic•4mo ago

I was once working with an E-Learning company and proposed that our multiple-choice tests should give -1 for the wrong choice, 1 for correct choice and 0 for no answer.

Instead they wanted to only give poinhts for correct answers, not penalize wrong answers. That obviously leads to and promotes guessing. Why did they want it that way? I think they wanted to show that with our product people actually learned the stuff and thus was worth paying for. You could pass the test by making "good guesses".

Something similar seems to be going on here. AI companies want their LMS to get good scores even when they don't know the answer, in which case they guess. That is bad because they don't tell us when they're guessing.

I think it should be OK for LMS to guess but only if it clearly tells the user it's answer is just a guess, when it is.

ItsBob•4mo ago

> I think it should be OK for LMS to guess but only if it clearly tells the user it's answer is just a guess, when it is.

Or, alternatively, it shows us the confidence level of the answer, e.g. a value between 0 and 1, with 1 being 100% confident.

That would work for me.

galaxyLogic•4mo ago

That's a good idea

gus_massa•4mo ago

It depends on what you want. We gave our students voluntary additional exorcice, and we gave them 1 point for a correct answer and .5 in the second guess. The idea was to encourage them to learn from their first mistake (and perhaps some clue, like "remember the parenthesis" or "(a/b)/(c/d) is not (ac)/(bd)")

galaxyLogic•4mo ago

That's a great idea

AegisMind – AI system with 12 brain regions modeled on human neuroscience

https://www.aegismind.app

1•aegismind_app•2m ago•1 comments

Zig – Package Management Workflow Enhancements

https://ziglang.org/devlog/2026/#2026-02-06

1•Retro_Dev•4m ago•0 comments

AI-powered text correction for macOS

https://taipo.app/

1•neuling•7m ago•1 comments

AppSecMaster – Learn Application Security with hands on challenges

https://www.appsecmaster.net/en

1•aqeisi•8m ago•1 comments

Fibonacci Number Certificates

https://www.johndcook.com/blog/2026/02/05/fibonacci-certificate/

1•y1n0•10m ago•0 comments

AI Overviews are killing the web search, and there's nothing we can do about it

https://www.neowin.net/editorials/ai-overviews-are-killing-the-web-search-and-theres-nothing-we-c...

3•bundie•15m ago•0 comments

City skylines need an upgrade in the face of climate stress

https://theconversation.com/city-skylines-need-an-upgrade-in-the-face-of-climate-stress-267763

3•gnabgib•16m ago•0 comments

1979: The Model World of Robert Symes [video]

https://www.youtube.com/watch?v=HmDxmxhrGDc

1•xqcgrek2•20m ago•0 comments

Satellites Have a Lot of Room

https://www.johndcook.com/blog/2026/02/02/satellites-have-a-lot-of-room/

2•y1n0•21m ago•0 comments

1980s Farm Crisis

https://en.wikipedia.org/wiki/1980s_farm_crisis

4•calebhwin•21m ago•1 comments

Show HN: FSID - Identifier for files and directories (like ISBN for Books)

https://github.com/skorotkiewicz/fsid

1•modinfo•26m ago•0 comments

Show HN: Holy Grail: Open-Source Autonomous Development Agent

https://github.com/dakotalock/holygrailopensource

1•Moriarty2026•33m ago•1 comments

Show HN: Minecraft Creeper meets 90s Tamagotchi

https://github.com/danielbrendel/krepagotchi-game

1•foxiel•41m ago•1 comments

Show HN: Termiteam – Control center for multiple AI agent terminals

https://github.com/NetanelBaruch/termiteam

1•Netanelbaruch•41m ago•0 comments

The only U.S. particle collider shuts down

https://www.sciencenews.org/article/particle-collider-shuts-down-brookhaven

2•rolph•44m ago•1 comments

Ask HN: Why do purchased B2B email lists still have such poor deliverability?

1•solarisos•44m ago•2 comments

Show HN: Remotion directory (videos and prompts)

https://www.remotion.directory/

1•rokbenko•46m ago•0 comments

Portable C Compiler

https://en.wikipedia.org/wiki/Portable_C_Compiler

2•guerrilla•48m ago•0 comments

Show HN: Kokki – A "Dual-Core" System Prompt to Reduce LLM Hallucinations

1•Ginsabo•49m ago•0 comments

Software Engineering Transformation 2026

https://mfranc.com/blog/ai-2026/

1•michal-franc•50m ago•0 comments

Microsoft purges Win11 printer drivers, devices on borrowed time

https://www.tomshardware.com/peripherals/printers/microsoft-stops-distrubitng-legacy-v3-and-v4-pr...

3•rolph•50m ago•1 comments

Lunch with the FT: Tarek Mansour

https://www.ft.com/content/a4cebf4c-c26c-48bb-82c8-5701d8256282

2•hhs•54m ago•0 comments

Old Mexico and her lost provinces (1883)

https://www.gutenberg.org/cache/epub/77881/pg77881-images.html

1•petethomas•57m ago•0 comments

'AI' is a dick move, redux

https://www.baldurbjarnason.com/notes/2026/note-on-debating-llm-fans/

5•cratermoon•58m ago•0 comments

The source code was the moat. But not anymore

https://philipotoole.com/the-source-code-was-the-moat-no-longer/

1•otoolep•58m ago•0 comments

Does anyone else feel like their inbox has become their job?

1•cfata•58m ago•1 comments

An AI model that can read and diagnose a brain MRI in seconds

https://www.michiganmedicine.org/health-lab/ai-model-can-read-and-diagnose-brain-mri-seconds

2•hhs•1h ago•0 comments

Dev with 5 of experience switched to Rails, what should I be careful about?

2•vampiregrey•1h ago•0 comments

AlphaFace: High Fidelity and Real-Time Face Swapper Robust to Facial Pose

https://arxiv.org/abs/2601.16429

1•PaulHoule•1h ago•0 comments

Scientists discover “levitating” time crystals that you can hold in your hand

https://www.nyu.edu/about/news-publications/news/2026/february/scientists-discover--levitating--t...

3•hhs•1h ago•0 comments