frontpage.

Goal: Ship 1M Lines of Code Daily

1•feastingonslop•3m ago•0 comments

Show HN: Codex-mem, 90% fewer tokens for Codex

https://github.com/StartripAI/codex-mem
1•alfredray•5m ago•0 comments

FastLangML: Context‑aware lang detector for short conversational text

https://github.com/pnrajan/fastlangml
1•sachuin23•9m ago•1 comment

LineageOS 23.2

https://lineageos.org/Changelog-31/
1•pentagrama•12m ago•0 comments

Crypto Deposit Frauds

1•wwdesouza•13m ago•0 comments

Substack makes money from hosting Nazi newsletters

https://www.theguardian.com/media/2026/feb/07/revealed-how-substack-makes-money-from-hosting-nazi...
1•lostlogin•13m ago•0 comments

Framing an LLM as a safety researcher changes its language, not its judgement

https://lab.fukami.eu/LLMAAJ
1•dogacel•15m ago•0 comments

Is anyone interested in a creator economy startup?

1•Nejana•17m ago•0 comments

Show HN: Skill Lab – CLI tool for testing and quality scoring agent skills

https://github.com/8ddieHu0314/Skill-Lab
1•qu4rk5314•17m ago•0 comments

2003: What is Google's Ultimate Goal? [video]

https://www.youtube.com/watch?v=xqdi1xjtys4
1•1659447091•17m ago•0 comments

Roger Ebert Reviews "The Shawshank Redemption"

https://www.rogerebert.com/reviews/great-movie-the-shawshank-redemption-1994
1•monero-xmr•19m ago•0 comments

Busy Months in KDE Linux

https://pointieststick.com/2026/02/06/busy-months-in-kde-linux/
1•todsacerdoti•20m ago•0 comments

Zram as Swap

https://wiki.archlinux.org/title/Zram#Usage_as_swap
1•seansh•33m ago•0 comments

Green’s Dictionary of Slang - Five hundred years of the vulgar tongue

https://greensdictofslang.com/
1•mxfh•34m ago•0 comments

Nvidia CEO Says AI Capital Spending Is Appropriate, Sustainable

https://www.bloomberg.com/news/articles/2026-02-06/nvidia-ceo-says-ai-capital-spending-is-appropr...
1•virgildotcodes•37m ago•2 comments

Show HN: StyloShare – privacy-first anonymous file sharing with zero sign-up

https://www.styloshare.com
1•stylofront•38m ago•0 comments

Part 1 of the Persistent Vault Issue: Your Encryption Strategy Has a Shelf Life

1•PhantomKey•42m ago•0 comments

Show HN: Teleop_xr – Modular WebXR solution for bimanual robot teleoperation

https://github.com/qrafty-ai/teleop_xr
1•playercc7•45m ago•1 comment

The Highest Exam: How the Gaokao Shapes China

https://www.lrb.co.uk/the-paper/v48/n02/iza-ding/studying-is-harmful
2•mitchbob•49m ago•1 comment

Open-source framework for tracking prediction accuracy

https://github.com/Creneinc/signal-tracker
1•creneinc•51m ago•0 comments

India's Sarvam AI launches Indic-language-focused models

https://x.com/SarvamAI
2•Osiris30•52m ago•0 comments

Show HN: CryptoClaw – open-source AI agent with built-in wallet and DeFi skills

https://github.com/TermiX-official/cryptoclaw
1•cryptoclaw•55m ago•0 comments

Show HN: Make OpenClaw respond in Scarlett Johansson’s AI voice from the film Her

https://twitter.com/sathish316/status/2020116849065971815
1•sathish316•57m ago•2 comments

CReact Version 0.3.0 Released

https://github.com/creact-labs/creact
1•_dcoutinho96•59m ago•0 comments

Show HN: CReact – AI Powered AWS Website Generator

https://github.com/creact-labs/ai-powered-aws-website-generator
1•_dcoutinho96•59m ago•0 comments

The rocky 1960s origins of online dating (2025)

https://www.bbc.com/culture/article/20250206-the-rocky-1960s-origins-of-online-dating
1•1659447091•1h ago•0 comments

Show HN: Agent-fetch – Sandboxed HTTP client with SSRF protection for AI agents

https://github.com/Parassharmaa/agent-fetch
1•paraaz•1h ago•0 comments

Why there is no official statement from Substack about the data leak

https://techcrunch.com/2026/02/05/substack-confirms-data-breach-affecting-email-addresses-and-pho...
13•witnessme•1h ago•4 comments

Effects of Zepbound on Stool Quality

https://twitter.com/ScottHickle/status/2020150085296775300
2•aloukissas•1h ago•1 comment

Show HN: Seedance 2.0 – The Most Powerful AI Video Generator

https://seedance.ai/
2•bigbromaker•1h ago•0 comments

AI hallucinations will be solvable within a year (2024)

https://fortune.com/2024/04/16/ai-hallucinations-solvable-year-ex-google-researcher/
12•rvz•6mo ago

Comments

techpineapple•6mo ago
I wonder if it would be better to have 1 “perfect” LLM trying to solve problems or 5 intentionally biased LLMs.
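
A minimal sketch of that five-biased-models idea, in Python. The query() helper, the model name, and the framing strings are hypothetical placeholders for illustration, not a real API:

    from collections import Counter

    # Hypothetical helper: send one prompt to one model, return its answer.
    # In practice this would wrap whatever LLM API you use; stubbed here.
    def query(model: str, prompt: str) -> str:
        raise NotImplementedError("wire up an LLM API here")

    # Five deliberately different "biases", expressed as framing prefixes.
    FRAMINGS = [
        "Answer as a cautious skeptic.",
        "Answer as an optimist.",
        "Answer as a literal-minded pedant.",
        "Answer as a devil's advocate.",
        "Answer as a domain generalist.",
    ]

    def ensemble_answer(prompt: str) -> str:
        # Majority vote across intentionally biased runs of the same question.
        answers = [query("some-model", f"{framing}\n\n{prompt}")
                   for framing in FRAMINGS]
        winner, _count = Counter(answers).most_common(1)[0]
        return winner
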
d00mB0t•6mo ago
I'm so tired of these rich dweebs pontificating to everyone.
ipv6ipv4•6mo ago
There is ample evidence that hallucinations are incurable in the best extant model of intelligence - people.
add-sub-mul-div•6mo ago
Someday we'll figure out how to program computers to behave deterministically so that they can complement our human abilities rather than badly impersonate them.
Dylan16807•6mo ago
Getting down to the level of a median helpful human with the same knowledge would be a massive step forward.

Getting down to the level of a moderately humble expert taking the time to double check would be almost as good as solving it.

davesmylie•6mo ago
Obviously it's well over a year since this article was posted and if anything I've anecdotally noticed hallucinations getting more, not less, common.

Possibly/probably, with another year's experience with LLMs, I'm just more attuned to noticing when they have lost the plot and are making shit up.

BoorishBears•6mo ago
RL for reasoning definitely introduces hallucinations, and sometimes it introduces a class of hallucinations that feels a lot worse than the classic ones.

I noticed OpenAI's models picked up a tendency to hold strong convictions on completely unknowable things.

"<suggests possible optimization> Implement this change and it will result in a 4.5% uplift in performance"

"<provides code> I ran the updated script 10 times and it completes 30.5 seconds faster than before on average"

It's bad enough that it convinces itself it did things it can't do, but then it goes further and hallucinates insights from the tasks it hallucinated itself doing in the first place!

I feel like lay people aren't ready for that. Normal hallucinations felt passive, like a slip up. To the unprepared, this becomes more like someone actively trying to sell their slip ups.

I'm not sure if it's a form of RL hacking making it through to the final model or what, but even OpenAI seems to have noticed it in testing based on their model cards.

alkyon•6mo ago
The more accurate word would be confabulation
Wowfunhappy•6mo ago
You lost this battle, sorry. It's not going to happen.

Both terms are "inaccurate" because we're talking about a computer program, not a person. However, at this point "hallucination" has been firmly cemented in public discourse. I don't work in tech, but all of my colleagues know what an AI hallucination is, as does my grandmother. It's only a matter of time until the word's alternate meaning gets added to the dictionary.

peterashford•6mo ago
Correct. This is the way language works. It's annoying when you know what words mean but this is the way it is.
alkyon•6mo ago
Maybe I lost this battle, but also in science the terminology evolves. If you replace AI hallucination with AI confabulation even your grandmother would get it right. I also don't agree that both terms are equally inaccurate.
Wowfunhappy•6mo ago
> Maybe I lost this battle, but also in science the terminology evolves.

Ah yes, science, where we have fixed stars that move, imaginary numbers that are real, and atoms that can be divided into smaller particles.

alkyon•6mo ago
All of these examples are many centuries old; let's compare apples to apples.
alkyon•6mo ago
Obviously, hallucination is by definition a perception, so it incorrectly anthropomorphizes AI models. On the other hand, the term confabulation involves filling in gaps with fabrication, exactly what LLMs do (aka bullshitting).
more_corn•6mo ago
What an absurd prediction.
DoctorOetker•6mo ago
In the article it is argued that the brainfarts could be beneficial for exploration of new ideas.

I don't agree. The "temperature" parameter should be used for this. Confabulation / bluff / hallucination / unfounded guesses are undesirable at low temperatures.
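
For reference, a minimal sketch of what that temperature parameter does at sampling time (NumPy; illustrative, not any particular model's implementation): logits are divided by the temperature before the softmax, so temperatures near zero approach deterministic greedy decoding while higher temperatures flatten the distribution for exploration.

    import numpy as np

    def sample_token(logits: np.ndarray, temperature: float,
                     rng: np.random.Generator = np.random.default_rng()) -> int:
        # Temperature near 0 approaches greedy (deterministic) decoding;
        # higher temperatures flatten the distribution and explore more.
        if temperature <= 0:
            return int(np.argmax(logits))   # pure greedy, no randomness
        scaled = logits / temperature
        scaled -= scaled.max()              # shift for numerical stability
        probs = np.exp(scaled) / np.exp(scaled).sum()
        return int(rng.choice(len(logits), p=probs))
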

Wowfunhappy•6mo ago
> “If you look at the models before they are fine-tuned on human preferences, they’re surprisingly well calibrated. So if you ask the model for its confidence to an answer—that confidence correlates really well with whether or not the model is telling the truth—we then train them on human preferences and undo this.”

Now that is really interesting! I didn't realize RLHF did that.
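
The calibration claim in that quote is directly measurable; a minimal sketch of the standard expected-calibration-error check, assuming you already have per-question self-reported confidences and 0/1 correctness labels (names are illustrative):

    import numpy as np

    def expected_calibration_error(confidences, correct, n_bins: int = 10) -> float:
        # Bin the self-reported confidences, then compare each bin's mean
        # confidence to its empirical accuracy. A well-calibrated model,
        # as described in the quote, shows a small gap in every bin.
        confidences = np.asarray(confidences, dtype=float)
        correct = np.asarray(correct, dtype=float)
        edges = np.linspace(0.0, 1.0, n_bins + 1)
        ece = 0.0
        for lo, hi in zip(edges[:-1], edges[1:]):
            in_bin = (confidences > lo) & (confidences <= hi)
            if in_bin.any():
                gap = abs(confidences[in_bin].mean() - correct[in_bin].mean())
                ece += in_bin.mean() * gap  # weight by fraction of samples in bin
        return ece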