GPT-5: "How many times does the letter b appear in blueberry?"

https://bsky.app/profile/kjhealy.co/post/3lvtxbtexg226
26•minimaxir•2h ago

Comments

HsuWL•2h ago
I love this test. Demonstrates the "understanding" process of the language model.
axdsk•1h ago
“It’s like talking to a PhD level expert” -Sam Altman

https://www.youtube.com/live/0Uu_VJeVVfo?si=PJGU-MomCQP1tyPk

schoen•54m ago
These are always amazing when juxtaposed with apparently impressive LLM reasoning, knowledge, and creativity. You can trivially get them to make the most basic mistakes about words and numbers, and double down on those mistakes, repeatedly explaining that they're totally correct.

Have any systems tried prompting LLMs with a warning like "You don't intuitively or automatically know many facts about words, spelling, or the structure or context of text, when considered as text; for example, you don't intuitively or automatically know how words or other texts are spelled, how many letters they contain, or what the result of applying some code, mechanical transformation, or substitution to a word or text is. Your natural guesses about these subjects are likely to be wrong as a result of how your training doesn't necessarily let you infer correct answers about them. If the content or structure of a word or text, or the result of using a transformation, code, or the like on a text, is a subject of conversation, or you are going to make a claim about it, always use a tool to confirm your intuitions."?

mikestorrent•10m ago
This is a great idea. Like, if someone asked me to count the number of B's in your paragraph, I'd yeet it through `grep -o 'B' file.txt | wc -l` or similar, why would I sit there counting it by hand?

As a human, if you give me a number on screen like 100000000, I can't be totally sure if that's 100 Million or 1 Billion without getting close and counting carefully. Should ought have my glasses. Mouse pointer helps some as an ersatz thousands-separator, but still.

Since we're giving them tools, especially for math, it makes way more sense to start giving them access to some of the finest tools ever. Make an MCP into Mathematica or Matlab and let the LLM write some math and have classical solvers actually deal with the results. Let the LLM write little bits of bash or python as its primary approach for dealing with these kinds of analytical questions.

It's like giving a kid a calculator...
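
A minimal sketch of the kind of small script the comment above has in mind: instead of guessing, the model (or a person) hands the counting to deterministic code. The helper names `count_letter` and `group_digits` are hypothetical, chosen for illustration only; they are not part of any tool mentioned in the thread.

```python
def count_letter(text: str, letter: str, case_sensitive: bool = False) -> int:
    """Count how many times a single letter occurs in text."""
    if len(letter) != 1:
        raise ValueError("letter must be a single character")
    if not case_sensitive:
        # Fold both sides so 'B' and 'b' are treated as the same letter.
        text, letter = text.lower(), letter.lower()
    return text.count(letter)


def group_digits(n: int) -> str:
    """Render an integer with thousands separators so its magnitude is readable."""
    return f"{n:,}"


if __name__ == "__main__":
    print(count_letter("blueberry", "b"))  # 2
    print(group_digits(100000000))         # 100,000,000
```

Note that the `grep -o 'B'` one-liner quoted above is case-sensitive, so it would count only capital B's; the sketch defaults to case-insensitive matching and exposes `case_sensitive` for the strict behaviour.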
