news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Agentic Coding and the Problem of Oracles

https://epkconsulting.substack.com/p/agentic-coding-and-the-problem-of

1•qingsworkshop•20s ago•0 comments

Malicious packages for dYdX cryptocurrency exchange empties user wallets

https://arstechnica.com/security/2026/02/malicious-packages-for-dydx-cryptocurrency-exchange-empt...

1•Bender•23s ago•0 comments

Show HN: I built a <400ms latency voice agent that runs on a 4gb vram GTX 1650"

https://github.com/pheonix-delta/axiom-voice-agent

1•shubham-coder•1m ago•0 comments

Penisgate erupts at Olympics; scandal exposes risks of bulking your bulge

https://arstechnica.com/health/2026/02/penisgate-erupts-at-olympics-scandal-exposes-risks-of-bulk...

1•Bender•1m ago•0 comments

Arcan Explained: A browser for different webs

https://arcan-fe.com/2026/01/26/arcan-explained-a-browser-for-different-webs/

1•fanf2•3m ago•0 comments

What did we learn from the AI Village in 2025?

https://theaidigest.org/village/blog/what-we-learned-2025

1•mrkO99•3m ago•0 comments

An open replacement for the IBM 3174 Establishment Controller

https://github.com/lowobservable/oec

1•bri3d•6m ago•0 comments

The P in PGP isn't for pain: encrypting emails in the browser

https://ckardaris.github.io/blog/2026/02/07/encrypted-email.html

2•ckardaris•8m ago•0 comments

Show HN: Mirror Parliament where users vote on top of politicians and draft laws

https://github.com/fokdelafons/lustra

1•fokdelafons•8m ago•1 comments

Ask HN: Opus 4.6 ignoring instructions, how to use 4.5 in Claude Code instead?

1•Chance-Device•10m ago•0 comments

We Mourn Our Craft

https://nolanlawson.com/2026/02/07/we-mourn-our-craft/

1•ColinWright•12m ago•0 comments

Jim Fan calls pixels the ultimate motor controller

https://robotsandstartups.substack.com/p/humanoids-platform-urdf-kitchen-nvidias

1•robotlaunch•16m ago•0 comments

Exploring a Modern SMTPE 2110 Broadcast Truck with My Dad

https://www.jeffgeerling.com/blog/2026/exploring-a-modern-smpte-2110-broadcast-truck-with-my-dad/

1•HotGarbage•16m ago•0 comments

AI UX Playground: Real-world examples of AI interaction design

https://www.aiuxplayground.com/

1•javiercr•17m ago•0 comments

The Field Guide to Design Futures

https://designfutures.guide/

1•andyjohnson0•17m ago•0 comments

The Other Leverage in Software and AI

https://tomtunguz.com/the-other-leverage-in-software-and-ai/

1•gmays•19m ago•0 comments

AUR malware scanner written in Rust

https://github.com/Sohimaster/traur

3•sohimaster•22m ago•1 comments

Free FFmpeg API [video]

https://www.youtube.com/watch?v=6RAuSVa4MLI

3•harshalone•22m ago•1 comments

Are AI agents ready for the workplace? A new benchmark raises doubts

https://techcrunch.com/2026/01/22/are-ai-agents-ready-for-the-workplace-a-new-benchmark-raises-do...

2•PaulHoule•27m ago•0 comments

Show HN: AI Watermark and Stego Scanner

https://ulrischa.github.io/AIWatermarkDetector/

1•ulrischa•27m ago•0 comments

Clarity vs. complexity: the invisible work of subtraction

https://www.alexscamp.com/p/clarity-vs-complexity-the-invisible

1•dovhyi•28m ago•0 comments

Solid-State Freezer Needs No Refrigerants

https://spectrum.ieee.org/subzero-elastocaloric-cooling

2•Brajeshwar•28m ago•0 comments

Ask HN: Will LLMs/AI Decrease Human Intelligence and Make Expertise a Commodity?

1•mc-0•30m ago•1 comments

From Zero to Hero: A Brief Introduction to Spring Boot

https://jcob-sikorski.github.io/me/writing/from-zero-to-hello-world-spring-boot

1•jcob_sikorski•30m ago•1 comments

NSA detected phone call between foreign intelligence and person close to Trump

https://www.theguardian.com/us-news/2026/feb/07/nsa-foreign-intelligence-trump-whistleblower

12•c420•31m ago•2 comments

How to Fake a Robotics Result

https://itcanthink.substack.com/p/how-to-fake-a-robotics-result

1•ai_critic•31m ago•0 comments

It's time for the world to boycott the US

https://www.aljazeera.com/opinions/2026/2/5/its-time-for-the-world-to-boycott-the-us

3•HotGarbage•31m ago•0 comments

Show HN: Semantic Search for terminal commands in the Browser (No Back end)

https://jslambda.github.io/tldr-vsearch/

1•jslambda•31m ago•1 comments

The AI CEO Experiment

https://yukicapital.com/blog/the-ai-ceo-experiment/

2•romainsimon•33m ago•0 comments

Speed up responses with fast mode

https://code.claude.com/docs/en/fast-mode

5•surprisetalk•36m ago•1 comments

Open in hackernews

Memes Are Smarter Than AI (and That Should Terrify Silicon Valley)

https://substack.com/home/post/p-168500363

3•anonym29•6mo ago

Comments

techpineapple•6mo ago

One of the reasons I’m a bit bearish in AI is related to this, there still seem to be some basic tests that AI fail when trying to figure out, is it combining vectors in its training data, or actually understanding. In places where the training data is robust, it feels like understanding, but hit any place where it’s sparse and it stays around the problem.

ChatGPT used to not be able to generate an image of a full glass of wine(you can search YouTube for a long video of this). Probably because there had never been a full glass of wine in its training data.

They fixed that, maybe altering how they label things to include factors they hadn’t thought of? Maybe they literally watched the video and just added images of full glasses of wine? But there was a big article that went on to say “the problem has been solved, they think now”

So i went into chatGPT and started saying “generate me an image with x% full glass of wine, and it doesn’t do it.

Occasionally it gets close but with no consistency. And this is a problem that is very logical/mathematical, I could develop a script to understand the nature of the task easily. weird.

anonym29•6mo ago

Have you tried with frontier models from other providers, like Anthropic, Alphabet, xAI, etc? I am finding the capabilities with this specific test to be vary quite wildly among "leading" models.