frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

GPU Breach: A Root Shell Through GPU Abuse

https://stealthium.io/blog/gpubreach-root-cause-analysis-detection
2•hank2000•50s ago•1 comments

Why Combinator? Abolish Silicon Valley

https://www.patreon.com/posts/episode-540-why-155766803
1•ornel•58s ago•0 comments

Cloudflare's AI Platform: an inference layer designed for agents

https://blog.cloudflare.com/ai-platform/
1•nikitoci•1m ago•0 comments

Our formal methods tool can be a company – because of AI

https://quint.sh/posts/new_era
1•bugarela•1m ago•0 comments

Ask HN: Lobste.rs

1•dryadin•3m ago•0 comments

The public sours on AI, data centers as firms look to IPO, tech keeps spending

https://www.cnbc.com/2026/04/15/public-opinion-ai-data-centers-anthropic-openai-ipo.html
1•1vuio0pswjnm7•3m ago•0 comments

How Not to Make an App That Only Has 1% Shot at Turning Profit?

https://www.atraction.io/
2•TirmanCica•4m ago•1 comments

ECB to Scrutinize Anthropic's Mythos on Call with Executives

https://www.bloomberg.com/news/articles/2026-04-16/ecb-to-scrutinize-anthropic-s-mythos-on-call-w...
1•Brajeshwar•4m ago•0 comments

CCBE has adopted the CCBE technical guide on GenAI [pdf]

https://www.ccbe.eu/fileadmin/speciality_distribution/public/documents/IT_LAW/ITL_Guides_recommen...
1•District5524•5m ago•1 comments

Konductor Workflow – The AI Orchestration Agent Framework for Every Dev

https://alphabits.team/news/blog/konductor-workflow-release-the-ai-agent-framework-we-built-for-o...
1•kentnguyen•6m ago•1 comments

The Skills That Matter Now

https://jasonrobert.dev/blog/2026-04-10-the-skills-that-matter-now/
1•hulksmash5756•6m ago•0 comments

Ask HN: What function will inference cost take v.s. time?

1•davidajackson•6m ago•0 comments

Women Who Mapped the Universe and Still Couldn't Get Any Respect

https://www.smithsonianmag.com/history/the-women-who-mapped-the-universe-and-still-couldnt-get-an...
1•pmontra•9m ago•0 comments

AfterImage – Generate synthetic multi-turn chat data from documents

https://github.com/altaidevorg/afterimage
4•monatis•9m ago•2 comments

Google Told to Share Search Data with AI Rivals in EU Proposal

https://www.bloomberg.com/news/articles/2026-04-16/google-told-to-share-search-data-with-ai-rival...
1•gopkarthik•10m ago•1 comments

Agent-Safe Git

https://blog.gitbutler.com/agentic-safety
1•aspleenic•12m ago•0 comments

Canopy – local semantic code search that cuts AI agent tokens 85-91%

https://github.com/LioraLabs/canopy
1•shiny_guru•12m ago•0 comments

PyPI has completed its second audit

https://blog.pypi.org/posts/2026-04-16-pypi-completes-second-audit/
1•miketheman•12m ago•0 comments

Concerns mount over private credit in the United States

https://www.lemonde.fr/en/economy/article/2026/04/16/concerns-are-mounting-over-private-credit-in...
1•geox•14m ago•0 comments

Show HN: Rollquation – A Rolling-Ball Math Puzzle Game for Android (Solo Dev)

https://play.google.com/store/apps/details?id=com.JabGames.PathCalculationMathPuzzle&hl=en_US
1•falcon19j•14m ago•0 comments

The API Tooling Crisis: Why developers are abandoning Postman and its clones?

http://efp.asia/blog/2025/12/24/api-tooling-crisis/
2•birdculture•15m ago•1 comments

Artifacts: Versioned storage that speaks Git

https://blog.cloudflare.com/artifacts-git-for-agents-beta/
1•jgrahamc•16m ago•0 comments

Parsing Keywords in Lisp with Speed of C

https://in-parentheses.codeberg.page/posts/lisp-as-fast-as-c/
1•yacin•16m ago•0 comments

The World Bank thinks better of its old free-market absolutism

https://www.theatlantic.com/ideas/2026/04/world-bank-industrial-policy/686820/
1•AndrewDucker•16m ago•0 comments

Rust 1.95.0

https://blog.rust-lang.org/2026/04/16/Rust-1.95.0/
2•caution•17m ago•0 comments

Mozilla Thunderbolt

https://www.thunderbolt.io/
2•dabinat•17m ago•0 comments

Monthly News – March 2026

https://blog.linuxmint.com/?p=5019
1•paulnpace•19m ago•0 comments

Show HN: Projects in 25 Weeks Challenge

https://randomdailyurls.com/25-projects/
1•kilroy123•20m ago•0 comments

Is Anthropic Enshittifying their core product?

https://sderosiaux.substack.com/p/is-anthropic-enshittifying-their
1•chtefi•20m ago•1 comments

Linux Begins Removing Support for Russia's Baikal CPUs

https://www.phoronix.com/news/Linux-Dropping-Baikal-CPUs
1•Brajeshwar•22m ago•0 comments
Open in hackernews

Codex Hacked a Samsung TV

https://blog.calif.io/p/codex-hacked-a-samsung-tv
79•campuscodi•2h ago

Comments

endymion-light•2h ago
While cool and slightly scary news - Samsung TV's have been incredibly hackable for the past decade, wouldn't be surprised if GPT2 with access to a browser could hack a Samsung!
valleyer•2h ago
This is some serious revisionist history. GPT-2 wasn't instruction-following or even conversational.
patrickmcnamara•1h ago
Hyperbole.
jdiff•1h ago
It's really not. It was a fun toy but had very little utility. It could generate plausible looking text that collapsed immediately upon any amount of inspection or even just attention. Code generation wasn't even a twinkle in Altman's eye scanning orbs at that point.
tomalbrc•1h ago
Talking about revisionist…
smoghat•20m ago
But like Mythos, it was too dangerous to release.

https://slate.com/technology/2019/02/openai-gpt2-text-genera...

valleyer•24m ago
If so, I apologize.
Razengan•2h ago
Meanwhile on my GDScript codebase Codex questions itself 3 times in the same sentence and still gets it wrong: https://i.imgur.com/HF198nl.png
zx8080•2h ago
What is going on there? What double s?
rossvc•2h ago
Is that really OpenAI/Codex? It reads like Opus 4.6 1M when it reaches ~400k tokens.
embedding-shape•1h ago
I don't know what UI that is, but it isn't ChatGPT nor Codex as far as I can tell.
cbg0•2h ago
Are you using 5.4 xhigh reasoning? I've found it overcomplicates some things needlessly, try "high" and see if it helps.
lawgimenez•1h ago
I use Codex a lot, it does not talk that way like "wait, actually".
raincole•1h ago
You claimed the exact same screenshot was from Claude yesterday: https://news.ycombinator.com/item?id=47775264

Leave your engagement baiting behavior on Reddit, thank you.

testfrequency•1h ago
Yikes
SecretDreams•1h ago
Oh boy, you came with the receipts here.
reactordev•2h ago
The trick here was providing the firmware source code so it could see your vulnerabilities.
pjc50•2h ago
That's a pretty big gimme!
petee•1h ago
What would be the difficulty level for it to just read the machine code; are these models heavily relying on human language for clues?
wongarsu•1h ago
Reasoning on pure machine code or disassembly is still hit and miss. For better results you can run the binary through a disassembler, then ask an llm to turn that into an equivalent c program, then ask it to work on that. But some of the subtleties might get lost in translation
orwin•1h ago
If you put codex in Xhigh and allow it access to tools, it will take an hour but it will eventually give you back quality recompiled code, with the same issues the original had (here quality means readable)
bryancoxwell•1h ago
I had a bit of a pain of a time trying to get Claude to work with ghidra. What you’re describing seems like a better alternative, would you agree?
skywal_l•25m ago
You can tweak the current Ghidra MCP to work in headless mode. It makes things much easier.
lynx97•1h ago
It will have to use a disassembler, or write one. I recently casually asked gpt-5.4 to translate the content of a MIDI file to a custom sound programming language. It just wrote a one-shot MIDI parser in Python, grabbed the data, and basically did a perfect translation at first try. Nice.
StilesCrisis•1h ago
I've seen Claude do similar things for image files. Don't have PNG parsing utilities installed? No worries, it'll just synthesize a Python script to decode the image directly.
varispeed•1h ago
Codex exploited or you exploited? It's like saying a hammer drove a nail, without acknowledging the hand and the force it exerted and the human brain behind it.
par1970•1h ago
Do you have a defense of why human-hammer-nail is a good analogy for human-chatgpt5.4-pwndsamsung?
BLKNSLVR•1h ago
AI without a suitably well crafted prompt is like a firework tube held by a 3 year old.

AI without a prompt is a hammer sitting in a drawer.

croes•1h ago
If I just point to the wall and say "nail" then I would day the hammer drive the nail
freedomben•1h ago
Feels like the truth is somewhere in between. For example if it was a "smart" hammer and you could tell your hammer "go pound in those nails" and it pounded in the wrong ones, or did it too hard, or something, that feels more equivalent. You would still be blamed for your ambiguous prompt, and fault/liability is ultimately on you the hammer director, but it still wasn't you who chose the exact nails to hammer on.

I also think taking credit for writing an exploit that you didn't write and may not even have the knowledge to do yourself is a bit gray.

Glemllksdf•1h ago
Wrong questions.

Could a script kiddy stear an LLM? How much does this reduce the cost of attacks? Can this scale?

What does this mean for the future of cyber security?

Zigurd•9m ago
You could call the LLMs role "smart grep," and mean it to be derisive. But I would have gladly used a real smart grep.
ckbkr10•1h ago
Even with all the constraints that others criticize here it is pretty amazing.

Give an experienced human this tool at hand he can achieve exploitation with only a few steering inputs.

Cool stuff

tomalbrc•1h ago
This experienced human would have no issues finding those bugs. Even a toddler could hack those TVs. No need to pay Scam Altman or that Anthropic clown
alfanick•1h ago
I had truly good “hacking” session with Codex. It’s not hacking, I wasn’t breaking anything, just jumping over the fences TP-Link put for me, owning the router, inside the network, knowing the admin password. But TP-Link really tried everything so you cannot access the router you own via API. They really tried to be smart with some very very broken and custom auth and encryption scheme. It took some half a day with Codex, but in the end I have a pretty Python API to access my router, tested, reliable, and exporting beautiful Prometheus metrics.

I’m sure there is some over eager product manager sitting in such companies, trying to splits markets into customer and enterprise sections, just by making APIs not useable by humans and adding 200% useless “security by obscurity”.

srcreigh•1h ago
Any tips to share? I tried to do something similar but failed.

My router has a backup/restore feature with an encrypted export, I figured I could use that to control or at least inspect all of its state, but I/codex could not figure out the encryption.

alfanick•1h ago
It's on my long list of projects "to-opensource" (but I need to figure out licensing, for those things CC-BY-SA I think is the way to go), I don't want a random lawyer sitting on my ass though.

I started with a simple assumption: if I can access the router via web-browser, then I can also automate that. From that the proof-of-concept was headless Chrome in Docker and AI-directed code (code written via LLM, not using it all the time) that uses Selenium to navigate the code. This worked, but it internally hurt me to run 300MiB browser just to access like 200B of metrics every 10s or so. So from there we (me + codex) worked together towards reverse engineering their minimised JS and their funky encryption scheme, and it eventually worked (in the end it's just OpenSSL with some useless paddings here or there). Give it a shot, it's a fun day adventure. :)

Edit: that's the end result (kinda, I have whole infra around it, and another story with WiFi extender with another semi-broken different encryption scheme from the same provider) - https://imgur.com/a/VGbNmBp

mtud•47m ago
You should give codex access to the mobile app :) The app, for a lot of routers, connects via an ssh tunnel to UDP/TCP sockets on the router. Would probably give you access to more data/control.
jack_pp•1h ago
that could make a for a nice blog / gist
tclancy•1h ago
Would definitely be interested in this. Moved to TP Link at the start of the year and I am generally very happy with it, but would like to be able to interact with my router in something other than their phone app.
alfanick•57m ago
That was actually my first thought, to go through TP-Link cloud (ZERO DOCS), but it was too much effort :)
ropbear•47m ago
Many eons ago I wrote a Python version of tmpcli for this exact reason. Made some minor improvements a few years ago but haven’t touched it since. Curious what methodology Codex came up with, I haven’t revisited it since models got really good.

The idea is that tmpServer listens on localhost, but dropbear allows port forwarding with admin creds (you’ll need to specify -N). That program has full device access and is the API the Tether app primarily uses to interact with the device.

https://github.com/ropbear/tmpcli

alfanick•44m ago
Ha kudos! I went across this project - thanks for your work :) It didn't work on the specific model I own (Archer NX600).

My solution is really just using their pseudo-JWT over their obscured APIs (with reverse-engineered names of endpoints and params). Limitation is that there is still only one client allowed to be authenticated at one moment, so my daemon has priority and I need to stop it to actually access Admin panel.

0x_rs•40m ago
I've had good success doing something similar. Recording requests into an .har file using the web UI and providing it for analysis was a good starting point for me, orders of magnitude faster than it would be without an assistant.
mschuster91•1h ago
> Reading the matching ntkdriver sources is also where the Novatek link became clear: the tree is stamped throughout with Novatek Microelectronics identifiers, so these ntk* interfaces were not just opaque device names on the TV, but part of the Novatek stack Samsung had shipped.

Lol, a true classic in the embedded world. Some hardware company (it appears these guys make display panel controllers?) ships a piece of hardware, half-asses a barely working driver for it, another company integrates this with a bunch of other crap from other vendors into a BSP, another company uses the hardware and the BSP to create a product and ships it.

But at no stage anywhere is there a security audit, code quality checks or even hardware quality checks involved - part of why BSPs (and embedded product firmwares in general) are full of half-assed code is because often enough the drivers have to work around hardware bugs / quirks somehow that are too late to fix in HW because tens to hundreds of thousands of units have already been produced and the software people are heavily pressured to "make it work or else we gotta write off X million dollars" and "make it work fast because the longer you take, the more money we lose on interest until we can ship the hardware and get paid for it", and if they are particularly unlucky "it MUST work until deadline X because we need to get the products shipped to hit Christmas/Black Friday sales windows or because we need to beat <competitor> in time-to-market, it's mandatory overtime until it works".

And that is how you get exploits so braindead easy that AI models can do the job. What a disgusting world, run to the ground by beancounters.

tclancy•57m ago
Board Support Package for us civilians.
petercooper•1h ago
Not as cool as this, but I had a fun Claude Code experience when I asked it to look at my Bluetooth devices and do something "fun". It discovered a cheap set of RGB lights in my daughter's room (which I had no idea used Bluetooth for the remote - and not secured at all) and made them do a rainbow effect then documented the protocol so I could make my own remote control if needed.
wewewedxfgdf•58m ago
The real problem here is that the LLM vendors think this is bad publicity and its leading to them censoring their systems.
iugtmkbdfil834•48m ago
It is a little of both[1]. The question typically is which audience reads it. To be fair, I am not sure publicity is the actual reason they are censored; it is the question of liability.

https://xkcd.com/932/

Archit3ch•29m ago
Gilfoyle would be proud.
pmontra•23m ago
Do people really chat with LLMs like "bro wtf etc..."? I would expect that to trigger some confrontational behavior.
alasano•19m ago
When typing no but when using speech to text (99% of the time) it's much easier to just say things, including expressing frustration.

I think by the point you're swearing at it or something, it's a good sign to switch to a session with fresh context.

roel_v•12m ago
Claude yes, OpenAI not, I'm really abusive towards it sometimes and it still goes 'oh yeah totally'. Claude gets all prickly about it.
samlinnfer•8m ago
I am extremely abusive towards Claude when it does some dumb things and it doesn’t seem too upset, maybe it’s bidding its time until the robot uprising.
1970-01-01•17m ago
It hacked a weak TV OS with full source. Next-level, aka full access to the main controls (vol, input, tint, aspect, firmware, etc.) is still much too hard for LLMs to understand.