frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Brute Force Colors (2022)

https://arnaud-carre.github.io/2022-12-30-amiga-ham/
1•erickhill•3m ago•0 comments

Google Translate apparently vulnerable to prompt injection

https://www.lesswrong.com/posts/tAh2keDNEEHMXvLvz/prompt-injection-in-google-translate-reveals-ba...
1•julkali•3m ago•0 comments

(Bsky thread) "This turns the maintainer into an unwitting vibe coder"

https://bsky.app/profile/fullmoon.id/post/3meadfaulhk2s
1•todsacerdoti•4m ago•0 comments

Software development is undergoing a Renaissance in front of our eyes

https://twitter.com/gdb/status/2019566641491963946
1•tosh•4m ago•0 comments

Can you beat ensloppification? I made a quiz for Wikipedia's Signs of AI Writing

https://tryward.app/aiquiz
1•bennydog224•5m ago•1 comments

Spec-Driven Design with Kiro: Lessons from Seddle

https://medium.com/@dustin_44710/spec-driven-design-with-kiro-lessons-from-seddle-9320ef18a61f
1•nslog•5m ago•0 comments

Agents need good developer experience too

https://modal.com/blog/agents-devex
1•birdculture•7m ago•0 comments

The Dark Factory

https://twitter.com/i/status/2020161285376082326
1•Ozzie_osman•7m ago•0 comments

Free data transfer out to internet when moving out of AWS (2024)

https://aws.amazon.com/blogs/aws/free-data-transfer-out-to-internet-when-moving-out-of-aws/
1•tosh•8m ago•0 comments

Interop 2025: A Year of Convergence

https://webkit.org/blog/17808/interop-2025-review/
1•alwillis•9m ago•0 comments

Prejudice Against Leprosy

https://text.npr.org/g-s1-108321
1•hi41•10m ago•0 comments

Slint: Cross Platform UI Library

https://slint.dev/
1•Palmik•14m ago•0 comments

AI and Education: Generative AI and the Future of Critical Thinking

https://www.youtube.com/watch?v=k7PvscqGD24
1•nyc111•14m ago•0 comments

Maple Mono: Smooth your coding flow

https://font.subf.dev/en/
1•signa11•15m ago•0 comments

Moltbook isn't real but it can still hurt you

https://12gramsofcarbon.com/p/tech-things-moltbook-isnt-real-but
1•theahura•19m ago•0 comments

Take Back the Em Dash–and Your Voice

https://spin.atomicobject.com/take-back-em-dash/
1•ingve•19m ago•0 comments

Show HN: 289x speedup over MLP using Spectral Graphs

https://zenodo.org/login/?next=%2Fme%2Fuploads%3Fq%3D%26f%3Dshared_with_me%25253Afalse%26l%3Dlist...
1•andrespi•20m ago•0 comments

Teaching Mathematics

https://www.karlin.mff.cuni.cz/~spurny/doc/articles/arnold.htm
2•samuel246•23m ago•0 comments

3D Printed Microfluidic Multiplexing [video]

https://www.youtube.com/watch?v=VZ2ZcOzLnGg
2•downboots•23m ago•0 comments

Abstractions Are in the Eye of the Beholder

https://software.rajivprab.com/2019/08/29/abstractions-are-in-the-eye-of-the-beholder/
2•whack•23m ago•0 comments

Show HN: Routed Attention – 75-99% savings by routing between O(N) and O(N²)

https://zenodo.org/records/18518956
1•MikeBee•23m ago•0 comments

We didn't ask for this internet – Ezra Klein show [video]

https://www.youtube.com/shorts/ve02F0gyfjY
1•softwaredoug•24m ago•0 comments

The Real AI Talent War Is for Plumbers and Electricians

https://www.wired.com/story/why-there-arent-enough-electricians-and-plumbers-to-build-ai-data-cen...
2•geox•27m ago•0 comments

Show HN: MimiClaw, OpenClaw(Clawdbot)on $5 Chips

https://github.com/memovai/mimiclaw
1•ssslvky1•27m ago•0 comments

I Maintain My Blog in the Age of Agents

https://www.jerpint.io/blog/2026-02-07-how-i-maintain-my-blog-in-the-age-of-agents/
3•jerpint•27m ago•0 comments

The Fall of the Nerds

https://www.noahpinion.blog/p/the-fall-of-the-nerds
1•otoolep•29m ago•0 comments

Show HN: I'm 15 and built a free tool for reading ancient texts.

https://the-lexicon-project.netlify.app/
5•breadwithjam•32m ago•2 comments

How close is AI to taking my job?

https://epoch.ai/gradient-updates/how-close-is-ai-to-taking-my-job
1•cjbarber•32m ago•0 comments

You are the reason I am not reviewing this PR

https://github.com/NixOS/nixpkgs/pull/479442
2•midzer•34m ago•1 comments

Show HN: FamilyMemories.video – Turn static old photos into 5s AI videos

https://familymemories.video
1•tareq_•36m ago•0 comments
Open in hackernews

Claude Code vs. Codex: I built a sentiment dashboard from Reddit comments

https://www.aiengineering.report/p/claude-code-vs-codex-sentiment-analysis-reddit
141•waprin•3mo ago

Comments

visiondude•3mo ago
Ah bots analyzing bots. Seems openai has a larger bot army than Anthropic rn
aaronSong•3mo ago
openai's crawling is the best. just following anthropic's way
candiddevmike•3mo ago
Need to go to conferences and actually talk to people to understand what Real People (TM) think of GenAI.
TheRoque•3mo ago
And what would that be ? Please make me save a plane ticket
samhclark•3mo ago
I went to a b-sides yesterday (so: small, local, cybersecurity-focused) where someone described their feelings toward GenAI as "praying for Star Trek, but planning for Terminator." Someone else described AGI as a short term inevitability.

Not many others addressed it directly. The vibe I got from offhand remarks was that people felt it was a thing being forced upon them that they are resistant to use.

the_duke•3mo ago
In my experience gpt5-codex (medium) and codex-cli is notably better than Sonnet 4.5 and claude-code. (note: never tried Opus)

It is slower, but the results are much more often correct and it doesn't rush into half-baked solutions/dumb approaches as eagerly.

I'd much rather wait 5 minutes than have to clean up manually or try to coax a model into doing things differently.

I also wouldn't be surprised if the slowness was partially due to OpenAI being quite resource constrained. They are repeatedly complaining about not having sufficient compute.

Bigger picture: I think all the AI coding environments are incredibly immature. There are many improvements to be unlocked.

ripped_britches•3mo ago
That’s falsifiable quite easily by measuring tokens per second.

Rather, the real reason codex takes longer is that it does more work to read more context.

IMO the results are much better with codex, not even close

fragmede•3mo ago
Where codex falls short is in background processing, both running a daemon in the background and using its output as context while simultaneously being interactive for the user, and with subagents, ie, do multiple things in parallel. Presumably codex will catch up, but for now, that puts Claude Code ahead of things for me.

As far as which one is better, it's highly dependent on what we're each doing, but I will say that I have this one project where bare "make" won't work, and I have a script that needs to be run instead. I have instructions to call that script in multiple .md files, and codex is able to call the script instead of make, but it keeps forgetting that and tries to run make which fails and it gets confused. (Claude code running on macOS host but build on Linux vm.) I could work around it, but that really takes the "shiny" factor off of codex+GPT-5 for me.

another_twist•3mo ago
Honestly I think the simplicity of codex to not do anything fancy pants like background coding is what gives it an edge. I am happy to wait for a while and even to repeat context to it (helps me remember stuff anyway) if it types out the right thing.
extr•3mo ago
Notice how pricing is the top discussion theme. People love free shit and it's hard to deny codex usage limits are more generous. My 2c for someone who uses both tools pretty consistently in an enterprise context:

- Codex-medium is better if you have a well articulated plan you "merely" need to execute on, need help finding a bug, have some specific complex piece of logic you need to tweak, truly need a ton of long range context to reason about an issue. It's great and usage limits are very generous!

- Sonnet 4.5 is better for everything else. That means for me: non-coding CLI ops, git ops, writing code with it as a pair programmer, OOD tasks, big new chunks of functionality that are highly conceptual, architectural discussion, etc. I generally approve every edit and often interrupt it. The fast iteration and feedback is key.

I probably use CC 80% of the time with Codex the other 20%. My company pays for CC and I don't even look at the cost. Most of my coworkers use CC over Codex. We do find the Codex PR reviewer to be the best of any tool out there.

Codex gets a lot of play on twitter also because a lot of the most prolific voices there are solo devs who are "building in public". A greenfield, solo project is the ideal (only?) use case for running 5 agents in parallel or whatever. Codex is probably amazing at that. But it's not practical for building in enterprise contexts IMO.

quintu5•3mo ago
For larger tasks that I know are parallelizable, I just tell Claude to figure out which steps can be parallelized and then have it go nuts with sub-agents. I’ve had pretty good success with that.
DanielAtDev•3mo ago
I need to try this because I've never deliberately told it to, but I've had it do it on it's own before. Now I'm wondering if that project had instructions somewhere about that, which could it explain why it happened.
quintu5•3mo ago
It sometimes does it on its own, but to get it to do so consistently, it needs to be told. Doubly so if you want it to split off more than one sub-agent.

This works great for refactors that touch a large number of files. You can knock out a refactor that might take 30 minutes, a persistent checklist, and possibly multiple conversations, and one-shot it in two minutes and a single prompt.

qsort•3mo ago
They are similar enough that using one over the other is at most a small mistake. I prefer Claude models (perhaps I'm more used to them?) but Codex is also very good.
extr•3mo ago
Totally agree. A lot of it is simply personal preference at this point.
another_twist•3mo ago
> better for everything else. That means for me: non-coding CLI ops, git ops, writing code with it as a pair programmer, OOD tasks, big new chunks of functionality that are highly conceptual, architectural discussion..

I would argue this is the wrong way of using these tools. Writing out a defined plan in plain english and then have codex / claude write it out is better since that way we understand the intention. You can always have codex come up with an abstract plan first, iterate on it and then implement. Kind of like how we would implement software in real life.

loveparade•3mo ago
Interesting, my experience has been the opposite. I've been running Codex and Sonnet 4.5 side by side the past few weeks, and Codex gives me better results 90% of the time, pretty much across all tasks. Where Claude really shines is that it's much faster than codex. So if I know exactly what I want or if it's a simpler task I feel comfortable giving it to Claude because I don't want to wait for Codex to work through it. Claude cli is also a much better user experience than codex cli. But Codex gets complex things right more consistently.
distances•3mo ago
My experience is similar. So most of the work I do with Claude as I like the small tasks / fast iteration pair coding experience. When I need to investigate some issues I let Codex handle it, and check back in 10 minutes when it's ready. But Codex is way too slow for the pair programming style of work.

Also, most of the time Codex opts to use Python to edit files. Those edits are unreviewable so it's even less interactive, you just have to let it finish and check the outcome.

kazinator•3mo ago
Sucks-Rules-o-Meter, but 2025.
mikeocool•3mo ago
Reading the comments and posts about both Claude Code and Codex on Reddit (and often hacker news), it’s hard to imagine they’re not extremely astroturfed.

There seems to be constant stream of not terribly interesting or unique “my Claude code/codex success story” blog posts that mange to solicit so many upvotes.

fragmede•3mo ago
In life, it helps to be skeptical, so the real question is where do I find real life humans to ask about their experiences? And even then, they could still be paid actors. Though, I've often wondered how would that work. Like, the marketing department staffed by hot people finds developers and then offers to Venmo them $500 to write something nice online about the product? It's a big Internet, and there's a lot of people on Upwork, so I'm not saying it isn't happening, but I've never gotten an email asking me to write something nice about Claude Code in exchange for a couple of bucks.
100721•3mo ago
One thing worth taking into account is the practice of finding people who actually like the product, and then paying them to write an honest review. I find this to be much closer to ethical than paying exclusively for positive reviews to people who may not have ever used the product, but it has a similar net effect of distorting the sentiment by amplifying a subset of opinions, so still not ideal but at least it’s rooted in honesty.

If you haven’t been vocal about your support of products in general, you wouldn’t show up on the radar for these “opportunities.”

fragmede•3mo ago
I recently got a comma.ai. I really like it. I tell everyone I know to get one. But I'm embarrassed to talk about it on the open Internet because I don't want to be accused of being paid to say good things about them.

I haven't been angling for an opportunity, but the world of marketing to developers isn't the same as for, say, a new face cream.

Paying for a good review on a site that features reviews, eg Amazon or Yelp is one thing. Paying people to troll the Internet at large and make random comments on random sites or discord/etc just seems a bit much.

Then again, the appearance of money make people doubt people are sincere about other things. Specifically, my employer is an AI tech company means that anything pro-AI, even for a different company that's competing with mine, or in a totally different area than my employer's, is suspect.

Human psychology is weird

nickstinemates•3mo ago
Meanwhile I am talking about unique shit with Claude Code trying to draft on that sentiment for little to no traction with them. We've built the best way to automate and manage production infrastructure using these models and no one gives a shit. It's so weird.
typpilol•3mo ago
Best according to?
100721•3mo ago
> Meanwhile I am talking about unique shit with Claude Code trying to draft on that sentiment for little to no traction with them.

What does this mean? What do you mean unique shit? What do you mean when you say you’re trying to draft on the sentiment? What is “them” referring to?

Genuinely. I’m not being (deliberately) obtuse, just trying to follow. Thanks

malshe•3mo ago
Thanks for asking this because for a moment I thought I was too dense to read this correctly
konishipolis•3mo ago
my suspicion is that much (or at least some) of the negative sentiment towards claude code is from folks that were on it early (when code was even more widely used than codex) and created intensive workflows using it. when anthropic tightened quotas to make it more equitable across plan users they were much more likely to be impacted.

this is obviously pure conjecture, but perhaps the OE folks had automated their multiple roles and now they need to be more involved.

ffsm8•3mo ago
Eh, honestly I had some health issues since the vibe coding craze started. Normally I'm one of the people that try things like that out - mostly cuz I don't actually have any hobbies beyond coding and generally find such things funny.

As I got better round June/July I finally found the energy to try it out. It was working incredibly well at the time. It was so fun (for me), that I basically kept playing with it every day after finishing work. So for roughly 1.5 months basically every free minute each day, along with side explorations during work hours when I could get away with it.

Then I had to take another business trip mid August, when I finally came back in September it was unrecognizable - and from my perspective, it definitely hasn't recovered to how ultrathink+opus performed back then.

You can definitely still use it, but you need to take a massively more hands-on approach.

At least my opinion is not swayed by their reduced quota ... But to stay in line with the sentiment analysis this article is about - neither have I tried Codex to this point. Which I will, eventually.

eddiewithzato•3mo ago
Truthfully I find sonnet-4.5 better at Rust code than Codex (medium/high). Haven't tried anything else (like react/typescript) since I only use AI for issues/problems I don't understand.
ralusek•3mo ago
> since I only use AI for issues/problems I don't understand

I only use [coding assistants] for problems I DO understand.

nl•3mo ago
I dunno.

I've been coding for 30 years.

Using Codex I'm finally enjoying it again for the first time in maybe 15 years. Outsource all that annoying part? Heck yeah - bring it on.

And I tell everyone I can how transformational it has been for me.

jxf•3mo ago
Did you also use Claude, and you like Codex better, or are you making a more general observation about the leapfrog in creative power agents are bringing to engineering?
Aeolun•3mo ago
I’ll tell you something. I love working with Claude. It’s enthusiastic, it’s nice, it’ll give you suggestions. It’s an all around pleasant experience.

I hate working with codex. It feels like a machine. You tell it to do something, and it just does it. No pretension at being human, or enthusiastic, or anything really.

But codex almost always does it right. And the comments are right, I never run into random usage limits. Codex doesn’t arbitrarily decide to shrink the context window, or start compacting again after 3 messages.

The codex client sucks, claude code is much better. But the codex client is consistent, which is much more important. Claude was amazing 3 months ago. The model is still fine, but the quality of the experience has degraded so far it’s hard to consider using it.

ragequittah•3mo ago
This is my experience as well. Codex is very verbose which is annoying considering the limits. My work flow tends to be have Claude code describe the problem (succinct as it can) based on my mashing of the keyboard description of what I want done then send that to codex. I've tried it the other way around doesn't work nearly as good. Disclaimer: not using the 5 prompts per week opus.
elcritch•3mo ago
Personally I prefer Codex's less-chatty nature nice. I prefer to save my human emotions for humans.
Aeolun•3mo ago
I'm totally on board with some people preferring that. I don't. However I do prefer my AI assistant to work.
nl•3mo ago
> The codex client sucks, claude code is much better.

Are you using in in VSCode?

I use the web based Codex (which I love) and the VSCode clients of both. I don't think there is a huge difference in the VSCode plugins.

I've tried the cli versions but don't have enough experience to have a strong opinion.

nl•3mo ago
I've used Claude too and I prefer Codex. I generally have confidence in both to tackle large scale problems I won't tackle with Gemini 2.5

I've had a few small bug that Codex has fixed where Claude hasn't.

mewpmewp2•3mo ago
Your comment is obviously not AI generated, but since we were talking about astroturfing on Reddit and which presumably is done a lot by bots, it's interesting to me when I read comments on what kind of triggers my inner LLM detector.

> Outsource all that annoying part? Heck yeah - bring it on.

This sentence really and some of your other cadence somehow triggers my sense a lot. Or the comment somehow feels sloganish, formulaic. Not trying to criticise or offend, just thought it's interesting how it triggers this in my brain. And I do agree with you.

camel-cdr•3mo ago
I'm quite confused by the comments you got on this one, surely half of them must be satirical?
resonious•3mo ago
I think I'm partly responsible. I've been having a lot of fun with these tools, and so seeing other people doing the same just makes me want to engage even if the discussion isn't particularly sophisticated. I swear I'm not paid to do this (actually I pay out the wazoo for Claude..)
_heimdall•3mo ago
Its interesting to me that Codex has such high sentiment. I'm definitely an outlier on the more principled end of the spectrum, but I refuse to use OpenAI products.

I take issue with the AI industry in general and the hand-wavy approach to risk, but OpenAI really is on another level in my book. While I don't trust the industry's approach to AI development, with OpenAI I don't trust the leaderships' intentions.

gorjusborg•3mo ago
> Its interesting to me that Codex has such high sentiment.

Me too, so much so that I doubt this is legitimate. This blog post is the only place I've seen people 'raving' about codex.

Claude Code is the current standard all others are measured against.

psandor•3mo ago
It was true until GPT-5. That model hugely improved Codex, so it being comparable with CC is a recent thing.
radial_symmetry•3mo ago
If you want to compare Codex and Claude Code side by side you can do it in Crystal in worktrees from one prompt https://github.com/stravu/crystal
another_twist•3mo ago
Regular codex user. Its my typing assistant. Allows me to be the ideas guy when writing software. Codex makes plenty of mistakes when generating large blocks of code but its easier to cleanup and consolidate with a refactoring pass once the typing had been done.
spott•3mo ago
I really like codex… but without the ability to launch sub-agents, it kinda struggles with context.

The biggest thing I use agents for is getting good search with less context.

Codex just struggles when the model needs to search too much because of this. Codex also struggles with too much context: there have been a number of times when it has just ran up on the context limit and couldn’t compact, so you just loose everything since your last message, which has been a lot of lost context/work for me.

drooby•3mo ago
I'm still using cursor and it seems fine. What does CC and Codex offer that's so much better than Cursor. Idgi
omgitspavel•3mo ago
it has a fatal flaw: 80% of the screen is taken up by the code editor window and a file explorer. Why do you need to waste 80% of your screen on things you don't use anyway?
atlgator•3mo ago
It looks like you have not reviewed r/ClaudeAI. This is a much larger subreddit and most of the posts are about Claude Code. Many comparisons of CC vs Codex.
sunaookami•3mo ago
This sub is full of "vibe coders" that use "prompt engineered" 1000 line prompts with 500 MCPs and then complain that they reach their limit in the first day while using the 200$ max plan.
aiisthefiture•3mo ago
Seems like HN is slowly split between “ai sucks” and everyone else who is slowly discovering what it can do, while Twitter is leagues ahead using other tools to build stuff.
prameshbajra•3mo ago
For a good month I juggled between Claude Code and Codex CLI and found that Codex CLI did the job better. I recently ditched Claude Code and am currently only using Codex CLI.
k__•3mo ago
Is Aider done for?
daliusd•3mo ago
No MCP support (https://github.com/Aider-AI/aider/pull/3937) makes it less useful than other tools (opencode.ai, Claude Code and etc)
kachapopopow•3mo ago
for me I don't understand codex the same way I don't understand gemini.

In my day to day tasks the only models that actually do what I want are the antrophic ones all other ones just fall flat on their face most of the time and end up creating more work than antrophic models.

I wonder if it's because I tend to abuse my models and constantly tell them that they're stupid

projektfu•3mo ago
The struggle is real. When you say, "do this but don't do that" and it does "that", you know AGI is only a step away.
velcrovan•3mo ago
comparing vibe coding tools based on vibes — makes sense!
d4rkp4ttern•3mo ago
I like each at different times in different ways. Now I have both running in separate Tmux panes and have one talk to the other to ask/delegate/verify/validate, using my Tmux-cli tool (now a Claude skill of course):

https://github.com/pchalasani/claude-code-tools

Now my work on a project often spans multiple sessions of these agents. So I use a session-finder and resume/dump tool (also in that repo). I often ask Claude or codex to extract all useful details from a jsonl session log file so I can continue the work.

mpaepper•3mo ago
What about the security aspects, can it run anything?
d4rkp4ttern•3mo ago
I assume by “it” you mean Claude code or codex-cli — that depends on how you launched them or how you modified the permissions within the CLI chat; that’s orthogonal to my CLI tools.