frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Blackboard System

https://en.wikipedia.org/wiki/Blackboard_system
1•rgbrgb•1m ago•0 comments

Quantum Signatures of Proper Time in Optical Ion Clocks

https://journals.aps.org/prl/abstract/10.1103/qhj9-pc2b
1•bookofjoe•3m ago•0 comments

Samsung workers reject $340K bonus, demanding annual $900K like SK hynix

https://www.tomshardware.com/tech-industry/big-tech/samsung-chip-workers-reject-usd340-000-one-ti...
2•speckx•7m ago•0 comments

Writting AI evals is not the point

https://www.youtube.com/watch?v=sXusaxZeB7Q
1•paulaq•7m ago•0 comments

EU GPU prices vary by 15%+ depending on which country you buy from

https://www.pricesquirrel.com
1•iluuvdata•7m ago•0 comments

Article Previews in RSS

https://entropicthoughts.com/article-previews-in-rss
1•ibobev•8m ago•0 comments

Extracting experiments from materials science literature

https://www.radical-ai.com/news/litxbench-a-benchmark-for-extracting-experiments-from-materials-l...
1•jryio•8m ago•0 comments

Why street artist Peter Drew decided to bring back his 'Aussie' poster series

https://www.abc.net.au/news/2026-04-27/street-artist-peter-drew-aussie-posters/106602092
1•Tomte•9m ago•0 comments

Fix mojibake in Unicode text, after the fact

https://github.com/rspeer/python-ftfy
1•ankitg12•10m ago•0 comments

Schildbürger

https://en.wikipedia.org/wiki/Schildb%C3%BCrger
1•jjgreen•10m ago•0 comments

Making your own programming language is easier than you think (but also harder)

https://lisyarus.github.io/blog/posts/making-your-own-programming-language.html
1•ibobev•11m ago•0 comments

Moving on from GitHub

https://www.matthewlang.me/posts/2026/05/02/moving-on-from-github/
2•speckx•12m ago•0 comments

Remote Code Execution on Langfuse with a single OTel trace request

https://aisafe.io/blog/traces-of-rce-exploiting-langfuse-prototype-pollution
2•fedex_00•13m ago•0 comments

The race to save US Government datasets before they're deleted

https://www.theguardian.com/us-news/2026/may/07/save-government-datasets-deleted
2•robaato•13m ago•0 comments

Using Python to win Payphone Tag (and get some exercise)

https://www.declanscott.dev/blog/payphone-tag-with-python/
1•declanjscott•13m ago•1 comments

New York Told Ken Griffin to Leave and He Listened

https://quoththeraven.substack.com/p/new-york-told-ken-griffin-to-leaveand
1•ytNumbers•16m ago•0 comments

Advanced Bootleg – Samsung SSD 870 Evo

https://www.s-config.com/advanced-bootleg-samsung-ssd-870-evo/
1•speckx•20m ago•0 comments

Using the Tiptap rich text editor with a Go back end

https://www.clarityboss.com/blog/tips-using-tiptap-golang-backend
1•dan_sbl•20m ago•1 comments

Ten simple rules for optimal and careful use of generative AI in science m

https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013588
1•Anon84•23m ago•0 comments

Prescription digital therapeutics vs. meditation apps: what's different?

https://6th.tech/en/blog/prescription-digital-therapeutics-vs-meditation-apps
1•smanuel•24m ago•1 comments

An AI powered track recommendation app for DJs

https://www.nexttrack.club/
1•AlexBahlk•26m ago•0 comments

Show HN: Discover – a curated RSS directory with mentions via RSS

https://discover.brine.dev/
1•brine•26m ago•0 comments

LLM Inference Series: 4. KV caching, a deeper look

https://medium.com/@plienhar/llm-inference-series-4-kv-caching-a-deeper-look-4ba9a77746c8
1•bjourne•26m ago•0 comments

Potential Consequences of Using Postgres as a Job Queue

https://richyen.com/postgres/2026/05/04/postgres_job_queue.html
1•xngbuilds•27m ago•0 comments

Show HN: Tokid – Token-native IDs for LLM-facing systems

https://github.com/Tetra-Research/tokid
1•tawb•27m ago•0 comments

Deepfakes Are Coming for Your Bank Account

https://www.theatlantic.com/technology/2026/05/chatgpt-images-deepfakes-fraud/687023/
3•speckx•28m ago•0 comments

The bounded process: agent-assisted coding

1•steelkilt•28m ago•0 comments

The Roadmap to Mastering Tool Calling in AI Agents

https://machinelearningmastery.com/the-roadmap-to-mastering-tool-calling-in-ai-agents/
1•eigenBasis•29m ago•0 comments

Show HN: Mex – persistent memory for AI coding agents (~60% token reduction)

1•daksh_jaitly•30m ago•0 comments

New material could shield against radiation in next-gen space tech

https://www.space.com/technology/thinner-than-a-hair-and-stretchy-like-rubber-new-material-could-...
1•mpweiher•31m ago•0 comments
Open in hackernews

Anthropic Economic Index: AI's Impact on Software Development

https://www.anthropic.com/research/impact-software-development
28•meetpateltech•1y ago

Comments

jinay•1y ago
They really need to fix the kerning on some of their charts
Flux159•1y ago
The graphs around programming language & use cases makes sense from my personal usage. What's interesting is that they call out "automation" vs "augmentation", but also mention "directive" vs "feedback loop" for both claude code and claude ai - directive in this case being similar to solving a problem without iteration while feedback loop is the human iterating with an agent.

I think for the near future we're going to continue seeing feedback loop be the primary way we interact with coding agents simply because we generally don't specify all coding tasks with no ambiguity. If we did, then we'd already have some pseudo code that the agent could just directly translate into working code / validate with testing.

nullbyte•1y ago
https://archive.ph/BUAw3
dr_kiszonka•1y ago
> “Feedback Loop” interactions still require user input (even if that input is simply pasting error messages back to Claude).

I often feel that instead of being a developer, my role is reduced to being an entry-level intern* whose only job is to copy and paste errors and screenshots. In many cases, like when using Artifacts or Canvas, this process should be fully automated. For me, besides hallucinations and forgetting, this is the most annoying part of code development with AI.

* Interns should not be doing that either, but I didn't know how to convey it better.

zoogeny•1y ago
> AI is fundamentally changing the ways developers work. Our analysis implies that this is particularly true where specialist agentic systems like Claude Code are used, is particularly strong for user-facing app development work, and might be giving particular advantages to startups as opposed to more established business enterprises.

I think seeing a correlation between software startup adoption and user-facing app development work makes a lot of sense. The advantage for a startup in delivering user appreciable features is high. An entrenched business might be more focused on reliability, security, enterprise readiness and these seem like more risky things to outsource to LLMs which are unproven. But a startup that is trying to get their first 1000 customers is in a more "move fast and break things" mentality and the additional risk of unproven LLM code is much lower.

But those adoption rates and the percentage of code written by LLMs is high. What we aren't seeing in this graph are the total set of startups and the percentage of use within that larger set. But if LLM code does make startups faster to market, faster to PMF, faster to growth, etc. then this is going to be transformative. If the next billion dollar unicorn is written 79% by an AI, we're in for a new world.

> As AI systems become capable of building larger-scale pieces of software, will developers shift to mostly managing and guiding these systems, rather than writing code themselves?

I'm coding about 4 hours per day using LLMs primarily. I am trying to get the LLM to write as much of the code as I can. My experience, with the best LLM coding agents I have tried (Cursor using Claude variants and Gemini 2.5 Pro in the browser) is that they need very careful and continual guidance. The errors creep in and compound if you aren't vigilant.

The pace of improvement, however, is pretty remarkable. I think about 75% of the code in a new project I am working on has been written by the LLM. My biggest challenge, actually, is to avoid taking the reigns. Sometimes, the LLM goes off the rails and I think "It will be easier just to do it myself". However, if I catch myself and instead try to write a more detailed instruction, I can often get the LLM to do the thing perfectly.

It is a similar shift in mindset to when I became a manager. I can no longer just roll up my sleeves and fix the problem, I have to manage the individual who I assign to the problem. Every instinct I have is "I know how to do this, I can do it better than you", but I have to resist.

> But in a relative sense, coding is among the most developed uses of AI in the economy.

This is a bit scary for some developers, but if the skill of instructing AIs effectively is deeply learned then it may be even more valuable than coding. This use of AI is new and if we can manage to instruct ourselves out of a job, we'll be the seasoned AI whisperers that other industries look to for guidance on how to replicate the result.

dr_kiszonka•1y ago
"However, if I catch myself and instead try to write a more detailed instruction."

It probably depends on what you are working on, but I find it more time effective to fix smaller problems myself instead of spending context space on revisions. There are benefits to training juniors, but LLMs learning doesn't extend beyond the current session. For this reason, I focus on revising my system prompts, which now include having LLMs review the proposed code against current documentation and specs for consistency, updating the docs after each code change (pending my approval), etc.

I still haven't figured out how to have separate agents do coding, reviews, and testing, which would be incredibly useful and possibly more context/price-efficient. It would be particularly helpful in projects where small code changes have profound impact but are hard to detect. Unlike when working on UIs where it is relatively easy to spot deviations from the spec, when I am doing more analytical work, an LLM dropping a few of a 100 features or subtly changing my sampling algorithm requires constant vigilance.

zoogeny•1y ago
So we are clearly having the same experiences using LLMs, I can tell based on your comment.

I was doing exactly like you. There is a budget with Cursor and I hit it one day. I was "iterating" with the LLM and getting it to make small tidy up fixes. Tiny "rename that function" or "move that logic from here to there" kind of requests after the initial PR was written (or the dreaded "fix this error", "fix this new error" dreaded loop). That adds up in the course of a day. It also tends to go sideways. Sometimes "just tweak this little thing" ends up with the LLM completely borking a working solution. So I started to get it to do the first major PR and then I would come and clean it up after.

Now I have a new strategy that seems better. If the LLM doesn't one shot my request there are two reasons: first I asked it to do too much and it got confused, second I wasn't clear. So now, instead of iterating I blow away the imperfect attempt and start again. I break the task down even smaller and/or I give much more precise description.

This is proving, for now, to lead to a better end result (for me, in my project, how I do things, YMMV, etc.) I spend more time on the initial prompt with the goal to have 0 iteration. That means giving it as much context as it needs and explaining in minute detail exactly what I want.

While this works for me, I do find I sometimes resist writing a detailed and clear description for tasks I could just do myself. That is what my comment was referencing. I have to relinquish that desire if I am to embrace this new approach fully.

dr_kiszonka•1y ago
Oh, that is quite brilliant! Thanks for describing it. I will give it a shot.

If you felt comfortable sharing one of your perfect, 0-iteration prompts for non-microscopic changes, I am sure more folks than just I would appreciate it.

zoogeny•1y ago
That is exactly what I am working on actually. I am trying to build a web app where the LLM writes 80%+ of the code. The end goal is to present the learnings as encountered on a realistic project. I am currently streaming it on Twitch but the goal is to make it into YouTube videos that document the process.

I should clarify the "perfect 0-iteration" since that would be an exaggeration. Like I said, that is the goal, but not always the outcome. For example, the latest versions of Cursor have an agentic flow that does things like notice when there are linter errors and it automatically fixes them. So a single prompt can end up with the agent itself making iterative changes, without my input. I also make exceptions to my "single prompt" aim when it is reasonable. It is a goal, not a restriction.

I more mean it is a change in intent when I am starting a new task. I do not expect to take a stab at the prompt and then iterate my way to a result. I expect to got 99% of the task done in a single prompt.

As an example from yesterday, I was integrating Prisma (an ORM for Typescript) into my project. Here is the task breakdown (which I had Gemini help to create):

* Install Prisma: Install Prisma: npm install prisma --save-dev, npm install @prisma/client.

* Create schema.prisma: Define models for User, Conversation, Message (reflecting Phase 1 structure initially + Phase 2 additions like User). Configure the datasource db block to use postgresql and read the connection URL from an environment variable (DATABASE_URL provided by Docker Compose).

* Run Migration Run Initial Migration: npx prisma migrate dev --name initial_schema. This will generate the SQL for Postgres and apply it to your local Dockerized DB. Generate Prisma client Generate Prisma Client: npx prisma generate

* Refactor DB code to use Prisma: Refactor existing Phase 1 database interaction logic (currently using sqlite3 driver?) to use the generated Prisma Client.

* Create the users table: Create a SQLite migration (or update DB init). Define a table named users with columns: <snipped SQL column description> Create a UNIQUE index on (provider, provider_id). Update type definitions in src/types/database.types.ts to include DatabaseUser."

* Update existing tables (conversations, artifacts) to use user_id:Ensure the user_id column in conversations and artifacts tables is NOT NULL and is intended to be a foreign key referencing users.id (though SQLite doesn't enforce FKs by default, design it that way). Remove any 'local' default value if previously set. Update database repository functions (conversationRepository, artifactRepository) to require userId for all creation and query operations (e.g., listArtifactsByUser(userId), getConversationById(conversationId, userId)).

Each of those bullets roughly corresponds to a single prompt. Before I start each task I create a brand new Cursor agent context. I don't use the prompt recommendations directly, instead I craft each one using Cursor @file references to give the necessary context. After each task is accomplished I check in the result.

As I mentioned, getting the LLM to write out the broken down tasks is super helpful. I can give a high level description of what I want to do (e.g. replace my hand written SQL with Prisma) and then tell the LLM "break this goal down into small tasks that an LLM should be able to complete".

I'm not an expert on this stuff (no one is yet, field is too new) so I'm just sharing what I'm doing and saying what has worked for me based on my limited experience.