frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Eric Lu Wins International Chopin Piano Competition

https://www.nytimes.com/2025/10/21/arts/music/chopin-piano-competition-eric-lu.html
1•laserson•2m ago•0 comments

AmigaOS 3.3 will be released in 2026

https://www.amiga-news.de/en/news/AN-2025-10-00103-EN.html
1•doener•2m ago•0 comments

The hottest term in AI is completely made up

https://www.washingtonpost.com/technology/2025/10/21/nvidia-ai-factories/
1•reaperducer•4m ago•0 comments

PageIndex Chat – Human-Like Long Document AI Analyst

https://pageindex.ai/blog/pageindex-chat
1•mingtianzhang•4m ago•1 comments

Using Async Functions in Celery with Django Connection Pooling

https://mrdonbrown.blogspot.com/2025/10/using-async-functions-in-celery-with.html
1•ipeev•6m ago•0 comments

Thinking Sparks: Emergent Attention Heads in Reasoning Models

https://arxiv.org/abs/2509.25758
1•diwank•10m ago•0 comments

Modern AI on Vintage Hardware: Llama 2 Runs on Windows 98

https://hackaday.com/2025/01/13/modern-ai-on-vintage-hardware-llama-2-runs-on-windows-98/
1•JumpCrisscross•13m ago•1 comments

Amazon Plans to Replace More Than Half a Million Jobs with Robots

https://www.nytimes.com/2025/10/21/technology/inside-amazons-plans-to-replace-workers-with-robots...
1•bookofjoe•22m ago•2 comments

Show HN: ProfiTree, Tax Optimization Tool for DIY Investors

https://www.profitree-tax.com/
1•shahakshat609•24m ago•0 comments

Large Language Models Inference Engines Based on Spiking Neural Networks

https://arxiv.org/abs/2510.00133
2•PaulHoule•24m ago•0 comments

Effects of Swallowable Intragastric Balloon on Weight Loss, Metabolic Syndrome (2017)

https://www.gavinpublishers.com/article/view/effect-of-a-new-swallowableintragastric-balloon-elip...
1•nateb2022•25m ago•0 comments

Surfacing LLM Biases Through Graffiti

https://nullpxl.com/post/surfacing-llm-biases-through-graffiti/
2•nullpxl•25m ago•0 comments

Dangerous and invisible worm found in Visual Studio Code extensions

https://www.heise.de/en/news/Dangerous-and-invisible-worm-found-in-Visual-Studio-Code-extensions-...
2•croes•27m ago•1 comments

Ask HN: Codex / Claude Code vs. Cursor?

2•mholubowski•27m ago•0 comments

Daniel J. Bernstein updated cdb (Constant database) to go beyond 4GB

https://cdb.cr.yp.to/
4•kreco•28m ago•0 comments

Did people in the 90s worry about the efficiency of the internet

2•burgiee•28m ago•1 comments

GitHub Copilot's "Free Plan Limit" Bug: A Year-Long Oversight?

https://danielraffel.me/2025/10/22/github-copilots-free-plan-limit-bug-a-year-long-oversight/
1•atupem•30m ago•0 comments

DHS Asks OpenAI to Unmask User Behind ChatGPT Prompts, Possibly First Such Case

https://gizmodo.com/dhs-asks-openai-to-unmask-user-behind-chatgpt-prompts-possibly-the-first-such...
2•mrtesthah•32m ago•0 comments

Show HN: Streaky – GitHub Streak Monitor with Distributed Cron Processing

https://github.com/0xReLogic/Streaky
1•0xrelogic•33m ago•0 comments

Fork Buckets Like You Fork Code

https://www.tigrisdata.com/blog/fork-buckets-like-code/
1•raoufchebri•33m ago•0 comments

A ritual and the toxic effects of ranking

https://mailchi.mp/f6f9b751ce8c/resilience-postcard-lonely-1669813
1•pcfwik•36m ago•0 comments

Pathom 3 – a Clojure library modelling information systems as attribute graphs

https://pathom3.wsscode.com/
1•Tevo•38m ago•0 comments

Rematch Accelerated by Network Next

https://mas-bandwidth.com/rematch-accelerated-by-network-next/
2•gafferongames•43m ago•0 comments

China Accuses US of Cyberattack on National Time Center

https://www.securityweek.com/china-accuses-us-of-cyberattack-on-national-time-center/
1•Bender•49m ago•2 comments

When yesterday's code becomes today's threat

https://www.scworld.com/perspective/when-yesterdays-code-becomes-todays-threat
1•Bender•49m ago•0 comments

Can you make the slippery road sign in real life

https://www.youtube.com/watch?v=_eL_r7NbAvA
1•lifeisstillgood•53m ago•0 comments

Should We Look on New Technologies with Awe and Dread?

https://www.newyorker.com/culture/open-questions/should-we-look-on-new-technologies-with-awe-and-...
1•fortran77•53m ago•2 comments

John Titor

https://en.wikipedia.org/wiki/John_Titor
2•cryptoz•53m ago•0 comments

Fake home invasion vid lands woman in real trouble

https://www.theregister.com/2025/10/21/woman_pranks_husband_ai_intruder/
2•Bender•53m ago•1 comments

M5 iPad Pro: An AI and Gaming Upgrade for AI and Games That Aren't There Yet

https://www.macstories.net/stories/m5-ipad-pro-review/
1•walterbell•54m ago•0 comments
Open in hackernews

Show HN: Realizing Karpathy's dream of Natural Language Programming

3•amthewiz•8h ago
https://github.com/playbooks-ai/playbooks

Comments

amthewiz•7h ago
Andrej Karpathy posted in early 2023 (https://x.com/karpathy/status/1617979122625712128) -

> "The hottest new programming language is English"

I've built a Natural Language Programming stack for building AI Agents. I think it is the first true Software 3.0 stack.

The core idea: Use LLMs as CPUs! You can finally step debug through your prompts and get reliable, verifiable execution. The stack includes a new language, compiler, developer tooling like VSCode extension.

Programs are written as markdown. H1 tags are agents, H2 tags are natural language playbooks (i.e. functions), python playbooks. All playbooks in an agents run on the same call stack. NL and python playbooks can call each other.

Quick intro video: https://www.youtube.com/watch?v=ZX2L453km6s

Github: https://github.com/playbooks-ai/playbooks (MIT license)

Documentation: https://playbooks-ai.github.io/playbooks-docs/getting-starte...

Project website: runplaybooks.ai

Example Playbooks program -

    # Country facts agent
    This agent prints interesting facts about nearby countries

    ## Main
    ### Triggers
    - At the beginning
    ### Steps
    - Ask user what $country they are from
    - If user did not provide a country, engage in a conversation and gently nudge them to provide a country
    - List 5 $countries near $country
    - Tell the user the nearby $countries
    - Inform the user that you will now tell them some interesting facts about each of the countries
    - process_countries($countries)
    - End program

    ```python
    from typing import List

    @playbook
    async def process_countries(countries: List[str]):
        for country in countries:
            # Calls the natural language playbook 'GetCountryFact' for each country
            fact = await GetCountryFact(country)
            await Say("user", f"{country}: {fact}")
    ```

    ## GetCountryFact($country)
    ### Steps
    - Return an unusual historical fact about $country
There are a bunch of very interesting capabilities. A quick sample -

- "Queue calls to extract table of contents for each candidate file" - Effortless calling MCP tools, multi-threading, artifact management, context management

- "Ask Accountant what the tax rate would be" is how you communicate with other agents

- you can mix procedural natural language playbooks, ReAct playbooks, Raw prompt playbooks, Python playbooks and external playbooks like MCP tools seamlessly on the same call stack

- "Have a meeting with Chef, Marketing expert and the user to design a new menu" is how you can spawn multi-agent workflows, where each agent follows their own playbook for the meeting

- Coming soon: Observer agents (agents observing other agents - automated memory storage, verify/certify execution, steer observed agents), dynamic playbook generation for procedural memory, etc.

I hope this changes how we build AI agents going forward for the better. Looking forward to discussion! I'll be in the comments.

amthewiz•6h ago
It has puzzled me why someone hasn't already done this yet, given LLMs are so good at language now.

Probably the short answer is that is it hard to get this to actually work. There were many open questions that one has to tackle simultaneously -

- What is the right balance between relying on LLMs to do the right thing vs the runtime around LLMs? For example, I went back and forth a few times getting LLMs to manage stack as you call one playbook from another one. Finally decided that it is most reliable to let the runtime take care of that.

- Context engineering - What to put in the prompt, in what order, how to represent state, how to handle artifacts, how to make sure that we use LLM cache optimally as context grows, how to "unwind" context as playbook calls return, how to compact specific types of information, how to make sure important context isn't lost, etc

- LLMs today have vastly different capabilities than 2 years ago. I have had to rewrite the whole stack from scratch 4 times to adjust. Wasn't fun, but had to be done.

- Language(s): How to represent the pseudocode so that it is both fluid natural language and a capable programming language? How to transform it so that it can be executed reliably through LLMs? How to NOT lose the flexibility and fluidity in the process (e.g. easy to convert to a graph like LangGraph, but then you are stuck with control flow), how to create a semantic compiler for that transition, what primitives to use for the compiled language that I call Playbooks Assembly Language [1].

- Agents and multi-agent system considerations - How to represent agents, how they should communicate. Agents are classes and they expose public playbooks that other agents can call). Agents can send natural language messages to each other and engage in conversations. Agents can call multi-party meetings. How can the behavior across all these interaction patterns be defined so it remains intuitive. For example, lifetime of a meeting is tied to "meeting: true" playbooks so the agent simply returns from the playbook to exit a meeting and meeting lifecycle is tied to the host returning from its meeting playbook.

- Which LLMs to support? Go for "Bring your own LLM" or restrict the set? Which LLM? LLM selection impacts how all the internal prompts are implemented so prompt building had to occur simultaneously with LLM selection.

It felt like playing an N-dimensional game of chess!

[1] https://playbooks-ai.github.io/playbooks-docs/reference/play...