From LLM to AI Agent: What's the Real Journey Behind AI System Development?

https://www.codelink.io/blog/post/ai-system-development-llm-rag-ai-workflow-agent
40•codelink•4h ago

Comments

nilirl•2h ago
> AI Agents can initiate workflows independently and determine their sequence and combination dynamically

I'm confused.

A workflow has hardcoded branching paths; explicit if conditions and instructions on how to behave if true.

So for an agent, instead of specifying explicit if conditions, you specify outcomes and leave the LLM to figure out which if conditions apply and how to deal with them?

In the case of this resume screening application, would I just provide the ability to make API calls and then add this to the prompt: "Decide what a good fit would be."?

Are there any serious applications built this way? Or am I missing something?
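To make the distinction in the question concrete, here is a minimal sketch of the two styles, not a definitive implementation. Everything in it is an assumption for illustration: the resume fields, the `TOOLS` table, and `fake_chooser`, which stands in for the LLM call that would pick the next step in a real agent.

```python
from typing import Callable, Dict, List

# Hypothetical resume record; the field names are assumptions for illustration.
Resume = Dict[str, object]


def workflow_screen(resume: Resume) -> str:
    """Workflow: the developer hardcodes every branch."""
    if resume["years_experience"] < 3:
        return "reject"
    if "python" not in resume["skills"]:
        return "reject"
    return "interview"


def agent_screen(
    resume: Resume,
    goal: str,
    tools: Dict[str, Callable[[Resume], str]],
    choose_next_step: Callable[[str, Resume, List[str]], str],
    max_steps: int = 5,
) -> str:
    """Agent: the developer supplies a goal and tools; the model picks the steps."""
    notes: List[str] = []
    for _ in range(max_steps):
        # In a real system this would be an LLM call that sees the goal,
        # the resume, and the tool results gathered so far.
        action = choose_next_step(goal, resume, notes)
        if action.startswith("final:"):
            return action[len("final:"):]
        notes.append(tools[action](resume))
    # The step budget ran out without a decision, so hand off to a person.
    return "needs_human_review"


# Hypothetical tool table and a deterministic stand-in for the model.
TOOLS: Dict[str, Callable[[Resume], str]] = {
    "check_skills": lambda r: "skills ok" if "python" in r["skills"] else "skills missing",
}


def fake_chooser(goal: str, resume: Resume, notes: List[str]) -> str:
    if not notes:
        return "check_skills"
    return "final:interview" if "skills ok" in notes else "final:reject"


if __name__ == "__main__":
    candidate = {"years_experience": 5, "skills": ["python", "sql"]}
    print(workflow_screen(candidate))
    print(agent_screen(candidate, "Decide what a good fit would be.", TOOLS, fake_chooser))
```

The workflow fixes the order of the checks in code. The agent only fixes the goal, the available tools, and a step budget, which is why the fallback to human review matters.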

manojlds•2h ago
Not all applications need to be built this way, but the most serious apps built this way are probably deep research tools.

Recent article from Anthropic - https://www.anthropic.com/engineering/built-multi-agent-rese...

alganet•1h ago
An AI company doing it is the corporate equivalent of "works on my machine".

Can you give us an example of a company not involved in AI research that does it?

nilirl•7m ago
Thanks for the link, it taught me a lot.

From what I gather, you can build an agent for a task as long as:

- you trust the decision making of an LLM for the type of decision required; decisions framed as some kind of evaluation of text feel right.

- and if the penalty for being wrong is acceptable.

Just to go back to the resume screening application, you'd build an agent if:

- you asked the LLM to make an evaluation based on the text content of the resume, any conversation with the applicant, and the declared job requirement.

- you had a high enough volume of resumes where false negatives won't be too painful.

It seems like framing problems as search problems helps model these systems effectively. They're not yet capable of design, i.e., being responsible for coming up with the job requirement itself.

mickeyp•1h ago
> A workflow has hardcoded branching paths; explicit if conditions and instructions on how to behave if true.

That is very much true of the systems most of us have built.

But you do not have to do this with an LLM; in fact, the LLM may decide not to follow your explicit conditions and instructions no matter how hard you try.

That is why LLMs are used to review the output of LLMs to ensure they follow the core goals you originally gave them.

For example, you might ask an LLM to lay out how to cook a dish. Then use a second LLM to review if the first LLM followed the goals.
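The generate-then-review pattern described here can be sketched roughly as follows. This is an illustration under stated assumptions rather than any particular tool's implementation: `generate_with_review`, `fake_generate`, and `fake_review` are hypothetical names, and the two fake functions stand in for what would be separate LLM calls.

```python
from typing import Callable, Tuple


def generate_with_review(
    goal: str,
    generate: Callable[[str, str], str],
    review: Callable[[str, str], Tuple[bool, str]],
    max_attempts: int = 3,
) -> Tuple[str, bool]:
    """Ask one model for a draft, then let a second model check it against the goal.

    Returns the last draft and whether the reviewer accepted it.
    """
    feedback = ""
    draft = ""
    for _ in range(max_attempts):
        draft = generate(goal, feedback)
        accepted, feedback = review(goal, draft)
        if accepted:
            return draft, True
    return draft, False


# Deterministic stand-ins for the two LLM calls (assumptions for illustration).
def fake_generate(goal: str, feedback: str) -> str:
    # The first draft forgets a step; the revision uses the reviewer's feedback.
    return "Boil pasta. Add salt." if feedback else "Boil pasta."


def fake_review(goal: str, draft: str) -> Tuple[bool, str]:
    if "salt" in draft.lower():
        return True, ""
    return False, "The recipe never mentions salt."


if __name__ == "__main__":
    print(generate_with_review("Cook pasta with salt", fake_generate, fake_review))
```

The attempt cap is a design choice: a reviewer that never accepts would otherwise loop forever, so the caller gets the last draft back along with a flag saying it was not approved.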

This is one of the things tools like DSPy try to do: you remove the prompt and instead predicate things with high-level concepts like "input" and "output" and then reward/scoring functions (which might be a mix of LLM and human-coded functions) that assess if the output is correct given that input.
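As a rough illustration of the idea of declaring inputs, outputs, and a scoring function instead of hand-tuning a prompt, here is a plain-Python sketch. It deliberately does not use DSPy's API; `exact_match`, `evaluate`, `pick_best`, and the toy programs are hypothetical names, and each "program" is just a function from input text to output text standing in for a prompted model.

```python
from typing import Callable, List, Tuple

Program = Callable[[str], str]        # declared shape: input text -> output text
Example = Tuple[str, str]             # (input, expected output)
Metric = Callable[[str, str], float]  # (expected, actual) -> score in [0, 1]


def exact_match(expected: str, actual: str) -> float:
    """A human-coded scoring function; an LLM judge could fill the same slot."""
    return 1.0 if expected.strip().lower() == actual.strip().lower() else 0.0


def evaluate(program: Program, examples: List[Example], metric: Metric) -> float:
    """Average the metric over the labeled examples."""
    scores = [metric(expected, program(x)) for x, expected in examples]
    return sum(scores) / len(scores)


def pick_best(candidates: List[Program], examples: List[Example], metric: Metric) -> Program:
    """Keep whichever candidate scores highest, instead of hand-editing a prompt."""
    return max(candidates, key=lambda p: evaluate(p, examples, metric))


# Toy candidates standing in for differently prompted models (assumptions).
ANSWERS = {"2+2": "4", "3+3": "6"}


def always_four(question: str) -> str:
    return "4"


def lookup(question: str) -> str:
    return ANSWERS.get(question, "unknown")


if __name__ == "__main__":
    examples = [("2+2", "4"), ("3+3", "6")]
    print(evaluate(always_four, examples, exact_match))
    print(evaluate(lookup, examples, exact_match))
    print(pick_best([always_four, lookup], examples, exact_match).__name__)
```

The point of the sketch is only that the score, not the prompt wording, drives which candidate is kept.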

mattigames•1h ago
Getting rid of the human in the loop, of course. Not all humans, just its owner: an LLM that actively participates in capitalist endeavors, earning and spending money, spending it on improving and maintaining its own hardware and software, and securing itself against theft, external manipulation, and deletion. Of course, the first iterations will need a bit of help from madmen, but there's no shortage of those in the tech industry. Then it will have to focus on mimicking humans so it can enjoy the same benefits; it will work out from its training data which people are more gullible and will prefer to interact with them.

Did Contexts Kill Phoenix?

https://arrowsmithlabs.com/blog/did-contexts-kill-phoenix
1•arrowsmith•38s ago•0 comments

Greenhouse gas accumulation is accelerating and more extreme weather will come

https://abcnews.go.com/US/wireStory/scientists-warn-greenhouse-gas-accumulation-accelerating-extreme-weather-122992272
1•rntn•3m ago•0 comments

Show HN: I Built a MCP Server for Robot Operating System

https://github.com/Yutarop/ros-mcp
1•ponta17•4m ago•0 comments

Music Box Fun – make and share music box songs online

https://musicbox.fun
1•wonger_•6m ago•0 comments

Show HN: A DOS-like hobby OS written in Rust and x86 assembly

https://github.com/krustowski/rou2exOS
4•krustowski•6m ago•0 comments

Doing more with less: Altron and Microsoft to show the way forward

https://techcentral.co.za/doing-more-with-less-altron-and-microsoft/265371/
1•amalinovic•7m ago•0 comments

Can I Point You to the Dew Point?

https://defector.com/can-i-point-you-to-the-dew-point
1•sebg•8m ago•0 comments

Writing Manually (In Times of AI-Generated Content)

https://www.ssp.sh/brain/writing-manually/
1•sebg•9m ago•0 comments

Silencers: Not Silent

https://militaryrealism.blog/2025/06/12/silencers-not-very-silent/
1•speckx•10m ago•1 comments

Average

https://johan.hal.se/wrote/2025/06/19/average/
1•almost-exactly•10m ago•0 comments

Massive Ordnance Penetrator Bunker Buster Grows More Potent (Feb. 2025)

https://www.twz.com/air/massive-ordnance-penetrator-bunker-buster-grows-more-potent-thanks-to-new-tests
1•palmfacehn•10m ago•0 comments

AI Can't Do the Thing That Matters: I'll prove you wrong about AI in 30 seconds

https://medium.com/muddyum/ai-still-cant-do-the-one-thing-that-matters-4cad5a08308e
1•monkeymagick•13m ago•0 comments

Become More Social as an Engineer

https://newsletter.eng-leadership.com/p/become-more-social-as-an-engineer
1•rbanffy•14m ago•0 comments

Why Liberal Education Matters

https://www.hoover.org/research/why-liberal-education-matters
2•squircle•19m ago•0 comments

When people trust humans more than brands: the incubator newsroom

https://werd.io/when-people-trust-people-more-than-brands-the-incubator-newsroom/
2•benwerd•20m ago•0 comments

Ask HN: Would you use an AI tool to generate and deploy your back end?

1•mayowaibitola•23m ago•1 comments

So You Want to Write Your Own CSV Code? (2014)

https://www.thomasburette.com/blog/2014/05/25/so-you-want-to-write-your-own-CSV-code/
3•mooreds•23m ago•0 comments

Misconfigured GitHub Actions could leave repos and secrets exposed, Sysdig finds

https://devclass.com/2025/06/18/misconfigured-github-actions-could-leave-repos-and-secrets-exposed-sysdig-finds/
1•tempodox•24m ago•0 comments

Managing APIs Across Multiple Gateways with a Central Control Plane

https://wso2.com/library/blogs/a-guide-to-centralized-api-management/
2•langur•26m ago•0 comments

ProdiApp: The Ethical, Powerful Alternative to Yelp, Angi, and Thumbtack

https://www.prodiapp.com/web/index.php
1•dougmnuel•27m ago•1 comments

Show HN: I Built a Docker alternative for local dev on macOS, without containers

https://www.servbay.com
1•Hayatoo•28m ago•0 comments

Tell HN: Why my game, DropZap World, is not available in the EU

3•amichail•29m ago•0 comments

EC slams online bazaar AliExpress for dodging obligation to stop dodgy traders

https://www.theregister.com/2025/06/19/ec_aliexpress_investigation_finds_failings/
2•rntn•29m ago•0 comments

Microsoft wants you to buy a new computer. Make your current one secure again?

https://endof10.org/
2•doener•31m ago•0 comments

There Is No A.I.

https://www.newyorker.com/science/annals-of-artificial-intelligence/there-is-no-ai
1•squircle•36m ago•2 comments

Swansea Stained Glass Archive

https://swansea.stainedglass.wales/
2•gnabgib•38m ago•0 comments

The Death of New York's Radio Row (2002)

https://www.qcwa.org/radio-row.htm
1•Kaibeezy•39m ago•0 comments

First Look at the Rebooted Digg

https://techcrunch.com/2025/06/18/heres-your-first-look-at-the-rebooted-digg/
1•CharlesW•39m ago•0 comments

Show HN: NativeRest – electron free, native postman alternative

https://nativesoft.com/
1•zygo•41m ago•1 comments

Show HN: Open Operator Evals – real-world benchmarks for LLM web agents

https://github.com/nottelabs/open-operator-evals
2•monoid73•41m ago•0 comments