frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

What's your AI development path?

12•blueeyer•7mo ago
A lot of AI tools/agents startups popping up lately and are accepted to YC. I assume in most cases people are not building and training those models from scratch? So what are the common paths/options when building MVP? Any cool communities/tools you can suggest?

Comments

resiros•7mo ago
That's a big topic :D

First thing about building AI or LLM powered apps is that 90% of the time you don't need to train models from scratch. LLMs are models that have the world's knowledge, so you can "train them" by prompting them and giving them the right context.

Where to start? I think the first thing is to play around with the OpenAI playground to get an idea of what's possible with prompts.

You'll find tons of frameworks out there: LangChain, LlamaIndex, Agno, and the list goes on. But in my opinion, if you have a use case in mind, start very very simple. Almost a single prompt using the OpenAI SDK or compatible SDK like LiteLLM.

The core challenge isn't technical complexity, it's the stochastic nature of LLMs. Your prompt works perfectly on one example, breaks on another. You fix it for the second case, and it stops working for the first.

The whole trick of building an LLM powered application is finding how to deal with this in an iterative approach. You need to have a workflow that allows you to iterate quickly. For this, start very simple and then add complexity. Don't use frameworks at the beginning.

I think Hamel's approach of error analysis (https://hamel.dev/notes/llm/officehours/erroranalysis.html) is a very good starting point. The idea is that you create a prompt, run it against a test set, then identify clusters of error types and systematically fix them.

What tools do you need? In the beginning, just a playground and a Jupyter Notebook. At some point when you want to look into LLMOps tools. They provide you with observability (so if you have an agent with multiple steps you can see each step), running evals and comparing results easily, creating test sets from prod data... All the stuff that makes your life easier.

There are a lot of these, langsmith, helicone, braintrust, humanloop.. Full disclosure, I'm CEO of one of these tools (Agenta) [one of the reasons I'm answering this post :D].

But I think from your question you're not at the stage where you should look at LLMOps yet. Focus on understanding your use case first and starting a simple repeatable process.

tom_m•7mo ago
Learn about LangChain and MCP. Honestly, 90%+ of apps and startups aren't doing anything unique or crazy. They're orchestrating and applying AI to various things. That means there's no moat, no secret sauce, and in many cases little effort. There's this concept of "AI slop" for content and I think it extends to startups/software as well.

Here's the thing. We're very likely to continue seeing VC funding in the slop. Counterintuitive, right? Why invest in these startups that are known to fail?

...because, if you connect the dots, the same VCs investing in these are those invested in the companies selling the LLMs. So it's literally funding companies to use and promote LLMs. Keep growing the hype and bubble. Keep normalizing LLM token cost. Keep playing on the FOMO. Bring on the AI doom.

It's absolutely worth having all of these startups fail for them. They'll spend millions and maybe billions in marketing. That's what most of these startups are - marketing. Pawns.

Remember, if you want to follow the whole (what I call) "AI doomer timeline" towards the end of it you'll realize that all/most SaaS startups will have a near zero or zero value. Why? Because anyone will be able to simply prompt AI to build the software they need. Not just "one size fits all" SaaS software that comes with limitations, missing features you need (but not enough of the other users need), other multi-tenant issues, and security concerns, and of course recurring cost...but people are going to be able to build CUSTOM single-tenant software with AI without recurring costs.

SaaS was always based on the age old question of buy vs. build. The only reason it works is because it cost more to build (and maintain). On our "AI doomer timeline" this will eventually shift and no longer be the case.

Ergo, these startups are mostly doomed for failure and it's in the best interest of VCs to invest in them to accelerate the timeline. Their goal is to increase AI adoption and phase out SaaS.

Show HN: One-click AI employee with its own cloud desktop

https://cloudbot-ai.com
1•fainir•1m ago•0 comments

Show HN: Poddley – Search podcasts by who's speaking

https://poddley.com
1•onesandofgrain•2m ago•0 comments

Same Surface, Different Weight

https://www.robpanico.com/articles/display/?entry_short=same-surface-different-weight
1•retrocog•4m ago•0 comments

The Rise of Spec Driven Development

https://www.dbreunig.com/2026/02/06/the-rise-of-spec-driven-development.html
2•Brajeshwar•9m ago•0 comments

The first good Raspberry Pi Laptop

https://www.jeffgeerling.com/blog/2026/the-first-good-raspberry-pi-laptop/
2•Brajeshwar•9m ago•0 comments

Seas to Rise Around the World – But Not in Greenland

https://e360.yale.edu/digest/greenland-sea-levels-fall
1•Brajeshwar•9m ago•0 comments

Will Future Generations Think We're Gross?

https://chillphysicsenjoyer.substack.com/p/will-future-generations-think-were
1•crescit_eundo•12m ago•0 comments

State Department will delete Xitter posts from before Trump returned to office

https://www.npr.org/2026/02/07/nx-s1-5704785/state-department-trump-posts-x
2•righthand•15m ago•1 comments

Show HN: Verifiable server roundtrip demo for a decision interruption system

https://github.com/veeduzyl-hue/decision-assistant-roundtrip-demo
1•veeduzyl•16m ago•0 comments

Impl Rust – Avro IDL Tool in Rust via Antlr

https://www.youtube.com/watch?v=vmKvw73V394
1•todsacerdoti•16m ago•0 comments

Stories from 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html
2•vinhnx•17m ago•0 comments

minikeyvalue

https://github.com/commaai/minikeyvalue/tree/prod
3•tosh•22m ago•0 comments

Neomacs: GPU-accelerated Emacs with inline video, WebKit, and terminal via wgpu

https://github.com/eval-exec/neomacs
1•evalexec•27m ago•0 comments

Show HN: Moli P2P – An ephemeral, serverless image gallery (Rust and WebRTC)

https://moli-green.is/
2•ShinyaKoyano•31m ago•1 comments

How I grow my X presence?

https://www.reddit.com/r/GrowthHacking/s/UEc8pAl61b
2•m00dy•32m ago•0 comments

What's the cost of the most expensive Super Bowl ad slot?

https://ballparkguess.com/?id=5b98b1d3-5887-47b9-8a92-43be2ced674b
1•bkls•33m ago•0 comments

What if you just did a startup instead?

https://alexaraki.substack.com/p/what-if-you-just-did-a-startup
5•okaywriting•40m ago•0 comments

Hacking up your own shell completion (2020)

https://www.feltrac.co/environment/2020/01/18/build-your-own-shell-completion.html
2•todsacerdoti•42m ago•0 comments

Show HN: Gorse 0.5 – Open-source recommender system with visual workflow editor

https://github.com/gorse-io/gorse
1•zhenghaoz•43m ago•0 comments

GLM-OCR: Accurate × Fast × Comprehensive

https://github.com/zai-org/GLM-OCR
1•ms7892•44m ago•0 comments

Local Agent Bench: Test 11 small LLMs on tool-calling judgment, on CPU, no GPU

https://github.com/MikeVeerman/tool-calling-benchmark
1•MikeVeerman•45m ago•0 comments

Show HN: AboutMyProject – A public log for developer proof-of-work

https://aboutmyproject.com/
1•Raiplus•45m ago•0 comments

Expertise, AI and Work of Future [video]

https://www.youtube.com/watch?v=wsxWl9iT1XU
1•indiantinker•46m ago•0 comments

So Long to Cheap Books You Could Fit in Your Pocket

https://www.nytimes.com/2026/02/06/books/mass-market-paperback-books.html
4•pseudolus•46m ago•2 comments

PID Controller

https://en.wikipedia.org/wiki/Proportional%E2%80%93integral%E2%80%93derivative_controller
1•tosh•50m ago•0 comments

SpaceX Rocket Generates 100GW of Power, or 20% of US Electricity

https://twitter.com/AlecStapp/status/2019932764515234159
2•bkls•50m ago•0 comments

Kubernetes MCP Server

https://github.com/yindia/rootcause
1•yindia•51m ago•0 comments

I Built a Movie Recommendation Agent to Solve Movie Nights with My Wife

https://rokn.io/posts/building-movie-recommendation-agent
4•roknovosel•51m ago•0 comments

What were the first animals? The fierce sponge–jelly battle that just won't end

https://www.nature.com/articles/d41586-026-00238-z
2•beardyw•1h ago•0 comments

Sidestepping Evaluation Awareness and Anticipating Misalignment

https://alignment.openai.com/prod-evals/
1•taubek•1h ago•0 comments