What's your AI development path?

12•blueeyer•11h ago

A lot of AI tools/agents startups popping up lately and are accepted to YC. I assume in most cases people are not building and training those models from scratch? So what are the common paths/options when building MVP? Any cool communities/tools you can suggest?

Comments

resiros•9h ago

That's a big topic :D

First thing about building AI or LLM powered apps is that 90% of the time you don't need to train models from scratch. LLMs are models that have the world's knowledge, so you can "train them" by prompting them and giving them the right context.

Where to start? I think the first thing is to play around with the OpenAI playground to get an idea of what's possible with prompts.

You'll find tons of frameworks out there: LangChain, LlamaIndex, Agno, and the list goes on. But in my opinion, if you have a use case in mind, start very very simple. Almost a single prompt using the OpenAI SDK or compatible SDK like LiteLLM.

The core challenge isn't technical complexity, it's the stochastic nature of LLMs. Your prompt works perfectly on one example, breaks on another. You fix it for the second case, and it stops working for the first.

The whole trick of building an LLM powered application is finding how to deal with this in an iterative approach. You need to have a workflow that allows you to iterate quickly. For this, start very simple and then add complexity. Don't use frameworks at the beginning.

I think Hamel's approach of error analysis (https://hamel.dev/notes/llm/officehours/erroranalysis.html) is a very good starting point. The idea is that you create a prompt, run it against a test set, then identify clusters of error types and systematically fix them.

What tools do you need? In the beginning, just a playground and a Jupyter Notebook. At some point when you want to look into LLMOps tools. They provide you with observability (so if you have an agent with multiple steps you can see each step), running evals and comparing results easily, creating test sets from prod data... All the stuff that makes your life easier.

There are a lot of these, langsmith, helicone, braintrust, humanloop.. Full disclosure, I'm CEO of one of these tools (Agenta) [one of the reasons I'm answering this post :D].

But I think from your question you're not at the stage where you should look at LLMOps yet. Focus on understanding your use case first and starting a simple repeatable process.

tom_m•5h ago

Learn about LangChain and MCP. Honestly, 90%+ of apps and startups aren't doing anything unique or crazy. They're orchestrating and applying AI to various things. That means there's no moat, no secret sauce, and in many cases little effort. There's this concept of "AI slop" for content and I think it extends to startups/software as well.

Here's the thing. We're very likely to continue seeing VC funding in the slop. Counterintuitive, right? Why invest in these startups that are known to fail?

...because, if you connect the dots, the same VCs investing in these are those invested in the companies selling the LLMs. So it's literally funding companies to use and promote LLMs. Keep growing the hype and bubble. Keep normalizing LLM token cost. Keep playing on the FOMO. Bring on the AI doom.

It's absolutely worth having all of these startups fail for them. They'll spend millions and maybe billions in marketing. That's what most of these startups are - marketing. Pawns.

Remember, if you want to follow the whole (what I call) "AI doomer timeline" towards the end of it you'll realize that all/most SaaS startups will have a near zero or zero value. Why? Because anyone will be able to simply prompt AI to build the software they need. Not just "one size fits all" SaaS software that comes with limitations, missing features you need (but not enough of the other users need), other multi-tenant issues, and security concerns, and of course recurring cost...but people are going to be able to build CUSTOM single-tenant software with AI without recurring costs.

SaaS was always based on the age old question of buy vs. build. The only reason it works is because it cost more to build (and maintain). On our "AI doomer timeline" this will eventually shift and no longer be the case.

Ergo, these startups are mostly doomed for failure and it's in the best interest of VCs to invest in them to accelerate the timeline. Their goal is to increase AI adoption and phase out SaaS.

Why, Why, Why, Eliza?

Detroit and Baltimore Built Local State Capacity to Bring Crime to New Lows

The 800k Hours Career Guide

Icechunk 1.0: Production-Grade Cloud-Native Array Storage Is Here

A Game About Typing the Alphabet

'Intelligent' copper tariffs will 'wake people up', says mining billionaire

Width and Depth: A Mental Model for Clearer Communication

WaitGroup.Go() in 1.25

Humanity's First Recorded Kiss Was Earlier Than We Thought

Finding Compiler Bugs: Cross-Language Code Generator and Differential Testing

The AI window is now

Rimac just snagged another 24 world records the Nevera R, including EV top speed

NASA's Most Experienced Staff Are Taking Buyouts

Show HN: FaceBlurify – browser‑only face blurring in <10 s

Yes, Overclocking Can Damage Your CPU – Here Are the Warning Signs

Mcphub.com Is Officially Launched

When the Dutch Tried to Live in Concrete Spheres

AI-Enabled Trash Trucks Will Scan Your Trash to Scold You About Recycling

Show HN: Cursor Rules Generator

Building an AI app builder with an AI app builder built with an AI app builder

Mystery AI Hype Theater 3000

SpaceX's Starlink Is Now Beaming Wi-Fi to 1k Planes

Slow down on building power plants for all those new AI datacenters report warns

Holographic memory storage and information processing in Quantum Brain Dynamics

Bitchat: notes on the path forward ("yo!")

U.S. will review social media for foreign student visa applications

Devanagari

Yours on Crossword, with Jonathan Wold and Luke Carbis

Show HN: Prompt to Spreadsheet

Mastra is now Apache 2.0 licensed