Launch HN: Plexe (YC X25) – Build production-grade ML models from prompts

https://www.plexe.ai/
42•vaibhavdubey97•2h ago
Hey HN! We're Vaibhav and Marcello, founders of Plexe (https://www.plexe.ai). We create production-ready ML models from natural language descriptions. Tell Plexe what ML problem you want to solve, point it at your data, and it handles the entire pipeline from feature engineering to deployment.

Here’s a walkthrough: https://www.youtube.com/watch?v=TbOfx6UPuX4.

ML teams waste too much time on generic heavy lifting. Every project follows the same pattern: 20% understanding objectives, 60% wrangling data and engineering features, 20% experimenting with models. Most of this is formulaic but burns months of engineering time. Throwing LLMs at it isn't the answer as that just trades engineering time for compute costs and worse accuracy. Plexe automates this repetitive 80%, so your team can work faster on what actually has value.

You describe your problem in plain English ("fraud detection model for transactions" or "product embedding model for search"), connect your data (Postgres, Snowflake, S3, direct upload, etc.), and then Plexe:

- Analyzes data and engineers features automatically
- Runs experiments across multiple architectures (logistic regression to neural nets)
- Generates comprehensive evaluation reports with error analysis, robustness testing, and prioritized recommendations to provide actionable guidance
- Deploys the best model with monitoring and automatic retraining
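The "runs experiments, deploys the best model" steps above can be pictured as a selection loop: fit several candidates on a training split, score each on a held-out split, and keep the winner. The sketch below is an illustration only, not Plexe's code. The two toy candidates (`MajorityClass`, `ThresholdRule`), the `select_best` helper and the data are all hypothetical.

```python
from collections import Counter


class MajorityClass:
    """Baseline: always predict the most common training label."""

    def fit(self, xs, ys):
        self.label = Counter(ys).most_common(1)[0][0]
        return self

    def predict(self, x):
        return self.label


class ThresholdRule:
    """One-feature rule: predict 1 when x >= a learned threshold."""

    def fit(self, xs, ys):
        # Try every observed value as the threshold; keep the most accurate one.
        def train_accuracy(t):
            return sum((x >= t) == bool(y) for x, y in zip(xs, ys)) / len(ys)

        self.threshold = max(sorted(set(xs)), key=train_accuracy)
        return self

    def predict(self, x):
        return int(x >= self.threshold)


def accuracy(model, xs, ys):
    return sum(model.predict(x) == y for x, y in zip(xs, ys)) / len(ys)


def select_best(candidates, train, holdout):
    """Fit each candidate on `train`; return (name, model, score) with the best holdout accuracy."""
    results = []
    for name, model in candidates.items():
        model.fit(*train)
        results.append((name, model, accuracy(model, *holdout)))
    return max(results, key=lambda r: r[2])


# Made-up data: transaction amount -> fraud flag.
train = ([5, 12, 20, 250, 300, 8, 400], [0, 0, 0, 1, 1, 0, 1])
holdout = ([10, 280, 15, 500], [0, 1, 0, 1])

best_name, best_model, best_score = select_best(
    {"majority": MajorityClass(), "threshold": ThresholdRule()}, train, holdout
)
print(best_name, best_score)
```

Scoring on a held-out split rather than on the training data is what keeps the comparison honest: a candidate that merely memorizes the training rows gains nothing from it.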

We did a Show HN for our open-source library five months ago (https://news.ycombinator.com/item?id=43906346). Since then, we've launched our commercial platform with interactive refinement, production-grade model evaluations, retraining pipeline, data connectors, analytics dashboards, and deployment for online and batch inference.

We use a multi-agent architecture where specialized agents handle different pipeline stages. Each agent focuses on its domain: data analysis, feature engineering, model selection, deployment, and so on. The platform tracks all experiments and generates exportable Python code.
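One common way to structure stage-per-specialist systems like the one described above is a list of stage functions that share a single state object. The sketch below assumes that design purely for illustration; it says nothing about how Plexe's agents are actually implemented. `PipelineState`, the stage functions and the selection rule are hypothetical, and the real agents are presumably far more involved.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class PipelineState:
    """Shared context passed from one stage to the next."""
    rows: List[dict]
    target: str
    features: List[str] = field(default_factory=list)
    chosen_model: Optional[str] = None
    log: List[str] = field(default_factory=list)


def analyze_data(state: PipelineState) -> None:
    columns = sorted(state.rows[0])
    state.log.append(f"analysis: {len(state.rows)} rows, columns {columns}")


def engineer_features(state: PipelineState) -> None:
    # Toy rule: every numeric column except the target becomes a feature.
    first = state.rows[0]
    state.features = sorted(
        k for k, v in first.items()
        if k != state.target and isinstance(v, (int, float))
    )
    state.log.append(f"features: {state.features}")


def select_model(state: PipelineState) -> None:
    # Toy rule standing in for real experiments.
    state.chosen_model = (
        "logistic_regression" if len(state.features) < 50 else "neural_net"
    )
    state.log.append(f"model: {state.chosen_model}")


STAGES: List[Callable[[PipelineState], None]] = [
    analyze_data,
    engineer_features,
    select_model,
]


def run_pipeline(state: PipelineState) -> PipelineState:
    for stage in STAGES:
        stage(state)  # each stage reads the shared state and records what it did
    return state


state = run_pipeline(PipelineState(
    rows=[
        {"amount": 12.5, "country": "IT", "is_fraud": 0},
        {"amount": 980.0, "country": "GB", "is_fraud": 1},
    ],
    target="is_fraud",
))
print("\n".join(state.log))
```

Keeping a log on the shared state mirrors the experiment tracking mentioned above: every stage leaves a record that can be inspected or exported later.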

Our open-source core (https://github.com/plexe-ai/plexe, Apache 2.0) remains free for local development. For the paid product, our pricing is usage-based, with a minimum top-up of $10. Enterprises can self-host the entire platform. You can sign up at https://console.plexe.ai. Use promo code `LAUNCHDAY20` to get $20 to try out the platform.

We’d love to hear your thoughts on the problem and feedback on the platform!

Comments

johnsillings•2h ago
very cool – I like how opinionated the product approach is vs. a bunch of disconnected tools for specialists to use (which seems more common for this space).
marcellodb•2h ago
Thanks, we're pretty opinionated on "this should make sense to non-ML practitioners" being a defining aspect of the product vision. Behind the scenes, we've had quite a few conversations specifically about how to avoid features feeling "disconnected", which is always challenging at an early stage when you're getting pulled in several directions by users with different use cases. Happy to hear it came across that way to you.
oxml•1h ago
Great product!
vaibhavdubey97•1h ago
Thank you! :)
tnt128•59m ago
In the demo, you didn’t show the process of cleaning and labeling data. Does your product do that somehow, or do you still expect the user to provide clean, labeled data after connecting the data source?
vaibhavdubey97•28m ago
We have a data enricher feature (still in beta) which uses LLMs to generate labels for your data. For cleaning and feature engineering, we use agents that handle it automatically once you've connected your data and defined your ML problem.

P.S. Thanks for the feedback on the video! We'll update it to show the cleaning and labelling process :)

marcellodb•27m ago
Great question, this is super important. The agents in the platform have the ability to do some degree of cleaning on your data when building a model (for example, imputing missing values). However, major improvements to data quality are generally not possible without an understanding of the data domain (i.e. business context), so you'll get better results if you "help" the platform by providing data in a reasonably clean state, answering the agent's follow-up questions in the chat, etc. By doing so you can give the agent better context and help it understand your data better, in which case it will also be more capable of dealing with things like missing values, misnamed columns etc.

This also highlights the important role of the user as a (potentially non-technical) domain expert. Hope that makes sense!
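To make the "imputing missing values" example above concrete, here is a minimal sketch of mean imputation in plain Python. It is an illustration under stated assumptions, not Plexe's implementation: the `impute_mean` helper and the sample rows are hypothetical.

```python
from statistics import mean


def impute_mean(rows, column):
    """Return copies of `rows` with None in `column` replaced by the mean of the observed values."""
    observed = [r[column] for r in rows if r[column] is not None]
    if not observed:
        raise ValueError(f"no observed values in column {column!r}")
    fill = mean(observed)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]


rows = [{"amount": 10.0}, {"amount": None}, {"amount": 30.0}]
cleaned = impute_mean(rows, "amount")
print(cleaned)
```

This is also where domain knowledge matters: if a missing `amount` really means "no transaction", filling in the mean would be wrong, and only someone who knows the data can say so.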

brightstar18•48m ago
Product seems cool. But can you help me understand if what you are doing is different from the following:

> you put a prompt
> Plexe glorifies that prompt into a bigger prompt with more specific instructions (augmented by schema definitions, intent and whatnot)
> plug it into the provided model/LLM
> .predict() gives me the output (which was heavily guardrailed by the glorified prompt in the step 2)
marcellodb•34m ago
Great question, and yes, it's quite different: Plexe generates code for a pipeline that processes your dataset (analysis, feature engineering, etc) and trains a custom ML model for your use case. When you call `.predict()`, it is that trained custom model that provides the response, not an LLM. The model is also hosted for you, and Plexe takes care of MLOps things like letting you retrain the model on new data, evaluating the model performance for you, etc. Using custom specialised models is generally more effective, faster and cheaper compared to running your predictions through an LLM when you have a lot of data specific to your business.
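The distinction above, a trained model answering `.predict()` rather than an LLM, can be illustrated with a toy example. The sketch below trains a one-feature logistic regression with plain gradient descent; once fitted, a prediction is just arithmetic on two learned numbers. `TinyLogisticRegression` and the data are hypothetical and far simpler than anything a real pipeline would produce.

```python
import math


class TinyLogisticRegression:
    """One-feature logistic regression trained with batch gradient descent."""

    def __init__(self, lr=0.1, epochs=2000):
        self.lr = lr
        self.epochs = epochs
        self.w = 0.0
        self.b = 0.0

    def _prob(self, x):
        return 1.0 / (1.0 + math.exp(-(self.w * x + self.b)))

    def fit(self, xs, ys):
        n = len(xs)
        for _ in range(self.epochs):
            # Gradients of the mean log-loss with respect to w and b.
            errors = [self._prob(x) - y for x, y in zip(xs, ys)]
            grad_w = sum(e * x for e, x in zip(errors, xs)) / n
            grad_b = sum(errors) / n
            self.w -= self.lr * grad_w
            self.b -= self.lr * grad_b
        return self

    def predict(self, x):
        # No prompt and no LLM call: just the learned weight and bias.
        return int(self._prob(x) >= 0.5)


# Made-up, already-scaled feature values with binary labels.
xs = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
ys = [0, 0, 0, 1, 1, 1]

model = TinyLogisticRegression().fit(xs, ys)
print(model.predict(-1.5), model.predict(1.5))
```

Because inference is a handful of floating-point operations, it is easy to see why a small task-specific model can be faster and cheaper per prediction than routing every request through an LLM.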
ryanmerket•19m ago
Really diggin this. Can't wait to try it out.
sinanuozdemir•16m ago
Sounds interesting! I'm trying to train a model and it's still "processing" after a bit, but I get that fine-tuning takes a while. I'm having trouble understanding how it infers the schema. I used a sample dataset, and yet the sample inference curl uses a blank JSON?

curl -X POST "XXX/infer" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_API_KEY" \
  -d '{}'

How do I know what the inputs/outputs are for one of my models? I see I could have set the response variable manually before training but I was hoping the auto-infer would work.

Separately, when I ask for a model that you don't seem able to train (I asked for an embedding model as a test), it'd be ideal if the platform told me it couldn't do that, instead of making me choose a dataset that has nothing to do with what I asked for.

All in all, super cool space, I can't wait to see more!

I'm a former YC founder turned investor living in Dogpatch. I'd love to chat more if you're down!

lcnlvrz•9m ago
How does it perform when building computer vision models?
