frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

An autonomous MCP Data Engine that handles the dirty work

https://vespermcp.lovable.app/
1•sutaniese•1h ago

Comments

sutaniese•1h ago
Hey everyone, We’ve all been there. You have a cool idea for a model or a RAG pipeline, but before you can do anything interesting, you’re stuck in "Data Hell" for three hours. You’re jumping between tabs to find a dataset, manually checking for missing values, realizing the schema is a mess, and praying there’s no PII (emails/phones) hidden in the CSV. It’s tedious, repetitive, and frankly, it’s the reason many projects die before the first training run. I decided to fix this by building Vesper. It’s a Model Context Protocol (MCP) server that turns your AI into a full-stack data engineer. Instead of you writing cleaning scripts, you just tell your AI what you need. Here is what Vesper actually does: Universal Search: Query thousands of datasets across HuggingFace, Kaggle, and even specialized sources like UCI, GitHub, World Bank, and NASA simultaneously. Deep Quality Analysis: Runs automated audits to detect outliers, duplicates, and schema anomalies (like numbers stored as strings). Multimodal Support: Beyond tabular data (CSV/Parquet), it handles images, audio, and video, including automated annotation and quality checks. Self-Healing Pipelines: Automatically generates a cleaning plan to impute missing values, remove outliers using IQR, and encode categorical data. JIT Ingestion & Performance: Instantly downloads data and uses Dask or Spark for distributed processing of massive datasets. Privacy & Compliance: Vesper never sees your data, everything is local. Async Job Management: Long-running tasks run in the background with live progress bars streamed directly to your chat interface. Developer Collaboration: Features self-versioning, personalized recommendations, and easy export to Jupyter Notebooks or Git. I’m opening a Waitlist today because I need feedback from people who actually deal with messy data every day. I want to know which "janitor" tasks you hate the most so I can refine the engine.

(sorry for using lovable. I used it to spin up a waitlist quickly for validation while I focus on the tech) I'll be hanging out in the comments to answer anything technical! Thanks!

Show HN: Ember-mug – I made a CLI for the Ember Coffee Mug

https://ember-mug.benjaminjsinger.com/
1•singerbj•31s ago•0 comments

Show HN: Open-source taxonomy of 122 AI/LLM attack vectors

1•manuelnd•1m ago•0 comments

Show HN: AI Config – Keep Claude / Codex / Gemini / OpenCode Configs in Sync

https://github.com/azat-io/ai-config
1•azat_io•1m ago•0 comments

Artemis II Wet Dress Rehearsal: Test Terminated at T-5:15

https://www.nasa.gov/blogs/missions/2026/02/03/artemis-ii-wet-dress-rehearsal-test-terminated-at-...
1•bookofjoe•2m ago•0 comments

The Jule Programming Language

https://jule.dev/
1•PaulHoule•3m ago•0 comments

Show HN: ChibiGenerator – Generate chibi-style characters from photos using AI

https://www.chibigenerator.com/
1•hoxihan•5m ago•0 comments

Signal-First Architectures: Rethinking Front-End Reactivity

https://arxiv.org/abs/2506.13815
1•buibuibui•6m ago•0 comments

Show HN: I built a client-side AI background remover (100% Free)

https://toolsaid.com/image-background-remover
1•raihaninfo•6m ago•0 comments

A collection of packages for developing web applications with Node.js

https://github.com/radically-straightforward/radically-straightforward
1•mfbx9da4•8m ago•0 comments

Building a Sync Engine from Scratch

https://hakanshehu.com/posts/building-the-colanode-sync-engine/
1•hakanshehu•8m ago•1 comments

Philosophy of Science Is Fascinating

https://jrhawley.ca/2026/02/03/philosophy-of-science-is-fascinating
1•jrhawley•8m ago•0 comments

Ask HN: How do you manage long running AI conversations?

1•boh144•8m ago•0 comments

Private Equity's Giant Software Bet Has Been Upended by AI

https://www.bloomberg.com/news/articles/2026-02-03/private-equity-s-giant-software-bet-has-been-u...
1•swexbe•8m ago•0 comments

I hacked Datastar to support Web Components

https://ajmoon.com/posts/joyus-i-hacked-datastar-to-support-web-components
1•alex-moon•8m ago•1 comments

Show HN: Difi – Git diff TUI with NVIM support built with Go and Bubbletea

https://github.com/oug-t/difi
1•oug-t•10m ago•1 comments

Are We in a Software Bubble?

https://bystam.github.io/takes/2026/02/02/are-we-in-a-software-bubble.html
1•byrre_b•11m ago•0 comments

The Disconnected Git Workflow

https://ploum.net/2026-01-31-offline-git-send-email.html
1•birdculture•11m ago•0 comments

China bans hidden car door handles after deadly incidents [video]

https://www.youtube.com/watch?v=bJ386noAZQ8
2•mgh2•11m ago•0 comments

The AI Productivity Paradox

https://www.platformer.news/ai-productivity-paradox-metr-pwc-workday/
1•speckx•12m ago•0 comments

CLI Is the New MCP

https://oneuptime.com/blog/post/2026-02-03-cli-is-the-new-mcp/view
1•ndhandala•13m ago•0 comments

Tell HN: OpenAI's Codex CLI is currently free to use

2•davidpolberger•14m ago•0 comments

Escape from the Monolith, Green CI and Red Production, the Zombie Project

https://failhub.substack.com/p/failhub-issue-4
1•khambir•14m ago•0 comments

MindGuard: Open-source safety classifiers for mental health AI

https://swordhealth.com/newsroom/introducing-mindguard
4•RicardoRei•15m ago•1 comments

Software Development: Sixty years of learning the same lesson

https://blog.robbowley.net/2026/01/30/sixty-years-of-learning-the-same-lesson/
1•dpflan•16m ago•0 comments

X marks the raid: French cops swoop on Musk's Paris ops

https://www.theregister.com/2026/02/03/french_police_raid_x/
3•coloneltcb•17m ago•0 comments

Website accepts claims for 1940s atomic weapons radiation exposure in New Mexico

https://www.koat.com/article/online-portal-launched-for-radiation-exposure-claims/70210559
1•bookofjoe•17m ago•1 comments

Ask HN: How much does ATS parsing penalize modern CV layouts?

1•ATSPASSKIT•19m ago•1 comments

Show HN: FunBox – A suite of interactive tools for board games and parties

https://funbox.space
1•zealer•22m ago•1 comments

Displaying Letterboxd Like Counts on My Movie Reviews

https://www.joshbeckman.org/blog/practicing/displaying-letterboxd-like-counts-on-my-movie-reviews
1•blenderob•22m ago•0 comments

3D map of the sun's magnetic interior could improve predictions of solar flares

https://phys.org/news/2026-01-3d-sun-magnetic-interior-disruptive.html
1•wglb•25m ago•1 comments