Show HN: Doctor – tool to crawl and index websites and MCP server for LLM agents

https://github.com/sisig-ai/doctor
1•kixpanganiban•3h ago

Comments

kixpanganiban•3h ago
Hi! I wrote Doctor because I keep struggling to ground the model on docs when working with agentic code editors (e.g. Roo, Claude Code).

Doctor uses crawl4ai to crawl websites, then chunks and embeds the pages with LangChain + LiteLLM + OpenAI, and finally stores all the vectors in DuckDB. This lets your LLM query the docs using semantic search over MCP, giving it grounded, up-to-date information about the things you're working on.

It requires an OpenAI key for the embedding process, but I'm working on giving users options in the future (different providers, local embedding using something like DPR or other transformer libs, etc.)
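
For anyone who wants a feel for the chunk, embed, store, search flow described above, here is a minimal sketch. It is not Doctor's code. It makes several assumptions: a toy hashed bag-of-words vector stands in for the OpenAI embeddings, a plain Python list stands in for DuckDB, and the crawling and MCP layers are left out entirely. All function names (`chunk_text`, `embed`, `build_index`, `search`) and the example.com URLs are hypothetical.

```python
import hashlib
import math
import re

DIM = 1024  # size of the toy embedding vector (arbitrary choice)


def chunk_text(text, size=40, overlap=10):
    """Split text into chunks of up to `size` words, sharing `overlap` words."""
    words = text.split()
    step = size - overlap
    chunks = []
    i = 0
    while i < len(words):
        chunks.append(" ".join(words[i:i + size]))
        if i + size >= len(words):
            break  # the last chunk already reaches the end of the text
        i += step
    return chunks


def embed(text):
    """Toy stand-in for a real embedding model: hashed bag of words, L2-normalized."""
    vec = [0.0] * DIM
    for token in re.findall(r"[a-z0-9]+", text.lower()):
        digest = hashlib.md5(token.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def build_index(pages):
    """pages maps url -> page text. Returns a list of (url, chunk, vector) rows."""
    return [
        (url, chunk, embed(chunk))
        for url, text in pages.items()
        for chunk in chunk_text(text)
    ]


def search(index, query, k=3):
    """Return the k rows most similar to the query as (score, url, chunk)."""
    q = embed(query)
    scored = [
        (sum(a * b for a, b in zip(q, vec)), url, chunk)  # cosine, vectors are unit length
        for url, chunk, vec in index
    ]
    scored.sort(key=lambda row: row[0], reverse=True)
    return scored[:k]


# Hypothetical pages standing in for crawled documentation.
pages = {
    "https://example.com/install": "Install the package with pip and set the API key.",
    "https://example.com/query": "Run a semantic search query against the indexed chunks.",
}
index = build_index(pages)
for score, url, chunk in search(index, "how do I install with pip", k=1):
    print(f"{score:.2f} {url}")
```

In a real setup the `embed` call would go to an embedding model and the rows would live in a database table, but the shape of the query path (embed the question, rank stored chunks by similarity, return the top few) would likely stay the same.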

downrightmike•3h ago
And now we have a new bot for Cloudflare to send to the infinite AI crawler labyrinth

Hardware testing automation: a status update

https://postmarketos.org/blog/2025/05/13/hw-ci-status/
1•yorwba•7m ago•0 comments

The Problem with Washout Periods

https://www.exfatloss.com/p/the-problem-with-washout-periods
1•paulpauper•7m ago•0 comments

India-Pakistan Risks Loom

https://www.bloomberg.com/news/features/2025-05-11/trump-negotiated-india-pakistan-ceasefire-adds-new-risks-to-kashmir-conflict
1•colonCapitalDee•9m ago•1 comment

U.S. EPA takes aim at start-stop systems in cars

https://www.wral.com/news/local/epa-targets-start-stop-systems-may-2025/
1•walterbell•10m ago•0 comments

A Live Look at the Senate AI Hearing

https://thezvi.substack.com/p/a-live-look-at-the-senate-ai-hearing
1•paulpauper•10m ago•0 comments

Show HN: Ad Sniper. a simple distraction remover for Firefox

https://github.com/cab11150904/AdSniper
1•WarcrimeActual•19m ago•0 comments

The Case for the Death Penalty

https://unherd.com/2025/05/the-case-for-the-death-penalty/
1•Tomte•24m ago•0 comments

Tesla Model Y Indoor Cabin Radar Teardown [video]

https://www.youtube.com/watch?v=QSMJeUvjAcs
1•sudonanohome•24m ago•0 comments

Revolutionizing SaaS for Legal, Finance and Compliance – Meet Agami Technologies

https://agamitechnologies.com/
1•qareena•25m ago•1 comment

Bye, bye Solaris, it was a nice ride while it lasted (2017)

https://itwire.com/opinion-and-analysis-sp-481/open-sauce/79738-bye,-bye-solaris,-it-was-a-nice-ride-while-it-lasted.html
1•TMWNN•28m ago•0 comments

Google forced publishers to accept AI scraping as price of appearing in search

https://pressgazette.co.uk/platforms/how-google-forced-publishers-to-accept-ai-scraping-as-price-of-appearing-in-search/
2•thm•28m ago•0 comments

There Is a Monster in the Forest

https://bsky.app/profile/joles.bsky.social/post/3logjuqggkk2q
1•kentbrew•30m ago•0 comments

Acoustic modulation of mechanosensitive genes and adipocyte differentiation

https://www.nature.com/articles/s42003-025-07969-1
1•walterbell•31m ago•0 comments

Open-source ML agent turns natural language into trained models

https://old.reddit.com/r/artificial/comments/1kkag85/we_built_an_opensource_ml_agent_that_turns/
1•felineflock•34m ago•0 comments

Google Facing at Least €12B in Civil Claims Across Europe

https://www.bloomberg.com/news/articles/2025-05-13/google-facing-at-least-12-billion-in-civil-claims-across-europe
2•mfiguiere•35m ago•0 comments

Bioelectrical synchronization of Picea abies during a solar eclipse

https://royalsocietypublishing.org/doi/10.1098/rsos.241786
1•doodlebugging•37m ago•1 comment

An LLM That Remembers over 300 Conversation Turns: HEMA Research Paper

https://www.haebom.dev/archive?tl=en&post=7916x82r8k8j524kpyg3
1•haebom•37m ago•0 comments

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•42m ago•0 comments

Lovart AI – Revolutionary AI Design Agent

https://www.lovart.ai
1•xingwu82•44m ago•1 comment

How I Enhanced Loki to Support Petabyte-Scale Log Queries

https://honganan.github.io/2025/03/07/How-I-Enhanced-Loki-to-Support-Petabyte-Scale-Log-Queries/
2•PrayagS•48m ago•0 comments

PSTodoWarrior – PS CLI for Todo.txt Format

https://github.com/pauby/PSTodoWarrior
2•keepamovin•51m ago•0 comments

Quik – beautiful SMS app for Android revived

https://github.com/octoshrimpy/quik
1•keepamovin•52m ago•1 comment

List of Blog Platforms

https://manuelmoreale.com/blog-platforms
1•surprisetalk•56m ago•0 comments

I built a beautiful and powerful admin dashboard template – and it's open source

https://github.com/Daymychen/art-design-pro
1•TT9601•57m ago•0 comments

Eucalyptus for Brazil's steelmaking dries out communities in Minas Gerais

https://news.mongabay.com/2025/04/eucalyptus-for-brazils-steelmaking-dries-out-communities-in-minas-gerais/
2•PaulHoule•1h ago•0 comments

Show HN: Triplex, a visual workspace for React / Three Fiber

https://github.com/trytriplex/triplex
2•madou•1h ago•0 comments

TTY Trainable Game Engine

https://github.com/ivanbelenky/tty_games
2•ivanbelenky•1h ago•0 comments

One Alien's Trash Is Another Alien's Treasure

https://thebsdetector.substack.com/p/one-aliens-trash-is-another-aliens
2•surprisetalk•1h ago•0 comments

Module Catalog: Reusable building blocks for your software factory

https://dagger.io/blog/module-catalog-insights
1•gk1•1h ago•0 comments

TransMLA: Multi-Head Latent Attention Is All You Need

https://arxiv.org/abs/2502.07864
2•ocean_moist•1h ago•0 comments