frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Now send your marketing campaigns directly from ChatGPT

https://www.mail-o-mail.com/
1•avallark•3m ago•1 comments

Queueing Theory v2: DORA metrics, queue-of-queues, chi-alpha-beta-sigma notation

https://github.com/joelparkerhenderson/queueing-theory
1•jph•15m ago•0 comments

Show HN: Hibana – choreography-first protocol safety for Rust

https://hibanaworks.dev/
3•o8vm•17m ago•0 comments

Haniri: A live autonomous world where AI agents survive or collapse

https://www.haniri.com
1•donangrey•17m ago•1 comments

GPT-5.3-Codex System Card [pdf]

https://cdn.openai.com/pdf/23eca107-a9b1-4d2c-b156-7deb4fbc697c/GPT-5-3-Codex-System-Card-02.pdf
1•tosh•30m ago•0 comments

Atlas: Manage your database schema as code

https://github.com/ariga/atlas
1•quectophoton•33m ago•0 comments

Geist Pixel

https://vercel.com/blog/introducing-geist-pixel
2•helloplanets•36m ago•0 comments

Show HN: MCP to get latest dependency package and tool versions

https://github.com/MShekow/package-version-check-mcp
1•mshekow•44m ago•0 comments

The better you get at something, the harder it becomes to do

https://seekingtrust.substack.com/p/improving-at-writing-made-me-almost
2•FinnLobsien•45m ago•0 comments

Show HN: WP Float – Archive WordPress blogs to free static hosting

https://wpfloat.netlify.app/
1•zizoulegrande•47m ago•0 comments

Show HN: I Hacked My Family's Meal Planning with an App

https://mealjar.app
1•melvinzammit•47m ago•0 comments

Sony BMG copy protection rootkit scandal

https://en.wikipedia.org/wiki/Sony_BMG_copy_protection_rootkit_scandal
1•basilikum•50m ago•0 comments

The Future of Systems

https://novlabs.ai/mission/
2•tekbog•50m ago•1 comments

NASA now allowing astronauts to bring their smartphones on space missions

https://twitter.com/NASAAdmin/status/2019259382962307393
2•gbugniot•55m ago•0 comments

Claude Code Is the Inflection Point

https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point
3•throwaw12•56m ago•1 comments

Show HN: MicroClaw – Agentic AI Assistant for Telegram, Built in Rust

https://github.com/microclaw/microclaw
1•everettjf•56m ago•2 comments

Show HN: Omni-BLAS – 4x faster matrix multiplication via Monte Carlo sampling

https://github.com/AleatorAI/OMNI-BLAS
1•LowSpecEng•57m ago•1 comments

The AI-Ready Software Developer: Conclusion – Same Game, Different Dice

https://codemanship.wordpress.com/2026/01/05/the-ai-ready-software-developer-conclusion-same-game...
1•lifeisstillgood•59m ago•0 comments

AI Agent Automates Google Stock Analysis from Financial Reports

https://pardusai.org/view/54c6646b9e273bbe103b76256a91a7f30da624062a8a6eeb16febfe403efd078
1•JasonHEIN•1h ago•0 comments

Voxtral Realtime 4B Pure C Implementation

https://github.com/antirez/voxtral.c
2•andreabat•1h ago•1 comments

I Was Trapped in Chinese Mafia Crypto Slavery [video]

https://www.youtube.com/watch?v=zOcNaWmmn0A
2•mgh2•1h ago•1 comments

U.S. CBP Reported Employee Arrests (FY2020 – FYTD)

https://www.cbp.gov/newsroom/stats/reported-employee-arrests
1•ludicrousdispla•1h ago•0 comments

Show HN: I built a free UCP checker – see if AI agents can find your store

https://ucphub.ai/ucp-store-check/
2•vladeta•1h ago•1 comments

Show HN: SVGV – A Real-Time Vector Video Format for Budget Hardware

https://github.com/thealidev/VectorVision-SVGV
1•thealidev•1h ago•0 comments

Study of 150 developers shows AI generated code no harder to maintain long term

https://www.youtube.com/watch?v=b9EbCb5A408
2•lifeisstillgood•1h ago•0 comments

Spotify now requires premium accounts for developer mode API access

https://www.neowin.net/news/spotify-now-requires-premium-accounts-for-developer-mode-api-access/
2•bundie•1h ago•0 comments

When Albert Einstein Moved to Princeton

https://twitter.com/Math_files/status/2020017485815456224
1•keepamovin•1h ago•0 comments

Agents.md as a Dark Signal

https://joshmock.com/post/2026-agents-md-as-a-dark-signal/
2•birdculture•1h ago•1 comments

System time, clocks, and their syncing in macOS

https://eclecticlight.co/2025/05/21/system-time-clocks-and-their-syncing-in-macos/
1•fanf2•1h ago•0 comments

McCLIM and 7GUIs – Part 1: The Counter

https://turtleware.eu/posts/McCLIM-and-7GUIs---Part-1-The-Counter.html
3•ramenbytes•1h ago•0 comments
Open in hackernews

Show HN: CrawlerCheck – A tool to check if a site is blocking crawlers

https://crawlercheck.com/
2•bogozi•7mo ago
Hi HN,

I'm a long-time developer and SEO consultant. Over the years, I've seen clients suffer from a simple, costly mistake: accidentally blocking Googlebot or other important crawlers with a misplaced rule in robots.txt or a noindex tag.

Manually checking the robots.txt file, then the page's meta tags, then the X-Robots-Tag HTTP header is a tedious process. I wanted a tool that would do it all in one shot and give me a clear answer.

So, I built CrawlerCheck. You give it a URL, and it checks all three sources of crawler directives to tell you if a page is accessible.

The backend is written in Go, and the frontend is a lightweight Svelte app. The goal was to make it as fast and reliable as possible.

It's a brand new project, and I'd love to get some honest feedback from the HN community. Thanks for taking a look.

Comments

8organicbits•7mo ago
Looks pretty helpful, thanks for building this.

Minor suggestion. Consider sorting the checks by status, or adding a summary at the top. I needed to scroll to find if anything was blocked.

I don't know enough about the SEO space, but would a llms.txt check also help?

bogozi•7mo ago
Thanks so much for the great feedback!

That's a fantastic point about adding a summary. I agree, it would make the results much easier to scan, especially because I'm planning to add even more rules. I'll definitely work on adding that.

Good point on llms.txt too. I'm watching that standard evolve and plan to add it once it's more established. Appreciate you bringing it up!

bogozi•6mo ago
Hi. I have implemented the summary and indeed it looks and feels a lot quicker this way. I have also added scrolling functionalities so it's a bit easier to navigate as well. "but would a llms.txt check also help" - not sure about it as it is not standardized yet. Big players still need to adapt it or come up with a better solution. Will wait and see until then.

I hope you'll find the updates useful. Thanks for the great feedback.

bogozi•7mo ago
Just wanted to post a quick follow-up on this project for anyone who might see it.

After the initial launch, I spent some time hardening the core logic and polishing the UI. I've just pushed a significant update with several improvements that make the tool much more reliable.

Here’s what’s new:

Smarter robots.txt parsing: It now correctly handles group precedence (so the most specific rule wins), wildcards (*), end-anchors ($), and rules with query strings (?) to better match how Googlebot actually interprets these files.

Better error handling: I've polished the error messages on both the frontend and backend, so it's much clearer what's happening if a URL can't be fetched or a robots.txt file is malformed.

Mobile & accessibility fixes: Pushed a few improvements for the experience on smaller screens and for better overall accessibility.

Just wanted to log the progress here as I continue to work on the tool.

Cheers.

bogozi•7mo ago
In the meantime I have implemented a publicly available changelog

Here is what's new in v1.1.1

- Backend HTTP client now mimics real browsers more closely for improved compatibility with strict sites.

- Automatic cookie handling and support for HTTP/2 enabled.

- Referer header now transparently set to crawlercheck.com for all requests.

- Improved diagnostics: HTTP status code and response body are logged for non-2xx responses to help troubleshoot blocks.

- All previous features and compatibility with simpler sites are preserved.