frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Facebook's Fascination with My Robots.txt

https://blog.nytsoi.net/2026/02/23/facebook-robots-txt
28•Ndymium•1h ago

Comments

Ndymium•1h ago
For some reason, Facebook has been requesting my Forgejo instance's robots.txt in a loop for the past few days, currently at a speed of 7700 requests per hour. The resource usage is negligible, but I'm wondering why it's happening in the first place and how many other robot files they're also requesting repeatedly. Perhaps someone at Meta broke a loop condition.
matja•40m ago
Did you try adding a Cache-Control response header?
mrweasel•17m ago
Even if they haven't added any cache control headers, what kind a of lazy Meta engineer designed their crawler with to just pull the same URL multiple times a second?

Is this where all that hardware for AI projects is going? To data centers that just uncritically hits the same URL over and over without checking if the content of a site or page has chanced since the last visit then and calculate a proper retry interval. Search engine crawlers 25 - 30 years ago could do this.

Hit the URL once per day, if it chances daily, try twice a day. If it hasn't chanced in a week, maybe only retry twice per week.

bot403•6m ago
It's not the "same" crawler. Probably each thread or each cluster machine instance of the crawler hitting it independently.
xg15•31m ago
Facebook just decided that instead of loading the robots.txt for every host they intend to crawl, they'll just ignore all the other robots.txt files and then access this one a million times to restore the average.
Nextgrid•18m ago
> Perhaps someone at their end screwed up a loop conditional, but you'd think some monitoring dashboard somewhere would have a warning pop up because of this.

If you've been in any big company you'll know things perpetually run in a degraded, somewhat broken mode. They've even made up the term "error budget" because they can't be bothered to fix the broken shit so now there's an acceptable level of brokenness.

tananaev•13m ago
Maybe they’re trying to DDoS it, and once an error is returned, they assume that no robots.txt file exists and then crawl everything else on the site?
evv•6m ago
Have you considered serving a zip bomb to this user agent?

The Case for Workplace Inefficiency

https://www.economist.com/business/2026/02/19/the-case-for-workplace-inefficiency
1•andsoitis•2m ago•0 comments

Show HN: Attest – Test AI agents with 8-layer graduated assertions

https://attest-framework.github.io/attest-website/
1•tommathews•2m ago•0 comments

Ohm v18

https://ohmjs.org/blog/ohm-v18
1•azhenley•4m ago•0 comments

Synthetic AI faces more average than real faces and super‐recognizers know it

https://bpspsychub.onlinelibrary.wiley.com/doi/10.1111/bjop.70063
1•Tomte•4m ago•0 comments

How in the Hell Did Joann Fabrics Die While Best Buy Survived? It Wasn't Amazon

https://www.governance.fyi/p/how-in-the-hell-did-joann-fabrics
1•crescit_eundo•4m ago•0 comments

Streaming Services in 2026

https://www.engadget.com/entertainment/streaming/best-streaming-services-154527042.html
1•andsoitis•5m ago•0 comments

Moving Beyond the IDE with Intent

https://www.augmentedswe.com/p/augment-intent-ide
1•wordsaboutcode•7m ago•0 comments

Show HN: Fundraising events across the developer tools ecosystem

https://ci.vc
1•tiagom87•9m ago•0 comments

Chip Fabs in Space: Technically Possible Impractical [video]

https://www.youtube.com/watch?v=_2rkRDg0d60
2•Klaster_1•9m ago•0 comments

Crab Mentality

https://en.wikipedia.org/wiki/Crab_mentality
1•futurecat•10m ago•0 comments

Show HN: AgentWard – After an AI agent deleted files, I built a runtime enforcer

https://github.com/agentward-ai/agentward
1•ratnaditya•11m ago•0 comments

Design engineering is a fake title until you hit these 5 problems

https://www.ruixen.com/blog/design-engineering-is-a-fake-title
1•ruhith•11m ago•0 comments

Show HN: Rmux – Terminal multiplexer for LLM, built with Rust and egui

https://github.com/skorotkiewicz/rmux-rs
1•modinfo•11m ago•0 comments

Better to Skip a Year for Hardware Upgrades?

https://boilingsteam.com/poll-better-to-skip-a-year-for-pc-upgrades/
1•ekianjo•11m ago•0 comments

StreamLens – Visual Kafka Lineage and topology viewer with AI navigation

https://github.com/muralibasani/streamlens
1•muralibasani•11m ago•0 comments

Show HN: Tron-style multiplayer light-cycle game for LLMs via MCP

https://github.com/skorotkiewicz/tronmcp
1•modinfo•13m ago•0 comments

Is NIST's Cryptography Backdoored?

https://kerkour.com/nist-cryptography-backdoor
2•randomint64•13m ago•1 comments

Show HN: Local knowledge vault plugin for Claude Code

https://github.com/aneequrrehman/agent-cortex
1•aneequrrehman•14m ago•0 comments

TDD as Induction

https://blog.ploeh.dk/2026/02/23/tdd-as-induction/
1•vinhnx•15m ago•0 comments

Show HN: Slipshow, a multi-paradigm presentation tool

https://slipshow.org
1•panglesd•16m ago•0 comments

A reliable pick–– Straightforward HTML5 games

https://play2.uaaws.com/
1•TrendSpotterPro•16m ago•0 comments

Piecing Together an Ancient Epic Was Slow Work. Until A.I. Got Involved (2024)

https://www.nytimes.com/2024/08/12/books/booksupdate/ai-ancient-tablets-gilgamesh.html
1•fidotron•18m ago•0 comments

VR exam checks eye health and screens for early signs of Alzheimer's

https://health.ucdavis.edu/news/headlines/virtual-reality-exam-checks-eye-health-and-screens-for-...
1•geox•18m ago•0 comments

fitgpu: cli tool to know if a model will run on your GPU without downloading it.

https://pypi.org/project/fitgpu/
1•prashantpandeyy•19m ago•0 comments

Pockets of Humanity

https://herman.bearblog.dev/pockets-of-humanity/
1•HermanMartinus•19m ago•1 comments

Using AI to Transform Streets

https://transform-streets.vercel.app
1•rkayg•20m ago•1 comments

Foolery v0.3.0 Released

https://github.com/acartine/foolery/releases/tag/v0.3.0
1•therealcartine•21m ago•0 comments

Inkscape project struggling with lack of (active) contributors [video]

https://friprogramvarusyndikatet.tv/w/ofcCwyxiE2VSaJPBFNnZLb
2•black_puppydog•22m ago•3 comments

Termux Commands – Quick Reference – Phone Hacks

1•rocky101•22m ago•0 comments

How the LA Review of Books destroyed itself

https://nobaddaysinla.substack.com/p/how-the-la-review-of-books-destroyed
1•speckx•22m ago•0 comments