frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast regex search: indexing text for agent tools

https://cursor.com/blog/fast-regex-search
16•jxmorris12•2d ago

Comments

mpalmer•1h ago
> No matter how fast ripgrep can match on the contents of a file, it has one serious limitation: it needs to match on the contents of all files.

The omission of rg's `-g` parameter is unsurprising in one sense, because it would mostly obviate this entire exercise. How often do you need to search what sounds like hundreds of millions of lines of source for a complex pattern, with zero constraints on paths searched?

> We routinely see rg invocations that take more than 15 seconds

I'm trying to understand the monorepo that is so large that ripgrep takes 15 seconds to return results, when it's benchmarked as searching for a literal in a 9.3GB file in 600ms, or 1.08s to search for `.*` in the entire Linux repo.

And again, that's without using `-g`.

piker•1h ago
> -g GLOB, --glob=GLOB

> Include or exclude files and directories for searching that match the given glob. This always overrides any other ignore logic. Multiple glob flags may be used. Globbing rules match .gitignore globs. Precede a glob with a ! to exclude it. If multiple globs match a file or directory, the glob given later in the command line takes precedence. As an extension, globs support specifying alternatives: -g 'ab{c,d}*' is equivalent to -g abc -g abd. Empty alternatives like -g 'ab{,c}' are not currently supported. Note that this syntax extension is also currently enabled in gitignore files, even though this syntax isn't supported by git itself. ripgrep may disable this syntax extension in gitignore files, but it will always remain available via the -g/--glob flag.

> When this flag is set, every file and directory is applied to it to test for a match. For example, if you only want to search in a particular directory foo, then -g foo is incorrect because foo/bar does not match the glob foo. Instead, you should use -g 'foo/*'.

https://man.archlinux.org/man/rg.1.en*

(for those who were unfamiliar with the switch like me)

mpalmer•57m ago
Has the Cursor team considered (for instance) that on MDM-managed machines, binaries which do pervasive arbitrary FS reads may be monitored, throttled, or otherwise controlled by overseer programs? That kitchen-sink Electron apps like Cursor using those binaries might compound the red-flag signals?
open-paren•39m ago
The creator of fff.nvim[0], Dmitriy Kovalenko, had an interesting analysis of this on Xitter[1]. The TL;DR of this is that Anysphere/Cursor is being somewhat disingenuous and does not include the index-creation and recreation time in the comparison nor do they include the CPU or memory overhead, where rg (and his tool, fff.nvim) are indexless.

---

0: https://github.com/dmtrKovalenko/fff.nvim

1: http://x.com/i/article/2036558670528651264

Why so many control rooms were seafoam green (2025)

https://bethmathews.substack.com/p/why-so-many-control-rooms-were-seafoam
444•Amorymeltzer•1d ago•76 comments

Deploytarot.com – tarot card reading for deployments

https://deploytarot.com/setup
56•rembish•1h ago•16 comments

DOOM Over DNS

https://github.com/resumex/doom-over-dns
163•Venn1•3d ago•47 comments

Cloudflare's Gen 13 servers: trading cache for cores for 2x performance

https://blog.cloudflare.com/gen13-launch/
44•wmf•3d ago•6 comments

CERN to host a new phase of Open Research Europe

https://home.cern/news/news/cern/cern-host-europes-flagship-open-access-publishing-platform
152•JohnHammersley•3h ago•14 comments

New York City hospitals drop Palantir as controversial AI firm expands in UK

https://www.theguardian.com/technology/2026/mar/26/new-york-hospitals-palantir-ai
137•chrisjj•2h ago•47 comments

My minute-by-minute response to the LiteLLM malware attack

https://futuresearch.ai/blog/litellm-attack-transcript/
251•Fibonar•6h ago•114 comments

Show HN: Turbolite – a SQLite VFS serving sub-250ms cold JOIN queries from S3

https://github.com/russellromney/turbolite
84•russellthehippo•3h ago•20 comments

John Bradley, author of xv, has died

https://voxday.net/2026/03/25/rip-john-bradley/
163•linsomniac•3h ago•49 comments

Moving from GitHub to Codeberg, for lazy people

https://unterwaditzer.net/2025/codeberg.html
472•jslakro•8h ago•242 comments

OpenTelemetry profiles enters public alpha

https://opentelemetry.io/blog/2026/profiles-alpha/
129•tanelpoder•6h ago•14 comments

How much precision can you squeeze out of a table?

https://www.johndcook.com/blog/2026/03/26/table-precision/
28•nomemory•2h ago•2 comments

We haven't seen the worst of what gambling and prediction markets will do

https://www.derekthompson.org/p/we-havent-seen-the-worst-of-what
422•mmcclure•2h ago•298 comments

HyperAgents: Self-referential self-improving agents

https://github.com/facebookresearch/hyperagents
98•andyg_blog•2d ago•43 comments

Colibri – chat platform built on the AT Protocol for communities big and small

https://colibri.social/
93•todotask2•5h ago•47 comments

Personal Encyclopedias

https://whoami.wiki/blog/personal-encyclopedias
781•jrmyphlmn•1d ago•163 comments

Using FireWire on a Raspberry Pi

https://www.jeffgeerling.com/blog/2026/firewire-on-a-raspberry-pi/
27•jandeboevrie•2h ago•7 comments

Anthropic Subprocessor Changes

https://trust.anthropic.com
8•tencentshill•58m ago•5 comments

The Little Book of C

https://little-book-of.github.io/c/books/en-US/book.html
47•ghostrss•1h ago•9 comments

Stripe Projects: Provision and manage services from the CLI

https://projects.dev/
89•piinbinary•6h ago•25 comments

Non-Messing-Up++: Diagonal Sorting and Young Tableaux

https://winwang.blog/posts/non-messing-up++
5•winwang•3d ago•1 comments

Fast regex search: indexing text for agent tools

https://cursor.com/blog/fast-regex-search
16•jxmorris12•2d ago•4 comments

From zero to a RAG system: successes and failures

https://en.andros.dev/blog/aa31d744/from-zero-to-a-rag-system-successes-and-failures/
265•andros•2d ago•79 comments

Running Tesla Model 3's computer on my desk using parts from crashed cars

https://bugs.xdavidhu.me/tesla/2026/03/23/running-tesla-model-3s-computer-on-my-desk-using-parts-...
839•driesdep•1d ago•295 comments

Taming LLMs: Using Executable Oracles to Prevent Bad Code

https://john.regehr.org/writing/zero_dof_programming.html
26•mad44•4h ago•14 comments

Building a Blog with Elixir and Phoenix

https://jola.dev/posts/building-a-blog-with-elixir-and-phoenix
66•shintoist•5h ago•4 comments

My home network observes bedtime with OpenBSD and pf

https://ratfactor.com/openbsd/pf-gateway-bedtime
102•ibobev•3d ago•29 comments

Olympic Committee bars transgender athletes from women’s events

https://www.nytimes.com/2026/03/26/world/olympics/ioc-transgender-athletes-ban.html
200•RestlessMind•8h ago•456 comments

Show HN: Fio: 3D World editor/game engine – inspired by Radiant and Hammer

https://github.com/ViciousSquid/Fio
8•vicioussquid•1h ago•0 comments

End of "Chat Control": EU parliament stops mass surveillance

https://www.patrick-breyer.de/en/end-of-chat-control-eu-parliament-stops-mass-surveillance-in-vot...
535•amarcheschi•10h ago•259 comments