frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•1y ago

Comments

yxhuvud•1y ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•1y ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•1y ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
burntsushi•1y ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, dependening on the workload.
kayodelycaon•1y ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.

DmitryOlshansky•1y ago
I wonder how std.regex of dlang would fare in such test. Sadly due to a tiny bit of D’s GC use it’s hard to provide as a library for other languages. If there is an interest I might take it through the tests.

Anthropic's open-source framework for AI-powered vulnerability discovery

https://github.com/anthropics/defending-code-reference-harness
56•binyu•50m ago•18 comments

VoidZero Is Joining Cloudflare

https://blog.cloudflare.com/voidzero-joins-cloudflare/
501•coloneltcb•8h ago•232 comments

When AI Builds Itself: Our progress toward recursive self-improvement

https://www.anthropic.com/institute/recursive-self-improvement
148•meetpateltech•4h ago•178 comments

Castor: CERN Advanced STORage Manager

https://castor.web.cern.ch/content/home.html
22•naves•1h ago•1 comments

Retro-Tech Parenting

https://havenweb.org/2026/05/28/retro-tech.html
182•mawise•4h ago•119 comments

Meta's ships facial recognition on smart glasses

https://www.buchodi.com/meta-glasses-facial-recognition/
115•buchodi•1h ago•90 comments

Ian's Secure Shoelace Knot

https://www.fieggen.com/shoelace/secureknot.htm
414•mooreds•9h ago•160 comments

KVarN: Native vLLM backend for KV-cache quantization by Huawei

https://github.com/huawei-csl/KVarN
93•theanonymousone•5h ago•8 comments

JLink JTAG Access on the Pinecil

https://danielmangum.com/posts/jlink-jtag-pinecil/
21•hasheddan•2d ago•0 comments

Show HN: FFmpeg WebCLI – Full FFmpeg in Browser, Offline PWA, No Uploads(WASM)

https://github.com/tejaswigowda/ffmpeg-webCLI
8•tejaswigowda•29m ago•2 comments

They’re made out of weights

https://maxleiter.com/blog/weights
1336•MaxLeiter•21h ago•585 comments

Samurai City

https://worksinprogress.co/issue/samurai-city/
43•zdw•2d ago•3 comments

Failing grades soar with AI usage, dwindling math skills in Berkeley CS classes

https://www.dailycal.org/news/campus/academics/failing-grades-soar-as-professors-see-greater-ai-u...
689•littlexsparkee•20h ago•639 comments

Zettascale (YC S24) Is Hiring Founding FPGA Engineers

https://www.ycombinator.com/companies/zettascale/jobs/O9S1vqO-founding-engineer-fpga-rtl-asic-arc...
1•el_al•4h ago

Making Debian or Fedora persistent live images

https://sigwait.org/~alex/blog/2026/05/28/smdBC8.html
34•henry_flower•3d ago•3 comments

Sum-product, unit distances, and number fields

https://www.erdosproblems.com/forum/thread/blog:6
44•robinhouston•3d ago•5 comments

U.S. Army Corps of Engineers Bay Model

https://en.wikipedia.org/wiki/U.S._Army_Corps_of_Engineers_Bay_Model
189•tosh•2d ago•48 comments

Show HN: Uruky (EU-based Kagi alternative) now has Image Search and URL Rewrites

https://uruky.com/?il=en
188•BrunoBernardino•12h ago•177 comments

Gaussian Point Splatting

https://momentsingraphics.de/Siggraph2026.html
162•ibobev•10h ago•59 comments

Show HN: Cost.dev (YC W21) – making agents cost-aware and cheaper to call

https://cost.dev/
15•akh•9h ago•4 comments

3D-printed book turns its own G-code into raised lettering

https://www.designboom.com/design/3d-printed-book-manual-darius-ou-benson-chong/
62•surprisetalk•2d ago•24 comments

Elixir v1.20: Now a gradually typed language

https://elixir-lang.org/blog/2026/06/03/elixir-v1-20-0-released/
940•cloud8421•1d ago•377 comments

The desperation of NYTimes

https://rozumem.xyz/posts/16
253•rozumem•3h ago•244 comments

AI, Ashby Engineering, and the future

https://www.ashbyhq.com/blog/engineering/ai-ashby-engineering-and-the-future
20•fredley•6h ago•12 comments

Gemma 4 12B: A unified, encoder-free multimodal model

https://blog.google/innovation-and-ai/technology/developers-tools/introducing-gemma-4-12b/
1003•rvz•1d ago•375 comments

Mornings and nights no longer exist at 47C: A day in the hottest place in India

https://www.bbc.co.uk/news/articles/crmp0krp98ro
17•mellosouls•2d ago•1 comments

Artificial intelligence is not conscious – Ted Chiang

https://www.theatlantic.com/philosophy/2026/06/no-artificial-intelligence-is-not-conscious/687378/
703•lordleft•1d ago•1221 comments

Wind and solar generated more power than gas globally in April 2026

https://electrek.co/2026/05/20/in-a-first-wind-solar-generated-more-power-than-gas-globally-april...
335•speckx•6h ago•298 comments

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it

https://kasra.blog/blog/i-spent-1500-seeing-if-llms-could-hack-my-app/
363•jc4p•20h ago•197 comments

French-Iranian author Marjane Satrapi, author of 'Persepolis', dies at 56

https://www.france24.com/en/culture/20260604-french-iranian-author-marjane-satrapi-author-of-pers...
379•fidotron•9h ago•112 comments