frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•8mo ago

Comments

yxhuvud•8mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•8mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•8mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
burntsushi•8mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, dependening on the workload.
kayodelycaon•8mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.

DmitryOlshansky•8mo ago
I wonder how std.regex of dlang would fare in such test. Sadly due to a tiny bit of D’s GC use it’s hard to provide as a library for other languages. If there is an interest I might take it through the tests.

A 40-line fix eliminated a 400x performance gap

https://questdb.com/blog/jvm-current-thread-user-time/
149•bluestreak•4h ago•29 comments

Every GitHub object has two IDs

https://www.greptile.com/blog/github-ids
116•dakshgupta•11h ago•35 comments

Why IRC is better than Real Life

https://everything2.com/node/e2node/Why%20IRC%20is%20better%20than%20Real%20Life
26•jskherman•1h ago•14 comments

vLLM large scale serving: DeepSeek 2.2k tok/s/h200 with wide-ep

https://blog.vllm.ai/2025/12/17/large-scale-serving.html
48•robertnishihara•11h ago•2 comments

ASCII Clouds

https://caidan.dev/portfolio/ascii_clouds/
18•majkinetor•51m ago•5 comments

The truth behind the 2026 J.P. Morgan Healthcare Conference

https://www.owlposting.com/p/the-truth-behind-the-2026-jp-morgan
73•abhishaike•8h ago•22 comments

The $LANG Programming Language

86•dang•2h ago•14 comments

Are two heads better than one?

https://eieio.games/blog/two-heads-arent-better-than-one/
128•evakhoury•10h ago•34 comments

The Emacs Widget Library: A Critique and Case Study

https://www.d12frosted.io/posts/2025-11-26-emacs-widget-library
14•whacked_new•1d ago•0 comments

Sei (YC W22) Is Hiring a DevOps Engineer (India/In-Office/Chennai/Gurgaon)

https://www.ycombinator.com/companies/sei/jobs/Rn0KPXR-devops-platform-ai-infrastructure-engineer
1•ramkumarvenkat•2h ago

No management needed: anti-patterns in early-stage engineering teams

https://www.ablg.io/blog/no-management-needed
96•tonioab•8h ago•132 comments

Japan's Skyscraper Factories (2021)

https://www.construction-physics.com/p/japans-skyscraper-factories
54•Pikamander2•6d ago•1 comments

The Tulip Creative Computer

https://github.com/shorepine/tulipcc
192•apitman•10h ago•41 comments

AI Generated Music Barred from Bandcamp

https://old.reddit.com/r/BandCamp/comments/1qbw8ba/ai_generated_music_on_bandcamp/
614•cdrnsf•8h ago•455 comments

When hardware goes end-of-life, companies need to open-source the software

https://www.marcia.no/words/eol
193•Marciplan•4h ago•45 comments

We can't have nice things because of AI scrapers

https://blog.metabrainz.org/2025/12/11/we-cant-have-nice-things-because-of-ai-scrapers/
293•LorenDB•5h ago•164 comments

How to make a damn website (2024)

https://lmnt.me/blog/how-to-make-a-damn-website.html
151•birdculture•9h ago•49 comments

Exa-d: How to store the web in S3

https://exa.ai/blog/exa-d
3•willbryk•1h ago•0 comments

Show HN: Nogic – VS Code extension that visualizes your codebase as a graph

https://marketplace.visualstudio.com/items?itemName=Nogic.nogic
85•davelradindra•8h ago•35 comments

Scott Adams has died

https://www.youtube.com/watch?v=Rs_JrOIo3SE
781•ekianjo•11h ago•1274 comments

Show HN: Microwave – Native iOS app for videos on ATproto

https://testflight.apple.com/join/cVxV1W3g
12•sinned•9h ago•1 comments

Let's be honest, Generative AI isn't going all that well

https://garymarcus.substack.com/p/lets-be-honest-generative-ai-isnt
136•7777777phil•8h ago•143 comments

A university got itself banned from the Linux kernel (2021)

https://www.theverge.com/2021/4/30/22410164/linux-kernel-university-of-minnesota-banned-open-source
66•italophil•8h ago•44 comments

Show HN: AsciiSketch a free browser-based ASCII art and diagram editor

https://files.littlebird.com.au/ascii-sketch.html
24•schappim•4h ago•5 comments

Is it a joke?

https://novalis.org/blog/2025-11-06-is-it-a-joke.html
22•luu•5h ago•3 comments

Vanderbilt University plans new campus in San Francisco

https://www.wsj.com/us-news/education/vanderbilt-san-francisco-cca-california-college-arts-expans...
3•noleary•2h ago•0 comments

A deep dive on agent sandboxes

https://pierce.dev/notes/a-deep-dive-on-agent-sandboxes
36•icyfox•1d ago•9 comments

Inlining – The Ultimate Optimisation

https://xania.org/202512/17-inlining-the-ultimate-optimisation
48•PaulHoule•4d ago•18 comments

Influencers and OnlyFans models are dominating U.S. O-1 visa requests

https://www.theguardian.com/us-news/2026/jan/11/onlyfans-influencers-us-o-1-visa
357•bookofjoe•10h ago•258 comments

Confer – End to end encrypted AI chat

https://confer.to/
74•vednig•13h ago•58 comments