frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•8mo ago

Comments

yxhuvud•8mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•8mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•8mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
burntsushi•8mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, dependening on the workload.
kayodelycaon•8mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.

DmitryOlshansky•8mo ago
I wonder how std.regex of dlang would fare in such test. Sadly due to a tiny bit of D’s GC use it’s hard to provide as a library for other languages. If there is an interest I might take it through the tests.

The Most Popular Blogs of Hacker News in 2025

https://refactoringenglish.com/blog/2025-hn-top-5/
272•mtlynch•5h ago•64 comments

Microsoft kills official way to activate Windows 11/10 without internet

https://www.neowin.net/news/report-microsoft-quietly-kills-official-way-to-activate-windows-1110-...
222•taubek•2h ago•139 comments

The C3 Programming Language

https://c3-lang.org
211•y1n0•4h ago•134 comments

Show HN: Vibe Coding a static site on a $25 Walmart Phone

https://stetsonblake.com/%2425+Walmart+Phone+for+Hackers
12•stets•1h ago•0 comments

As deep-sea mining race ramps up, mission will assess whether ecosystems recover

https://www.science.org/content/article/deep-sea-mining-race-ramps-mission-will-assess-whether-ec...
10•rbanffy•2h ago•1 comments

Publish on your own site, syndicate elsewhere

https://indieweb.org/POSSE#
960•47thpresident•1d ago•223 comments

Sirius DB

https://www.sirius-db.com/
60•manoji•3d ago•8 comments

Exploring Dithering on Spectra 6-color E-Ink Displays

https://myembeddedstuff.com/e-ink-spectra-6-color
26•edent•3d ago•1 comments

Experiments with Ableton-MCP

https://jhurliman.org/post/804323197731373056/experiments-with-ableton-mcp-dec-2025
33•jhurliman•4d ago•2 comments

Daft Punk Easter Egg in the BPM Tempo of Harder, Better, Faster, Stronger?

https://www.madebywindmill.com/tempi/blog/hbfs-bpm/
711•simonw•1d ago•118 comments

Recursive Language Models

https://arxiv.org/abs/2512.24601
85•schmuhblaster•10h ago•13 comments

Xr0 verifier, guarantee the safety of C programs at compile time

https://xr0.dev
6•Alifatisk•3h ago•0 comments

Beating myself at chess

https://log.schemescape.com/posts/diy/beating-myself-at-chess.html
13•zdw•4d ago•8 comments

World's largest functioning musical instrument: Wanamaker Organ in Philadelphia

https://en.wikipedia.org/wiki/Wanamaker_Organ
52•bookofjoe•6d ago•8 comments

X-Clacks-Overhead

https://hleb.dev/post/x-clacks-overhead/
97•hleb_dev•9h ago•30 comments

Why your early 2000s photos are probably lost forever

https://www.bbc.com/future/article/20251212-why-your-early-2000s-photos-are-probably-lost-forever
8•slyall•5d ago•1 comments

ParadeDB (YC S23) Is Hiring Database Engineers

https://paradedb.notion.site/?p=172ea4ce9deb80898ef5d5097bd65544&pm=s
1•philippemnoel•7h ago

IPv6 just turned 30 and still hasn't taken over the world

https://www.theregister.com/2025/12/31/ipv6_at_30/
514•Brajeshwar•1d ago•1052 comments

Profiling with Ctrl-C (2024)

https://yosefk.com/blog/profiling-with-ctrl-c.html
64•hun3•10h ago•12 comments

Doesnt: An Esoteric Programming Language

https://lists.sr.ht/~rabbits/horadric/%3C5d708fd1-1c01-4fb6-a8e5-61213a1e88f8@sheeeeeeeep.art%3E
5•surprisetalk•2h ago•0 comments

A Beginner's Two-Component Crystal-Style Wi-Fi Detector

https://siliconjunction.wordpress.com/2025/12/12/a-beginners-two-component-crystal-style-wi-fi-de...
121•jensgk•3d ago•36 comments

Cadova: Swift DSL for parametric 3D modeling

https://github.com/tomasf/Cadova
80•bdcravens•3d ago•19 comments

Clicks Communicator

https://www.clicksphone.com/en/communicator
399•microflash•1d ago•248 comments

Show HN: Offline tiles and routing and geocoding in one Docker Compose stack

https://www.corviont.com/
63•packet_mover•5h ago•19 comments

Ask HN: Who is hiring? (January 2026)

335•whoishiring•1d ago•207 comments

2026 will be my year of the Linux desktop

https://xeiaso.net/notes/2026/year-linux-desktop/
741•todsacerdoti•21h ago•558 comments

UK company sends factory with 1,000C furnace into space

https://www.bbc.co.uk/news/articles/c62vx0pgyrgo
123•vekerdyb•3d ago•66 comments

Xsight Labs E1 DPU a 64-Core Arm Neoverse N2 800G DPU

https://www.servethehome.com/this-is-the-xsight-labs-e1-dpu-a-64-core-arm-neoverse-n2-800g-dpu/
9•ksec•1h ago•2 comments

Linux kernel security work

http://www.kroah.com/log/blog/2026/01/02/linux-kernel-security-work/
167•chmaynard•1d ago•81 comments

Trump says Venezuela’s Maduro captured after strikes

https://www.reuters.com/world/americas/loud-noises-heard-venezuela-capital-southern-area-without-...
1384•jumpocelot•14h ago•3289 comments