Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•12mo ago

Comments

yxhuvud•12mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•12mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•12mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic Ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to Ruby. If I recall correctly, Python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is listed as a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8" or ask a coworker; the company has Ruby on Rails applications. headdesk
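
For anyone who wants to reproduce this, here is a minimal sketch of the behavior under discussion. It assumes a reasonably recent Ruby (`String#scrub` needs 2.1+, `String.new` with `encoding:` needs 2.3+), and the variable names are illustrative rather than taken from the article. It shows the `ArgumentError`, then two common workarounds: scanning a binary-tagged copy, as in the session above, and scrubbing the invalid bytes.

```ruby
# Same bytes as in the irb session above, explicitly tagged as UTF-8.
haystack = String.new("\xfc\xa1\xa1\xa1\xa1\xa1abc", encoding: Encoding::UTF_8)

# Up-front check: are the bytes valid for the encoding the string is tagged with?
valid = haystack.valid_encoding? # => false

# Scanning the broken string raises instead of returning matches.
error_message =
  begin
    haystack.scan(/.+/)
    nil
  rescue ArgumentError => e
    e.message
  end

# Workaround 1: scan a binary-tagged copy (String#b leaves the original untouched).
binary_matches = haystack.b.scan(/.+/)

# Workaround 2: replace the invalid bytes and keep working in UTF-8.
scrubbed = haystack.scrub("?")
scrubbed_matches = scrubbed.scan(/abc/) # => ["abc"]
```

Which workaround fits depends on the data: the binary copy keeps every byte but gives up character-aware matching, while scrubbing keeps UTF-8 semantics but changes the input.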
burntsushi•12mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked has this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, depending on the workload.
kayodelycaon•12mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.
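
A small sketch of that round trip, assuming default IO settings with no transcoding configured (`Encoding.default_internal` unset). The file name and variable names are made up for illustration. The invalid bytes survive a write and a read unchanged, and the error only appears once the string is scanned:

```ruby
require "tmpdir"

# Invalid UTF-8: "\xfc" cannot start a valid sequence.
invalid = String.new("\xfc\xa1\xa1\xa1abc", encoding: Encoding::UTF_8)

# Write the bytes out and read them back tagged as UTF-8.
# Neither step validates the byte sequence.
round_tripped = Dir.mktmpdir do |dir|
  path = File.join(dir, "haystack.txt") # hypothetical file name
  File.write(path, invalid)
  File.read(path, encoding: "UTF-8")
end

same_bytes = round_tripped.bytes == invalid.bytes
still_invalid = !round_tripped.valid_encoding?

# Validation only kicks in when a regex is run against the string.
raised_on_scan =
  begin
    round_tripped.scan(/.+/)
    false
  rescue ArgumentError
    true
  end
```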

DmitryOlshansky•11mo ago
I wonder how std.regex of dlang would fare in such a test. Sadly, due to a tiny bit of D’s GC use, it’s hard to provide as a library for other languages. If there is interest I might take it through the tests.
