frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•8mo ago

Comments

yxhuvud•8mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•8mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•8mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
burntsushi•8mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, dependening on the workload.
kayodelycaon•8mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.

DmitryOlshansky•8mo ago
I wonder how std.regex of dlang would fare in such test. Sadly due to a tiny bit of D’s GC use it’s hard to provide as a library for other languages. If there is an interest I might take it through the tests.

The 26,000-Year Astronomical Monument Hidden in Plain Sight

https://longnow.org/ideas/the-26000-year-astronomical-monument-hidden-in-plain-sight/
58•mkmk•49m ago•10 comments

The Unix Pipe Card Game

https://punkx.org/unix-pipe-game/
68•kykeonaut•2h ago•18 comments

IP Addresses Through 2025

https://www.potaroo.net/ispcol/2026-01/addr2025.html
120•petercooper•5h ago•58 comments

Show HN: wxpath – Declarative web crawling in XPath

https://github.com/rodricios/wxpath
20•rodricios•6d ago•0 comments

Unconventional PostgreSQL Optimizations

https://hakibenita.com/postgresql-unconventional-optimizations
107•haki•4h ago•11 comments

De-dollarization: Is the US dollar losing its dominance? (2025)

https://www.jpmorgan.com/insights/global-research/currencies/de-dollarization
408•andsoitis•3h ago•481 comments

I'm addicted to being useful

https://www.seangoedecke.com/addicted-to-being-useful/
318•swah•8h ago•178 comments

Nvidia Stock Crash Prediction

https://entropicthoughts.com/nvidia-stock-crash-prediction
186•todsacerdoti•3h ago•146 comments

Google co-founder reveals that "many" of the new hires do not have a degree

https://www.yahoo.com/news/articles/google-cofounder-reveals-tons-recent-231500103.html
22•01-_-•40m ago•11 comments

Linux kernel framework for PCIe device emulation, in userspace

https://github.com/cakehonolulu/pciem
173•71bw•11h ago•65 comments

The Zen of Reticulum

https://github.com/markqvist/Reticulum/blob/master/Zen%20of%20Reticulum.md
63•mikece•5h ago•41 comments

Level S4 solar radiation event

https://www.swpc.noaa.gov/news/g4-severe-geomagnetic-storm-levels-reached-19-jan-2026
560•WorldPeas•22h ago•184 comments

Show HN: Ocrbase – pdf → .md/.json document OCR and structured extraction API

https://github.com/majcheradam/ocrbase
61•adammajcher•5h ago•18 comments

IP over Avian Carriers with Quality of Service (1999)

https://www.rfc-editor.org/rfc/rfc2549.html
49•mig4ng•7h ago•23 comments

Apple testing new App Store design that blurs the line between ads and results

https://9to5mac.com/2026/01/16/iphone-apple-app-store-search-results-ads-new-design/
562•ksec•1d ago•461 comments

Channel3 (YC S25) Is Hiring

https://www.ycombinator.com/companies/channel3/jobs/3DIAYYY-backend-engineer
1•aschiff1•7h ago

Everyone's a Gangster, Till You Get Bundled in G-Suite

https://twitter.com/keropillay/status/2013454666244481244
17•keroshanpillay•46m ago•4 comments

Reticulum, a secure and anonymous mesh networking stack

https://github.com/markqvist/Reticulum
301•brogu•19h ago•72 comments

Running Claude Code dangerously (safely)

https://blog.emilburzo.com/2026/01/running-claude-code-dangerously-safely/
207•emilburzo•7h ago•176 comments

Increasing the performance of WebAssembly Text Format parser by 350%

https://blog.gplane.win/posts/improve-wat-parser-perf.html
85•gplane•5d ago•30 comments

What came first: the CNAME or the A record?

https://blog.cloudflare.com/cname-a-record-order-dns-standards/
431•linolevan•1d ago•149 comments

Kiss Launcher – fast launcher for Android

https://kisslauncher.com/
65•ifh-hn•9h ago•33 comments

Benchmarking a Baseline Fully-in-Place Functional Language Compiler [pdf]

https://trendsfp.github.io/papers/tfp26-paper-12.pdf
30•matt_d•4d ago•3 comments

The coming industrialisation of exploit generation with LLMs

https://sean.heelan.io/2026/01/18/on-the-coming-industrialisation-of-exploit-generation-with-llms/
228•long•1d ago•139 comments

Nanolang: A tiny experimental language designed to be targeted by coding LLMs

https://github.com/jordanhubbard/nanolang
209•Scramblejams•21h ago•166 comments

How Hightouch built their long-running agent harness

https://www.amplifypartners.com/blog-posts/how-hightouch-built-their-long-running-agent-harness
31•thecr0w•33m ago•1 comments

Prediction markets are ushering in a world in which news becomes about gambling

https://www.theatlantic.com/technology/2026/01/america-polymarket-disaster/685662/
439•krustyburger•2d ago•426 comments

Notes on Apple's Nano Texture (2025)

https://jon.bo/posts/nano-texture/
236•dsr12•1d ago•124 comments

3D printing my laptop ergonomic setup

https://www.ntietz.com/blog/3d-printing-my-laptop-ergonomic-setup/
140•kurinikku•19h ago•66 comments

Squishy Go

https://puyogo.app/en/
21•kqr•3d ago•8 comments