frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•8mo ago

Comments

yxhuvud•8mo ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•8mo ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•8mo ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to ruby. If I recall correctly, python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]
This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
burntsushi•8mo ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, dependening on the workload.
kayodelycaon•8mo ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.

DmitryOlshansky•8mo ago
I wonder how std.regex of dlang would fare in such test. Sadly due to a tiny bit of D’s GC use it’s hard to provide as a library for other languages. If there is an interest I might take it through the tests.

Douglas Adams on the English–American cultural divide over "heroes"

https://shreevatsa.net/post/douglas-adams-cultural-divide/
128•speckx•1h ago•82 comments

Design Thinking Books You Must Read

https://www.designorate.com/design-thinking-books/
120•rrm1977•3h ago•52 comments

We will ban you and ridicule you in public if you waste our time on crap reports

https://curl.se/.well-known/security.txt
552•latexr•4h ago•317 comments

Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete

https://huggingface.co/sweepai/sweep-next-edit-1.5B
427•williamzeng0•15h ago•79 comments

ISO PDF spec is getting Brotli – ~20 % smaller documents with no quality loss

https://pdfa.org/want-to-make-your-pdfs-20-smaller-for-free/
43•whizzx•4h ago•19 comments

Doctors in Brazil using tilapia fish skin to treat burn victims

https://www.pbs.org/newshour/health/brazilian-city-uses-tilapia-fish-skin-treat-burn-victims
172•kaycebasques•9h ago•61 comments

In Praise of APL (1977)

https://www.jsoftware.com/papers/perlis77.htm
60•tosh•6h ago•35 comments

30 Years of ReactOS

https://reactos.org/blogs/30yrs-of-ros/
74•Mark_Jansen•6h ago•25 comments

Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant

https://www.media.mit.edu/publications/your-brain-on-chatgpt/
381•misswaterfairy•16h ago•256 comments

Palantir, Meta, OpenAI Execs Appointed Lieutenant Colonels in US Army

https://twitter.com/SecArmy/status/1933693585183965372
21•alexmorley•27m ago•4 comments

You can now download Windows 7 and Vista ISO images with all the updates

https://www.neowin.net/news/you-can-now-download-windows-7-and-vista-iso-images-with-every-possib...
25•XzetaU8•45m ago•10 comments

Flowtel (YC W25) Is Hiring

https://www.ycombinator.com/companies/flowtel/jobs/LaddaEz-founding-engineer-staff-senior
1•eylonmiz•2h ago

Threat actors expand abuse of Microsoft Visual Studio Code

https://www.jamf.com/blog/threat-actors-expand-abuse-of-visual-studio-code/
223•vinnyglennon•14h ago•202 comments

eBay explicitly bans AI "buy for me" agents in user agreement update

https://www.valueaddedresource.net/ebay-bans-ai-agents-updates-arbitration-user-agreement-feb-2026/
209•bdcravens•17h ago•226 comments

Gathering Linux Syscall Numbers in a C Table

https://t-cadet.github.io/programming-wisdom/#2026-01-17-gathering-linux-syscall-numbers
72•phi-system•4d ago•28 comments

A Year of 3D Printing

https://brookehatton.com/blog/making/a-year-of-3d-printing/
43•nindalf•4d ago•37 comments

Hands-On Introduction to Unikernels

https://labs.iximiuz.com/tutorials/unikernels-intro-93976514
83•valyala•5d ago•29 comments

Waiting for dawn in search: Search index, Google rulings and impact on Kagi

https://blog.kagi.com/waiting-dawn-search
384•josephwegner•21h ago•212 comments

Claude's new constitution

https://www.anthropic.com/news/claude-new-constitution
494•meetpateltech•22h ago•568 comments

Show HN: ChartGPU – WebGPU-powered charting library (1M points at 60fps)

https://github.com/ChartGPU/ChartGPU
629•huntergemmer•1d ago•194 comments

Skip is now free and open source

https://skip.dev/blog/skip-is-free/
456•dayanruben•23h ago•205 comments

The Science of Life and Death in Mary Shelley's Frankenstein

https://publicdomainreview.org/essay/the-science-of-life-and-death-in-mary-shelleys-frankenstein/
5•Anon84•4d ago•0 comments

The Human in the Loop

https://adventures.nodeland.dev/archive/the-human-in-the-loop/
22•artur-gawlik•3d ago•17 comments

Meet the Alaska Student Arrested for Eating an AI Art Exhibit

https://www.thenation.com/article/society/alaska-student-arrested-eating-ai-art-exhibit/
9•petethomas•47m ago•2 comments

Binary fuse filters: Fast and smaller than xor filters (2022)

https://arxiv.org/abs/2201.01174
118•redbell•5d ago•10 comments

Lix – universal version control system for binary files

https://lix.dev/blog/introducing-lix/
95•onecommit•15h ago•37 comments

From stealth blackout to whitelisting: Inside the Iranian shutdown

https://www.kentik.com/blog/from-stealth-blackout-to-whitelisting-inside-the-iranian-shutdown/
140•oavioklein•14h ago•104 comments

Now with Mqtts

https://daniel.haxx.se/blog/2026/01/19/now-with-mqtts/
13•firesteelrain•1h ago•1 comments

TrustTunnel: AdGuard VPN protocol goes open-source

https://adguard-vpn.com/en/blog/adguard-vpn-protocol-goes-open-source-meet-trusttunnel.html
170•kumrayu•21h ago•58 comments

Significant US farm losses persist, despite federal assistance

https://www.fb.org/market-intel/significant-farm-losses-persist-despite-federal-assistance
232•toomuchtodo•13h ago•300 comments