Fast(er) regular expression engines in Ruby

https://serpapi.com/blog/faster-regular-expression-engines-in-ruby/
60•davidsojevic•1y ago

Comments

yxhuvud•1y ago
Eww, pretending to support utf8 matchers while not supporting them at all was not pretty to see.
gitroom•1y ago
Honestly that part bugs me, fake support is worse than no support imo
kayodelycaon•1y ago
> Another nuance was found in ruby, which cannot scan the haystack with invalid UTF-8 byte sequences.

This is extremely basic Ruby: UTF-8 encoded strings must be valid UTF-8. This is not unique to Ruby. If I recall correctly, Python 3 does the same thing.

    2.7.1 :001 > haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc"
    2.7.1 :003 > haystack.force_encoding "ASCII-8BIT"
    => "\xFC\xA1\xA1\xA1\xA1\xA1abc" 
    2.7.1 :004 > haystack.scan(/.+/)
    => ["\xFC\xA1\xA1\xA1\xA1\xA1abc"]

This person is a senior engineer on their Team page. All they had to do was google "ArgumentError: invalid byte sequence in UTF-8". Or ask a coworker... the company has Ruby on Rails applications. headdesk
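
For readers who want to reproduce the behaviour described above, here is a minimal sketch. The bytes are the ones from the irb session. The helper name `scan_or_error` is made up for illustration, and `scrub` is shown only as one possible alternative to re-tagging the string as binary, not something the comment itself proposes.

```ruby
# Bytes from the irb session above, explicitly tagged as UTF-8 so the
# example does not depend on the source file's encoding.
haystack = "\xfc\xa1\xa1\xa1\xa1\xa1abc".b.force_encoding(Encoding::UTF_8)

# Hypothetical helper: returns the matches, or the error message if the
# regex engine refuses the string.
def scan_or_error(str, pattern)
  str.scan(pattern)
rescue ArgumentError => e
  e.message
end

# The string is tagged UTF-8 but its bytes are not valid UTF-8.
puts haystack.valid_encoding?   # false

# Scanning the UTF-8-tagged string raises ArgumentError.
utf8_result = scan_or_error(haystack, /.+/)

# String#b returns a binary (ASCII-8BIT) copy, which scans byte by byte.
binary_result = scan_or_error(haystack.b, /.+/)

# String#scrub replaces the invalid bytes, giving a valid UTF-8 string.
scrubbed_result = scan_or_error(haystack.scrub("?"), /.+/)

p utf8_result
p binary_result
p scrubbed_result
```

The binary copy keeps every original byte, while `scrub` changes the data, so which one is appropriate depends on whether the match offsets and bytes need to line up with the original input.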
burntsushi•1y ago
The nuance is specifically relevant here because neither of the other two regex engines benchmarked have this requirement. It's doubly relevant because that means running a regex search doesn't require a UTF-8 validation step, and is therefore likely beneficial from a perf perspective, depending on the workload.
kayodelycaon•1y ago
That’s a good point. I hadn’t considered it because I’ve hit the validation error long before getting to search. It is possible to avoid string operations with careful coding prior to the search.

Edit: After a little testing, the strings can be read from and written to files without triggering validation. Presumably this applies to sockets as well.
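
A rough sketch of that file round trip, assuming a temporary file and a shortened version of the byte sequence used earlier in the thread (the variable names are illustrative). It only covers files; sockets are not tested here.

```ruby
require "tempfile"

# Invalid UTF-8 bytes followed by ASCII, held as a binary string.
raw = "\xfc\xa1\xa1abc".b
read_back = nil
written_bytes = nil

Tempfile.create("haystack") do |file|
  file.close
  File.binwrite(file.path, raw)

  # Reading with a UTF-8 external encoding tags the string as UTF-8
  # without checking that the bytes are valid.
  read_back = File.read(file.path, encoding: "UTF-8")

  # Writing the invalid string back out does not raise either.
  File.write(file.path, read_back)
  written_bytes = File.binread(file.path)
end

puts read_back.encoding          # UTF-8
puts read_back.valid_encoding?   # false
puts written_bytes == raw        # true
```

The error only appears once an operation that needs valid characters, such as a regex match, touches the string.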

DmitryOlshansky•1y ago
I wonder how std.regex of dlang would fare in such a test. Sadly, due to a tiny bit of D’s GC use, it’s hard to provide as a library for other languages. If there is interest I might take it through the tests.
