Show HN: I enhanced Soundex to correctly handle multi-word strings

4•ogora•5h ago
Hello HN.

I built Flookup Data Wrangler, a powerful Google Sheets add-on for data cleaning without writing a single line of code.

Traditional Soundex is designed for single words: "John" and "Jonny" receive the same code, which makes data cleaning comparisons between such strings straightforward. However, standard Soundex output cannot be used to compare multi-word or reordered strings like "John Doe" and "Doe Jonny". The code keeps the first letter of the whole string and is truncated to four characters, so the two strings produce different codes and the comparison is inaccurate.

To address this, I modified the Soundex algorithm to support multi-word and reordered strings by adding a helper function that re-encodes the output into a format that can be used for accurate text-to-text comparisons. The helper adds minimal overhead, so the impact on performance is negligible.
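
For readers who want to see the idea in code: the post does not describe how the helper works, so the following is only a minimal sketch of one possible approach, not Flookup's implementation. It assumes that each word is encoded separately with classic American Soundex and that the codes are then sorted, so word order no longer matters. The names `soundex` and `multiword_key` are hypothetical.

```python
# Illustrative sketch only; this is NOT Flookup's actual helper.
# Assumption: encode each word on its own, then sort the codes.

_GROUPS = ["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"]
_CODES = {ch: str(d) for d, letters in enumerate(_GROUPS, start=1) for ch in letters}


def soundex(word: str) -> str:
    """Classic American Soundex of one string; non-letters are ignored."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # vowels reset prev, so a repeated code after a vowel counts
    return (result + "000")[:4]  # pad with zeros, keep four characters


def multiword_key(text: str) -> str:
    """Hypothetical order-independent key: one sorted Soundex code per word."""
    codes = [soundex(token) for token in text.split()]
    return " ".join(sorted(code for code in codes if code))


# Whole-string Soundex disagrees on reordered names...
print(soundex("John Doe"), soundex("Doe Jonny"))  # J530 D250
# ...while the per-word, sorted key agrees.
print(multiword_key("John Doe"))   # D000 J500
print(multiword_key("Doe Jonny"))  # D000 J500
```

Sorting the per-word codes is the simplest way to make the key independent of word order. A real implementation might also need to handle missing or extra words, which this sketch does not attempt.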

By leveraging this enhancement, Flookup users can do the following:

+ Fuzzy matching and merging

+ Duplicate highlighting and removal

+ Extracting a list of unique values

... all based on how the strings, or parts of the strings, sound when pronounced in English.
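
As a rough illustration of the duplicate and unique-value operations listed above, here is a minimal sketch that groups values by a comparison key. It assumes some function that maps a string to a key. In Flookup's case that would be a phonetic key, but the `placeholder_key` below is only a stand-in that lowercases and sorts the words. All function names are hypothetical.

```python
# Illustrative sketch only; the key function is a stand-in, not a phonetic encoder.
from collections import defaultdict
from typing import Callable, Dict, Iterable, List


def placeholder_key(text: str) -> str:
    """Stand-in for a phonetic key: lowercase the words and sort them."""
    return " ".join(sorted(text.lower().split()))


def group_by_key(values: Iterable[str], key: Callable[[str], str]) -> Dict[str, List[str]]:
    """Collect values that share the same comparison key, in input order."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for value in values:
        groups[key(value)].append(value)
    return dict(groups)


def duplicates(values: Iterable[str], key: Callable[[str], str]) -> List[List[str]]:
    """Groups with more than one member are candidate duplicates."""
    return [group for group in group_by_key(values, key).values() if len(group) > 1]


def unique_values(values: Iterable[str], key: Callable[[str], str]) -> List[str]:
    """Keep the first value seen for each key."""
    return [group[0] for group in group_by_key(values, key).values()]


names = ["John Doe", "doe john", "Jane Roe"]
print(duplicates(names, placeholder_key))     # [['John Doe', 'doe john']]
print(unique_values(names, placeholder_key))  # ['John Doe', 'Jane Roe']
```

Passing the key function as a parameter keeps the grouping logic separate from the encoding, so a phonetic key can be swapped in without changing the rest.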

I would love feedback, especially from those into data cleaning (which I'm guessing is everyone).

If you are curious to give it a try, here is a quick start guide: https://www.getflookup.com/get-started

Show HN: A free, privacy preserving, archive of public Discord servers

https://searchcord.io
3•searchcord•44m ago•0 comments

Show HN: A MCP server to evaluate Python code in WASM VM using RustPython

https://github.com/tuananh/hyper-mcp/tree/main/examples/plugins/eval-py
25•tuananh•2d ago•9 comments

Show HN: Bobber Game (Go Down to Go Up)

https://stan-stani.github.io/minigames/?game=bobber
2•EstanislaoStan•1h ago•0 comments

Show HN: Windows 98 themed website in 1 HTML file for my post punk band

https://corp.band
166•jealousgelatin•10h ago•37 comments

Show HN: A native Hacker News reader with integrated todo/done tracking

https://github.com/haojiang99/hacker_news_reader
27•coolwulf•9h ago•15 comments

Show HN: Near-perfect English AI translation of a classic of Japanese literature

https://www.booktranslate.ai/public-translations/nzdd5fxnqbgx1rtmmky0ibsh
3•sunwood•2h ago•0 comments

Show HN: Goboscript, text-based programming language, compiles to Scratch

https://github.com/aspizu/goboscript
152•aspizu•21h ago•56 comments

Show HN: Cogitator – A Python Toolkit for Chain-of-Thought Prompting

https://github.com/habedi/cogitator
48•habedi0•4d ago•8 comments

Show HN: Combine Emojis to generate a new one

https://emojis.directory/emoji-merge/
2•azeemkafridi•3h ago•0 comments

Show HN: Job board aggregator for best paying remote SWE jobs in the U.S.

https://www.remoteswe.fyi
179•xitang•1d ago•118 comments

Show HN: A highly extensible framework for building OCR systems

https://github.com/robbyzhaox/myocr
13•robbyzhao•13h ago•0 comments

Show HN: A platform to find tech conferences, discounts, and ticket giveaways

https://www.tech.tickets/
100•danthebaker•3d ago•35 comments

Show HN: Hardtime.nvim – break bad habits and master Vim motions

https://github.com/m4xshen/hardtime.nvim
201•m4xshen•1d ago•81 comments

Show HN: Vibe coding from your phone

https://vibecodego.com
7•chrisnolet•8h ago•0 comments

Show HN: Visualization of job openings by US based employers

https://jobswithgpt.com/blog/jobs-density-visualization/
6•jobswithgptcom•8h ago•2 comments

Show HN: Chat with 19 years of HN

https://app.camelai.com/log-in?next=/hn/
146•vercantez•1d ago•113 comments

Show HN: I modeled the Voynich Manuscript with SBERT to test for structure

https://github.com/brianmg/voynich-nlp-analysis
376•brig90•1d ago•129 comments

Show HN: GrowthBook MCP Server for Feature Flagging and Experimentation

https://github.com/growthbook/growthbook-mcp
2•royalfig•5h ago•0 comments

Show HN: Python Simulator of David Deutsch’s “Constructor Theory of Time”

https://github.com/gvelesandro/constructor-theory-simulator
81•SandroG•1d ago•11 comments

Show HN: Vaev – A browser engine built from scratch (It renders google.com)

https://github.com/skift-org/vaev
222•monax•1d ago•134 comments

Show HN: Buckaroo – Data table UI for Notebooks

https://github.com/paddymul/buckaroo
103•paddy_m•1d ago•9 comments

Show HN: Mirror World, create an AI clone of anyone

https://mirr.world/
20•p-sharpe•12h ago•3 comments

Show HN: Jar.tools – View, Change, Decompile Java Jar Files

https://jar.tools
8•Igor_Wiwi•15h ago•0 comments

Show HN: Inkwell: book/article authoring platform exports to PDF and ePub

https://inkwell.net
2•winstonewert•6h ago•0 comments

Show HN: A web browser agent in your Chrome side panel

https://github.com/parsaghaffari/browserbee
146•parsabg•1d ago•61 comments

Show HN: Open-Source AlphaEvolve Clone Using GPT-4.1 and Genetic Programming

14•Sai_Praneeth•16h ago•0 comments

Show HN: I Built a Prompt That Makes LLMs Think Like Heinlein's Fair Witness

https://fairwitness.bot/
9•9wzYQbTYsAIc•13h ago•6 comments

Show HN: Turn any workflow diagram into compilable, running and stateful code

https://workflows.diagrid.io/
105•yaronsc•5d ago•32 comments

Show HN: Model2vec-Rs – Fast Static Text Embeddings in Rust

https://github.com/MinishLab/model2vec-rs
59•Tananon•1d ago•14 comments