
Fast dialog aware sentence splitter

https://github.com/KnowSeams/KnowSeams
2•SteveJS•5h ago

Comments

SteveJS•5h ago
I made this to try out Claude Code. I'm on the $17/mo plan, and I don't know Rust. (I do know plenty of other languages.) Rust felt like a scripting language when used this way.

I used a task system to force each change to reach a git commit before auto-compact. The completed tasks are in the repo, so you can see what starting context kicked off each change. It worked much of the time. It's 8k of Rust and 12k of markdown, and I think the markdown helps agents interact correctly with the codebase, much as unit tests help with refactoring.

On my i9 with a local Gutenberg mirror, the end-to-end run discovers 20k+ English novels, splits them into sentences, normalizes the sentences, keeps the originating text span for each one, and writes the results out as TSVs. It takes 7 seconds to complete that for 7 GB of novels. Most importantly, it splits the sentences the way I needed for the start of my pipeline.

Definitely interested if anyone finds cases where it mis-splits English text from a novel.
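For anyone wondering what "dialog aware" splitting with spans looks like in practice, here is a minimal sketch in Rust. It is not the KnowSeams implementation: the `Sentence` struct, `split_sentences`, `to_tsv`, and the quote heuristics are all assumptions made for illustration. It treats `.`, `!` and `?` as sentence ends only outside double quotes, lets a closing quote end a sentence only when the quote ended on a terminator and the next word is capitalized, and keeps the byte span of every sentence next to its whitespace-normalized text. Abbreviations such as "Mr." and a capitalized name right after a quote will still be mis-split by this version.

```rust
/// A sentence with its normalized text and the byte span it came from.
/// (Hypothetical type for this sketch, not taken from the KnowSeams repo.)
#[derive(Debug, PartialEq)]
pub struct Sentence {
    pub start: usize, // byte offset into the source, inclusive
    pub end: usize,   // byte offset into the source, exclusive
    pub text: String, // whitespace-normalized sentence
}

fn is_terminator(c: char) -> bool {
    matches!(c, '.' | '!' | '?')
}

/// Collapse every run of whitespace (including hard line wraps) to one space.
fn normalize(s: &str) -> String {
    s.split_whitespace().collect::<Vec<_>>().join(" ")
}

/// True if the first non-whitespace char at or after `from` is upper case
/// or an opening double quote.
fn next_starts_sentence(chars: &[(usize, char)], from: usize) -> bool {
    chars[from..]
        .iter()
        .map(|&(_, c)| c)
        .find(|c| !c.is_whitespace())
        .map_or(false, |c| c.is_uppercase() || c == '“' || c == '"')
}

/// Trim the raw slice, fix up the span, and record it unless it is empty.
fn push(out: &mut Vec<Sentence>, src: &str, start: usize, end: usize) {
    let raw = &src[start..end];
    let trimmed = raw.trim_start();
    let start = start + (raw.len() - trimmed.len());
    let trimmed = trimmed.trim_end();
    if trimmed.is_empty() {
        return;
    }
    out.push(Sentence { start, end: start + trimmed.len(), text: normalize(trimmed) });
}

/// Split `src` into sentences, keeping quoted dialog with its attribution.
pub fn split_sentences(src: &str) -> Vec<Sentence> {
    let chars: Vec<(usize, char)> = src.char_indices().collect();
    let mut out = Vec::new();
    let mut in_quote = false;
    let mut start = 0;

    for (k, &(i, c)) in chars.iter().enumerate() {
        let end = i + c.len_utf8();
        let next = chars.get(k + 1).map(|&(_, n)| n);
        let boundary = match c {
            '“' => {
                in_quote = true;
                false
            }
            '”' | '"' if in_quote => {
                in_quote = false;
                // A closing quote ends the sentence only if the quote ended
                // on a terminator and the following word is capitalized.
                let prev_terminal = k > 0 && is_terminator(chars[k - 1].1);
                prev_terminal && next_starts_sentence(&chars, k + 1)
            }
            '"' => {
                in_quote = true;
                false
            }
            // Outside quotes, a terminator followed by whitespace (or the
            // end of input) closes the sentence.
            _ if is_terminator(c) && !in_quote => next.map_or(true, char::is_whitespace),
            _ => false,
        };
        if boundary {
            push(&mut out, src, start, end);
            start = end;
        }
    }
    push(&mut out, src, start, src.len()); // trailing text without a terminator
    out
}

/// Render one `start<TAB>end<TAB>text` row per sentence.
pub fn to_tsv(sentences: &[Sentence]) -> String {
    let mut tsv = String::new();
    for s in sentences {
        // Normalized text has no tabs or newlines, so no escaping is needed.
        tsv.push_str(&format!("{}\t{}\t{}\n", s.start, s.end, s.text));
    }
    tsv
}
```

Calling `split_sentences` on a chapter and passing the result to `to_tsv` gives rows whose first two columns index straight back into the source text, so `&src[start..end]` always recovers the un-normalized original, hard line wraps included.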

The Sinclair ZX Spectrum Next Issue 3 Is Coming to Kickstarter This Saturday

https://www.specnext.com/the-sinclair-zx-spectrum-next-issue-3-is-coming-to-kickstarter-this-saturday/
1•whobre•3m ago•0 comments

Smart assistant to be developed to help people with dementia

https://www.uu.nl/en/news/smart-assistant-to-be-developed-to-help-people-with-dementia
1•geox•4m ago•0 comments

Can You Drink Saturn's Rings?

https://www.scientificamerican.com/article/can-you-drink-saturns-rings/
1•Bluestein•6m ago•0 comments

Cursor snaps up enterprise startup Koala in challenge to GitHub Copilot

https://techcrunch.com/2025/07/18/cursor-snaps-up-enterprise-startup-koala-in-challenge-to-github-copilot/
1•pseudolus•8m ago•0 comments

AI Powered Cat Flap

https://www.onlycat.com/
2•nikolayasdf123•16m ago•0 comments

A major AI training data set contains millions of examples of personal data

https://www.technologyreview.com/2025/07/18/1120466/a-major-ai-training-data-set-contains-millions-of-examples-of-personal-data/
1•pseudolus•20m ago•1 comments

The sumerian game early computer game

https://spillhistorie.no/2025/07/10/the-sumerian-game-the-ancestor-of-modern-city-builders/
2•christkv•21m ago•1 comments

Fstrings.wtf

https://fstrings.wtf/
5•darkamaul•21m ago•0 comments

I avoid using LLMs as a publisher and writer

https://lifehacky.net/prompt-0b953c089b44
1•tombarys•21m ago•1 comments

IIA team decodes reason behind May 2024 solar eruptions

https://www.thehindu.com/news/national/karnataka/iia-team-decodes-reason-behind-may-2024-solar-eruptions/article69827818.ece
1•Bluestein•22m ago•0 comments

Show HN: Vlm in 3D PC, 16 shot scanobjectnn top1 acc: 99.91

https://github.com/genji970/3d-vlm-gaussian-splatting-pointclip-on-modelnet40
1•genji970•27m ago•0 comments

First Space-Based Gravitational Wave Detector Begins Construction

https://spectrum.ieee.org/laser-interferometer-space-antenna
2•pseudolus•27m ago•0 comments

Petition: Repeal the Online Safety Act

https://petition.parliament.uk/petitions/722903
3•Bogdanp•28m ago•0 comments

Felix Baumgartner, Who Jumped from Stratosphere, Dies in Italy

https://www.theinternational.at/felix-baumgartner-who-jumped-from-stratosphere-dies-in-italy/
2•signa11•35m ago•0 comments

Base44 – build fully-functional apps in minutes with just your words

https://base44.com/?via=b44d
1•bubblehack3r•38m ago•0 comments

Homelab Tour (2022)

https://taoofmac.com/space/blog/2022/02/12/1930
1•rcarmo•41m ago•0 comments

China committee chair calls out Admin's decision to resume GPU sales to China

https://www.theregister.com/2025/07/18/trump_gpu_china/
3•rntn•43m ago•0 comments

The Vibes

https://taoofmac.com/space/blog/2025/05/13/2230
1•rcarmo•43m ago•0 comments

Ask HN: Is TAOCP helpful in real life reasoning?

1•hamiecod•44m ago•0 comments

Singapore actively dealing with ongoing cyberattack on critical infrastructure

https://www.channelnewsasia.com/singapore/unc3886-cyber-security-threat-actor-attack-singapore-5245791
3•hongsy•50m ago•0 comments

It Takes Two to Tango

https://avivbenyosef.com/it-takes-two-to-tango/
1•kiyanwang•55m ago•0 comments

Kolmogorov Complexity [20:48]

https://www.lesswrong.com/posts/KqgujtM3vSAfZE2dR/on-ilya-sutskever-s-a-theory-of-unsupervised-learning
2•Bluestein•1h ago•0 comments

The Remarkable Incompetence at the Heart of Tech

https://www.wheresyoured.at/the-remarkable-incompetence-at-the-heart-of-tech/
3•vermilingua•1h ago•0 comments

CLI converts YAML exercise plan to guided audio files

https://github.com/mrclmr/w2a
4•defree•1h ago•4 comments

Hypseus Singe: A program to play laserdisc arcade games

https://github.com/DirtBagXon/hypseus-singe
2•exvi•1h ago•0 comments

OpenAI claiming gold medal standard at IMO 2025

https://github.com/aw31/openai-imo-2025-proofs
6•ocfnash•1h ago•7 comments

Show HN: API Radar – Track Leaked API Keys in Public GitHub Repos

https://api-radar.live
1•zaim_abbasi•1h ago•0 comments

NASA's Escapade Mars Mission Will Launch on New Glenn in 2025

https://in.mashable.com/science/97280/blue-origin-confirms-nasas-escapade-mars-mission-will-launch-on-new-glenn-in-2025
1•Bluestein•1h ago•0 comments

Coworker Refuses to Change Code He Didnt Write Himself

1•kvajsvem333•1h ago•1 comments

Show HN: Single file transformers implementation for learning

https://gist.github.com/guilt/7534274c972c6a44ea555c21e43917f7
1•vkaku•1h ago•1 comments