Morphology of a Marvel Movie

https://github.com/dhealy05/morphology_of_a_marvel_movie
3•higuidebot•2h ago

Comments

PaulHoule•2h ago
Cosine similarity works for this but the right way to think about it is as a classical ML classification problem with all the tools from

https://scikit-learn.org/stable/supervised_learning.html

For instance, you will probably get better results with an SVM, a not-so-deep perceptron, or maybe a random forest model than you will with cosine similarity. You can also calibrate the probabilities of such a model

https://scikit-learn.org/stable/modules/calibration.html

which is quite useful.
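
A minimal sketch of that comparison, assuming the script segments have already been turned into embedding vectors with labels. The data here is synthetic (`make_classification` stands in for real embeddings), the cosine baseline is a nearest-centroid rule chosen for illustration, and nothing in it comes from the linked repo:

```python
import numpy as np
from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC

# Synthetic stand-in for (embedding, label) pairs; real data is an assumption here.
X, y = make_classification(
    n_samples=600, n_features=32, n_informative=12, n_classes=3, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)


def cosine_centroid_predict(X_train, y_train, X_test):
    """Baseline: assign each test vector to the class centroid with highest cosine similarity."""
    classes = np.unique(y_train)
    centroids = np.stack([X_train[y_train == c].mean(axis=0) for c in classes])

    def unit(a):
        return a / np.linalg.norm(a, axis=1, keepdims=True)

    sims = unit(X_test) @ unit(centroids).T
    return classes[sims.argmax(axis=1)]


baseline_acc = (cosine_centroid_predict(X_train, y_train, X_test) == y_test).mean()

# Linear SVM wrapped in a calibrator so it returns class probabilities.
svm = CalibratedClassifierCV(LinearSVC(), method="sigmoid", cv=5)
svm.fit(X_train, y_train)
proba = svm.predict_proba(X_test)
svm_acc = (svm.predict(X_test) == y_test).mean()

print(f"cosine centroid accuracy: {baseline_acc:.3f}")
print(f"calibrated SVM accuracy:  {svm_acc:.3f}")
```

Which one wins depends on the data, so the numbers from synthetic input say nothing about the movie scripts; the point is only that both approaches fit the same train/test harness.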

higuidebot•2h ago
What do you think a "better" result would be here? Better by what metric?
PaulHoule•1h ago
Accuracy.

If you got N people (say N=10) to classify different segments of the script you'd find that they'd mostly agree about how to classify them but they wouldn't agree perfectly. You can get closer to a "gold truth" if you sit people together to discuss the difficult cases.

Any given classifier is going to be like one individual: if it is any good, it will mostly agree with the gold truth, but sometimes it won't. It's also true that some classifications will be ambiguous, since a segment of the script may have some characteristics of one class and some of another, or just might not fit rationally into the schema.
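
A rough sketch of how you might measure that agreement. The labels and class names below are made up, and a majority vote is used as a crude stand-in for the "sit people together and discuss" step:

```python
from collections import Counter

# Hypothetical labels: one row per script segment, one column per annotator.
labels = [
    ["setup", "setup", "setup"],
    ["battle", "battle", "setup"],
    ["battle", "banter", "battle"],
    ["banter", "banter", "banter"],
]


def majority_label(votes):
    """Return the most common label and the share of annotators who chose it."""
    label, count = Counter(votes).most_common(1)[0]
    return label, count / len(votes)


# Majority vote as an approximation of the gold truth (ties fall to the first label seen).
gold = [majority_label(votes) for votes in labels]


def agreement_with_gold(annotator_index):
    """Fraction of segments where one annotator matches the majority label."""
    hits = sum(row[annotator_index] == g for row, (g, _) in zip(labels, gold))
    return hits / len(labels)


for i in range(3):
    print(f"annotator {i}: {agreement_with_gold(i):.2f}")
```

A classifier's predictions can be dropped in as one more column, which puts it on the same footing as a human annotator. Segments with a low majority share are the ambiguous ones worth discussing.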

This toolbox

https://scikit-learn.org/stable/model_selection.html

is helpful for testing a number of different models over a range of parameters and deciding what works best. A classifier that is calibrated (returns a probability of class membership) can skip cases where it knows it doesn't know what it is talking about. In the financial world, a calibrated model plus a Kelly bettor can make money trading; an uncalibrated model will almost always lose money.
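
To make the last two points concrete, here is a small sketch in plain Python. The 0.8 threshold, the function names, and the example probabilities are assumptions picked for illustration. The Kelly formula is the standard one for a single bet with win probability p and net odds b:

```python
def decide(probabilities, threshold=0.8):
    """Return the most likely class, or None to abstain when confidence is too low."""
    label = max(probabilities, key=probabilities.get)
    return label if probabilities[label] >= threshold else None


def kelly_fraction(p, b):
    """Fraction of bankroll to stake: p = win probability, b = net odds (win b per 1 staked).

    Clamped at 0 so a bet with no edge means "don't bet".
    """
    return max(0.0, p - (1.0 - p) / b)


# Confident prediction is kept, ambiguous one is skipped.
print(decide({"battle": 0.9, "banter": 0.1}))
print(decide({"battle": 0.55, "banter": 0.45}))

# Even-money bet with a 60% win probability.
print(kelly_fraction(0.6, 1.0))
```

The stake is only as good as p. If the model says 0.6 when the true rate is 0.5, the formula sizes a bet on an edge that doesn't exist, which is why calibration matters here.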