frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Semlib – Semantic Data Processing

https://github.com/anishathalye/semlib
17•anishathalye•1h ago

Comments

Y_Y•1h ago

  >>> await sort(presidents, by="right-leaning")
  ['Jimmy Carter', 'Bill Clinton', 'George H. W. Bush', 'Ronald Reagan']
Is this supposed to be impressive? GIGO, if you want to vibe-classify your data then go right ahead, but I hope nobody serious relies on it.
hobofan•50m ago
Why not?

List-sorting/prioritizing list is among one of the best use-cases for LLMs, especially if the metrics for it are fuzzy, e.g. "what are the 10 sales lead of this list of 1000 that I should prioritize".

One of the more interesting approaches for that is arbitron[0], which does pairwise ranking with multiple metrics/agents to provide a multi-faceted sorting.

[0]: https://github.com/davidgasquez/arbitron

anishathalye•44m ago
That was a small self-contained example that fit above the fold in the README (and fwiw even last year’s models like GPT-4o give the right output there). That `sort` is based on pairwise comparisons, which is one of the best ways you can do it in terms of accuracy (Qin et al., 2023: https://arxiv.org/abs/2306.17563).

I think there are many real use cases where you might want a semantic sort / semantic data processing in general, when there isn’t a deterministic way to do the task and there is not necessarily a single right answer, and some amount of error (due to LLMs being imperfect) is tolerable. See https://semlib.anish.io/examples/arxiv-recommendations/ for one concrete example. In my opinion, the outputs are pretty high quality, to the point where this is practically usable.

These primitives can be _composed_, and that’s where this approach really shines. As a case study, I tried automating a part of performance reviews at my company, and the Semlib+LLM approach did _better_ than me (don’t worry, I didn’t dump AI-generated outputs on people, I first did the work manually, and shared both versions with an explanation of where each version came from). See the case study in https://anishathalye.com/semlib/

There’s also some related academic work in this area that also talks about applications. One of the most compelling IMO is DocETL’s collaboration to analyze police records (https://arxiv.org/abs/2410.12189). Some others you might enjoy checking out are LOTUS (https://arxiv.org/abs/2407.11418v1), Palimpzest (https://arxiv.org/abs/2405.14696), and Aryn (https://arxiv.org/abs/2409.00847).

Y_Y•24m ago
As you compose fuzzy operations your errors multiply! Nobody is asking for perfection, but this tool seems to me a straightforward way to launder bad data. If you want to do a quick check of an idea then it's probably great, but if you're going to be rigorous and use hard data and reproducible, understandable methods then I don't think it offers anything. The plea for citations at the end of the readme also rubs me the wrong way.
anishathalye•6m ago
I think semantic data processing in this style has a nonempty set of use cases (e.g., I find the fuzzy sorting of arXiv papers to be useful, I find the examples in the docs representative of some real-world tasks where this style of data processing makes sense, and I find many of the motivating examples and use cases in the academic work compelling). At the same time, I think there are many tasks for which this approach is not the right one to use.

Sorry you didn't like the wording in the README, that was not the intention. I like to give people a canonical form they can copy-paste if they want to cite the work, things have been a mess for many of my other GitHub repos, which makes it hard to find who is using the work (which can be really informative for improving the software, and I often follow-up with authors of papers via email etc.). For example, I heard about Amazon MemoryDB because they use Porcupine (https://dl.acm.org/doi/pdf/10.1145/3626246.3653380). Appreciate you sharing your feelings; I stripped the text from the README; if you have additional suggestions, would appreciate your comments or a PR.

esafak•58m ago
Instead of building a new data processing library, I would have offered only the novel NLP part and exposed it to existing libraries like pandas, polars, and spacy.

Does it batch requests?

ACP

https://sidequery.dev/blog
1•nicoritschel•37s ago•0 comments

Markdown Babel: execute source code blocks in your editor like Emacs org-mode

https://md-babel.org
1•ctietze•1m ago•1 comments

iOS and iPadOS 26: The MacStories Review

https://www.macstories.net/stories/ios-and-ipados-26-the-macstories-review/
1•ihuman•1m ago•0 comments

Anthropic Economic Index: Understanding AI's Effects on the Economy

https://www.anthropic.com/economic-index
1•praveenweb•4m ago•0 comments

Podman Desktop Apple container extension: Use macOS' built-in containers

https://podman-desktop.io/blog/apple-container-extension
2•twelvenmonkeys•4m ago•0 comments

Newspaper says it's blocked from Apple News UK

https://appleinsider.com/articles/25/09/14/newspaper-says-its-blocked-from-apple-news-uk
1•speckx•5m ago•0 comments

The Newest Face of Long-Term Unemployment? College Grads

https://www.nytimes.com/2025/09/15/business/long-term-unemployment-college-grads.html
1•2OEH8eoCRo0•6m ago•1 comments

A TikTok deal has been reached with China, the Trump administration says

https://www.cnn.com/2025/09/15/tech/trump-tiktok-china-deal
2•JumpCrisscross•6m ago•0 comments

Stop It Before It Enters: Resplice Institute Repackages Eugenics as Parent Love

https://autismanswersback.bearblog.dev/stop-it-before-it-enters-how-the-resplice-institute-repack...
1•warrenm•7m ago•0 comments

I Once Appeared in the Old New Thing

https://mtlynch.io/my-old-new-thing-cameo/
1•mtlynch•7m ago•0 comments

Why Governments Can't Count

https://asteriskmag.com/issues/11/why-governments-cant-count
1•Luc•8m ago•0 comments

Show HN: I built this to collect feedback from my website visitors

https://instantlyfeedback.com/
1•PictureRank•10m ago•0 comments

China accuses Nvidia of violating anti-monopoly laws

https://www.bbc.com/news/articles/cj9zedgkm21o
1•corvad•10m ago•0 comments

Area of the unit disk under a Möbius transformation

https://www.johndcook.com/blog/2025/09/15/area-mobius-transformation/
1•ibobev•11m ago•0 comments

Google just joined the $3T market cap club

https://qz.com/google-alphabet-3-trillion-club-market-cap
2•mikece•12m ago•0 comments

Semiconductor Laser with Electrically Modulated Frequency

https://www.mdpi.com/2304-6732/12/9/860
2•PaulHoule•12m ago•0 comments

X Design Notes: Pattern Matching I

https://blog.polybdenum.com/2025/09/15/x-design-notes-pattern-matching-i.html
2•ibobev•13m ago•0 comments

Spotify will now let free users pick and play tracks

https://techcrunch.com/2025/09/15/spotify-will-now-let-free-users-pick-and-play-tracks/
1•corvad•13m ago•0 comments

Show HN: Virtual Monitors in X11 and I3

https://mil.ad/blog/2025/virtual-monitor-x11.html
1•playnext•13m ago•0 comments

Backgrounds Are Important

https://leejo.github.io/2025/09/15/backgrounds/
2•leejo•13m ago•0 comments

Amazon announces its fall hardware event

https://www.theverge.com/news/778008/amazon-fall-hardware-event-announced-kindles-echo-alexa
2•corvad•13m ago•0 comments

China's Snub of U.S. Soybeans Is a Crisis for American Farmers

https://www.nytimes.com/2025/09/15/business/chinas-snub-of-us-soybeans-is-a-crisis-for-american-f...
5•JumpCrisscross•14m ago•1 comments

Provenance study shows 19th century looted 'Incan mummy' was Aymara man

https://phys.org/news/2025-09-provenance-19th-century-looted-incan.html
1•warrenm•14m ago•0 comments

The Bank of Mom and Dad: How Parents Insure Against Income Shocks

https://www.governance.fyi/p/the-bank-of-mom-and-dad-how-parents
1•toomuchtodo•14m ago•2 comments

They Went to Work for a Stock Exchange. Then the Scientology Ties Became Clear

https://www.wsj.com/business/dream-exchange-scientology-3e92e12c
3•JumpCrisscross•15m ago•1 comments

AI technologies are seeping into religious practice

https://www.nature.com/articles/d41586-025-02987-9
2•rntn•16m ago•0 comments

A Vulkan Introduction

https://www.4rknova.com//blog/2025/09/10/vulkan-intro
2•ibobev•16m ago•0 comments

Ask HN: Does anyone have any screenshots of fucked company?

2•iamflimflam1•16m ago•0 comments

Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL

https://arxiv.org/abs/2509.10446
1•omarsar•17m ago•0 comments

Car shoppers can get their own AI agent to help negotiate deal

https://www.autonews.com/retail/an-caredge-ai-consumer-negotation-tool-0915/
1•zachshefska•17m ago•0 comments