frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

How are you reducing LLM token costs for async workflows?

https://github.com/parallem-ai/parallem
1•alexliu79•1h ago

Comments

alexliu79•1h ago
We’ve been exploring one specific cost issue in AI products: a lot of async-friendly LLM workloads still run synchronously, which seems to create unnecessary token spend.

I’m curious how people here are handling this in practice for evals, extraction pipelines, classification jobs, or other multi-step workflows.

Are you using batch APIs already? Building internal tooling? Or just accepting the extra cost because batch workflows are too painful to adopt?

We’ve been building an open-source library called ParaLLeM to make it easier to move agent workflows from sync to batch without rewriting everything, and I’d love to understand how others are approaching this problem.

Repo: https://github.com/parallem-ai/parallem

vienneraphael•18m ago
For non-urgent workflows I mostly use Batch APIs. As you said, the bare batch APIs are a pain to use.

On top of that, I would add that most async-to-batch libraries force users to learn a new framework or refactor their existing code, which is a huge friction in itself.

I've been in those trenches as a developer and I decided to create a literal 2-liner python lib that gets you from async to batch: https://github.com/vienneraphael/batchling

You don't need to change your code and it supports most providers (Anthropic, Gemini, Groq, Mistral, OpenAI, Together, Vertex, XAI, Doubleword) and all imagineable python frameworks (Langchain, PydanticAI, Instructor, DSPy, LiteLLM, Pydantic Evals, ..)

The Economics of Software Teams: Why Most Engineering Orgs Are Flying Blind

https://www.viktorcessan.com/the-economics-of-software-teams/
1•kiyanwang•5m ago•0 comments

Relocating Rigor

https://aicoding.leaflet.pub/3mbrvhyye4k2e
1•kiyanwang•9m ago•0 comments

Trovare – I built a search aggregator with 39 adapters across 11 platforms

https://trovare.ro/
1•mariusmitrofan•9m ago•0 comments

X Randomly Banning Users for "Inauthentic Behavior"

https://old.reddit.com/r/LinusTechTips/comments/1rsdk7i/anybody_here_talking_about_the_massive/
3•crmrc114•11m ago•2 comments

How to Share Terraform State (Tutorial)

https://spacelift.io/blog/how-to-share-terraform-state
2•kat-w•16m ago•0 comments

Gall's Law – Laws of Software

https://www.laws-of-software.com/laws/gall/
1•fagnerbrack•18m ago•0 comments

Why successful people often have bad opinions online

https://greyenlightenment.com/2026/04/02/why-successful-people-often-have-bad-opinions-online/
2•paulpauper•18m ago•1 comments

Taking a Look at Compression Algorithms – Moncef Abboud

https://cefboud.com/posts/compression/
1•fagnerbrack•18m ago•0 comments

The Future Is Neuro-Symbolic: Where Has It Been, and Where Is It Going?

https://ojs.aaai.org/index.php/AAAI/article/view/42130
4•walterbell•22m ago•1 comments

The Synth Is Not the Box

https://www.lunchfirm.com/blog/posts/localdsp-claude/
1•mcdowell_atx•24m ago•0 comments

Nintendo's Greed Could Change the Tech Industry [video]

https://www.youtube.com/watch?v=Lh9gnkF-Imo
1•josephcsible•24m ago•0 comments

The Future of Everything Is Lies, I Guess: Psychological Hazards

https://aphyr.com/posts/416-the-future-of-everything-is-lies-i-guess-psychological-hazards
1•vermilingua•25m ago•0 comments

Why didn't IPv6 work in my home network?

https://gowtham.dev/blog/ipv6-problems.html
2•gowthamgts12•29m ago•0 comments

Why I Don't Ship Preferences

https://jorviksoftware.cc/notes/2026/04/13/why-i-do-not-ship-preferences
1•jonathan_hollin•31m ago•0 comments

Splitting Mounjaro pens for fun and profit

https://www.lesswrong.com/posts/EsYEuNDEm96DCb8Jy/splitting-mounjaro-pens-for-fun-and-profit
1•henryaj•36m ago•0 comments

Satoshi Has the Right to Hide. We Have the Right to Search for Him

https://www.thefp.com/p/satoshi-has-the-right-to-hide
1•fortran77•36m ago•1 comments

Private firms providing services to NHS made £1.6B profit in two years

https://www.theguardian.com/society/2026/apr/13/private-companies-nhs-services-profit-chpi-research
2•gpi•40m ago•0 comments

Clinical trial shows gene editing works for β-Thalassaemia, too

https://arstechnica.com/science/2026/04/clinical-trial-shows-gene-editing-works-for-%ce%b2-thalas...
1•bryanrasmussen•44m ago•0 comments

Show HW: Implementing denoising diffusion probabilistic models from scratch

https://github.com/aldipiroli/ddpm_from_scratch
2•tgnk2341•47m ago•0 comments

Show HN: Bad Apple (Oscilloscope-Like) – one stroke per frame

https://bad-apple-on-oscilloscope.pages.dev/
1•araniwa•49m ago•1 comments

The Paper Computer

https://jsomers.net/blog/the-paper-computer
1•jsomers•53m ago•0 comments

No Degree, $155K Pay: Trump's FAA Is Recruiting Gamers as Air Traffic Controller

https://www.ibtimes.co.uk/faa-recruits-gamers-air-traffic-control-1791239
6•onemoresoop•58m ago•2 comments

Altman Shooters Are Paid Actors

https://twitter.com/boneGPT/status/2043550790095134846
4•fakeshoot•1h ago•1 comments

Ask HN: Insta, X, Reddit or HN?

1•wasimsk•1h ago•0 comments

What web browser do you use?

1•nicp•1h ago•3 comments

Reconciling Two UC Berkeleys – The Soapbox

https://www.dailycal.org/opinion/the_soapbox/reconciling-two-uc-berkeleys/article_0fe38fc6-1399-4...
2•paulpauper•1h ago•1 comments

My Dialogue with Jonathan Zittrain

https://marginalrevolution.com/marginalrevolution/2026/04/my-dialogue-with-jonathan-zittrain.html
3•paulpauper•1h ago•0 comments

Your Single Use iPhone [video]

https://www.youtube.com/watch?v=NG-lLt5X3Rs
3•Klaster_1•1h ago•0 comments

A unified Go SDK for working with large language models

https://github.com/aarock1234/ai
1•abdelsabbah•1h ago•1 comments

Lightweight internet radio management tool

https://github.com/tchovi/AirBoneRadio
1•Indigenism•1h ago•1 comments