frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Special Dyslexia Fonts Are Based on Voodoo Pseudoscience

https://daringfireball.net/linked/2025/12/12/dyslexia-fonts-pseudoscience
1•erickhill•42s ago•0 comments

We Rebuilt Settings in Zed

https://zed.dev/blog/settings-ui
1•erhuve•3m ago•0 comments

US TikTok investors in limbo as deal set to be delayed again

https://www.bbc.com/news/articles/cp34442z25ko
1•1659447091•4m ago•0 comments

Compute in Space: a first principles interactive model

https://astrocompute.dev/
1•kvee•7m ago•0 comments

Industrialized Cybercrime Targets Trust in Public and Private Sectors

https://oilprice.com/Geopolitics/International/Industrialized-Cybercrime-Targets-Trust-in-Public-...
2•PaulHoule•7m ago•0 comments

Turning my reading list into podcasts

https://www.coryd.dev/posts/2025/turning-my-reading-list-into-podcasts
1•cdrnsf•8m ago•0 comments

Why RSS Matters

https://werd.io/why-rss-matters/
2•cdrnsf•8m ago•0 comments

Arduino UNO Q

https://www.arduino.cc/product-uno-q
2•swatson741•11m ago•0 comments

The Biggest Causes of Medical Device Recalls

https://spectrum.ieee.org/medical-device-recalls
2•sohkamyung•14m ago•0 comments

Show HN: Team-first Slack bot that turns bug reports into PRs using Claude

https://github.com/MattKilmer/claude-autofix-bot
1•madcash•14m ago•0 comments

Magit-insert-worktrees improves status buffers

https://huonw.github.io/blog/2025/12/magit-insert-worktrees/
2•dbaupp•14m ago•0 comments

Atmospheric CO₂ Monitoring Dashboard

https://climate.portaljs.com/co2-monitoring
1•CharlesW•16m ago•0 comments

Ukrainians sue US chip firms for powering Russian drones, missiles

https://arstechnica.com/tech-policy/2025/12/ukrainians-sue-us-chip-firms-for-powering-russian-dro...
6•voxadam•17m ago•0 comments

Show HN: Spacelink – link budget / comm system modeling Python library

https://github.com/cascade-space-co/spacelink
1•n6hpa•19m ago•0 comments

Rethinking Data Integrity: Why Domain-Driven Design Is Crucial

https://thenewstack.io/rethinking-data-integrity-why-domain-driven-design-is-crucial/
1•franckpachot•22m ago•0 comments

Post-Quantum Cryptography on CHERIoT

https://cheriot.org/pqc/2025/12/12/pqc-on-cheriot.html
3•todsacerdoti•27m ago•0 comments

Arkansas becoming first state to sever ties with PBS, effective July 1

https://www.ctvnews.ca/world/article/arkansas-becoming-1st-state-to-sever-ties-with-pbs-effective...
1•kotaKat•28m ago•0 comments

Trump signs order to block states from enforcing own AI rules

https://www.bbc.com/news/articles/crmddnge9yro
5•deliass•29m ago•0 comments

Defrag.exfat Is Inefficient and Dangerous

https://github.com/exfatprogs/exfatprogs/issues/318
3•dxdxdt•30m ago•0 comments

The Beauty of Dissonance

https://www.plough.com/en/topics/culture/music/the-beauty-of-dissonance
1•tintinnabula•34m ago•0 comments

A LLM trained only on data from certain time periods to reduce modern bias

https://github.com/haykgrigo3/TimeCapsuleLLM
3•jpalomaki•34m ago•0 comments

Show HN: StudioArt, A photo sharing website for creatives

https://atstudioart.netlify.app/
1•telui•37m ago•0 comments

Measuring postMessage Delays with the Delayed Message Timing API

https://blogs.windows.com/msedgedev/2025/12/09/making-complex-web-apps-faster/
2•joonehur•38m ago•1 comments

Rebuilding Our Website for the Agent Era

https://www.prefect.io/blog/rebuilding-our-website-for-the-agent-era
5•cicdw•40m ago•1 comments

Engineering analysis of 3I/ATLAS as a sublimation-driven body

https://osf.io/w23nv
2•Alis_Muzar•41m ago•0 comments

Show HN: EdgeVec – Sub-millisecond vector search in the browser (Rust/WASM)

https://github.com/matte1782/edgevec
2•matteo1782•45m ago•1 comments

'Mamdani Effect' Is Seeing More People Moving to New York, Not Leaving It

https://www.newsweek.com/mamdani-effect-more-people-moving-new-york-city-not-leaving-11193747
2•saubeidl•46m ago•0 comments

Portals must bend gravity [video]

https://www.youtube.com/watch?v=DydIhwLrbMk
2•ahlCVA•46m ago•0 comments

Show HN: I needed to record mobile web demos with my face, so I built this

https://www.youtube.com/watch?v=c_fq0TzlsXI
1•admtal•47m ago•0 comments

Show HN: PharmVault – Secure Notes with Spring Boot and JWT

https://github.com/nifski/PharmVault
2•nifemi1234•47m ago•3 comments
Open in hackernews

Ask HN: Who is honestly evaluating AI outputs and how?

2•toddmorey•5h ago
Especially with multimodal AI conversations, evaluating and benchmarking these models is an increasingly complex topic, but a frustrating interaction with AI can really leave customers feeling sour about your whole product / service.

For an in-product AI assistant (with grounding, doc retrieval, and tool calling) I'm having a hard time wrapping my head around how to evaluate and monitor its success with customer interactions, prompt adherence, correctness and appropriateness, etc.

Any tips or resources that have been helpful to folks investing this challenge? Would love to learn. What does your stack / process look like?