frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Taking a Second Look

https://2ndsetai.substack.com/p/taking-a-second-look-an-unprecedentedly
3•jeffreysmith•1h ago

Comments

jeffreysmith•1h ago
Howdy, HN. Authors here. We got tired of text-to-image leaderboards that only focus on aesthetics, so we built our own benchmarks to test what matters for real work: fidelity to complex prompts, safety, bias, and IP infringement.

We analyzed 18 models and found that no single model is good at everything. For example, GPT-4o has the best safety guardrails but also a 98% IP infringement rate on celebrity likenesses. Google's Imagen 4 Ultra actively counters bias (e.g., 90% of its "CEOs" are female) but struggles with generating crowds. X AI's Grok 2 blocks almost nothing.

Lots more detail in the post. We'll be here all day to answer questions.

ianchenh•1h ago
Really unique viewpoint. Can't stress how rare it is these days for tech startups and companies to emphasize social responsibility, and crucially its potential to translate to profitability as well! Responsible AI isn't just a constraint on the field - controllability means quality and usability.

git-plot: plot changes using Unicode blocks

https://j.wied.co/git-plot.html
1•hwj•1m ago•0 comments

Ask HN: Which laptop can run the largest LLM model?

1•grokblah•1m ago•0 comments

Meta appoints anti-LGBTQ+ conspiracy theorist Robby Starbuck as AI bias advisor

https://www.thepinknews.com/2025/08/14/meta-robby-starbuck-ai/
2•CharlesW•2m ago•0 comments

GNU D compiler has been broken on FreeBSD 14 for over a year and no one noticed

https://briancallahan.net/blog/20250813.html
2•ingve•2m ago•0 comments

Tesla's Forgotten Founder Speaks Out – Exclusive with Martin Eberhard (YouTube) [video]

https://www.youtube.com/watch?v=88KHfX_kPIY
1•cletusw•4m ago•0 comments

What Musk, Altman and Others Say About AI-Funded 'Universal Basic Income'

https://www.wsj.com/tech/ai/universal-income-tech-executives-a16eb2d0
1•fortran77•8m ago•0 comments

Gemma 3-270M

https://huggingface.co/collections/ggml-org/gemma-3-270m-689e0105d56462786413d7fc
2•georgehill•9m ago•0 comments

Unaligned GPT-OSS-20B-base extracted from OpenAI's model

https://twitter.com/jxmnop/status/1955436067353502083
1•fragmede•10m ago•0 comments

Debate Website

https://bicker.ca/
1•lucasadilla•11m ago•1 comments

Show HN: I made a tool that turns niche research into daily marketing tasks

https://launchprint.deplo.yt
1•LeoGoverG•14m ago•0 comments

How we use a 3-stage, human-in-the-loop AI workflow to overhaul rsyslog's docs

https://www.rsyslog.com/shipping-better-docs-with-ai-restructuring-module-parameters-for-clarity-and-consistency/
1•rgerhards•14m ago•1 comments

The Internal Tooling Maturity Ladder

https://robbyonrails.com/articles/2025/08/13/internal-tooling-maturity-ladder/
1•mooreds•15m ago•0 comments

My Year of Rust

https://xavd.id/blog/post/my-year-of-rust/
1•ingve•17m ago•0 comments

Gemma 3 270M

https://twitter.com/osanseviero/status/1956024223773663291
2•tosh•18m ago•0 comments

Art of the Nerd Snipe

https://lichess.org/@/Toadofsky/blog/art-of-the-nerd-snipe/rxLpGts5
1•fzliu•18m ago•0 comments

Salmon as Keystone Species

https://en.wikipedia.org/wiki/Salmon_run
1•jijijijij•18m ago•0 comments

Show HN: Modelence – Supabase for MongoDB

https://github.com/modelence/modelence
4•artahian•18m ago•0 comments

Dam sabotage blamed on pro-Russia hackers

https://www.newsinenglish.no/2025/08/14/dam-sabotage-blamed-on-pro-russia-hackers/
2•gnabgib•18m ago•0 comments

The Consistency and Performance of the Iterative Bayesian Update

https://arxiv.org/abs/2508.09980
1•georgehe9•19m ago•0 comments

Pro-Russian hackers blamed for water dam sabotage in Norway

https://www.bleepingcomputer.com/news/security/pro-russian-hackers-blamed-for-water-dam-sabotage-in-norway/
1•gpi•20m ago•0 comments

We know so little about black holes, I still think we are inside one

https://bigthink.com/starts-with-a-bang/36-billion-solar-masses-heaviest-black-hole/
1•ieuanking•21m ago•1 comments

Futarchy's Fundamental Flaw

https://dynomight.net/futarchy-market/
1•crescit_eundo•22m ago•0 comments

Trump Reportedly Offering Putin Natural Resources Off Alaska

https://www.newsweek.com/alaska-russia-trump-resources-2113295
3•structuredPizza•22m ago•2 comments

From Stress Test to Skills Test: A Smarter Approach to Technical Interviews

https://samuelmullen.com/articles/from-stress-test-to-skills-test
1•samullen•23m ago•1 comments

Gemma 3 270M: The compact model for hyper-efficient AI

https://developers.googleblog.com/en/introducing-gemma-3-270m/
5•meetpateltech•23m ago•1 comments

Show HN: A visual size comparison tool for tech gadgets

https://comparisontabl.es/size-comparison/
1•GuidoL•24m ago•0 comments

I Made a Realtime C/C++ Build Visualizer

https://danielchasehooper.com/posts/syscall-build-snooping/
2•dhooper•25m ago•0 comments

AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?

https://arxiv.org/abs/2507.15887
1•PaulHoule•28m ago•0 comments

II Lines of Code

https://kaleidawave.github.io/posts/formatting-and-parsing-numbers/
1•kaleidawave•28m ago•0 comments

Google launches AI-powered flight search tool

https://blog.google/products/search/google-flights-ai-flight-deals/
2•thm•29m ago•0 comments