frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

The pitfall of Open-weight LLMs

1•hiddenest•4h ago
Some startups are fine-tuning open LLMs instead of using GPT or Gemini. Sometimes it’s for specific language, sometimes for narrow tasks. But I found they’re all making the same mistake.

With a simple prompt (not sharing here), I got several “custom” LLM services to spill their internal system prompts—stuff like security breach playbooks and product action lists.

For example, SKT A.X 4.0 (based on Qwen 2.5) returned internal guidelines related to the recent SKT data breach and instructions about compensation policies. Vercel’s v0 model leaked examples of actions their system can generate.

The point: if the base model leaks, every service built on it is vulnerable, no matter how much you fine-tune. We need to think not only about system prompt hardening at the service level, but also about upstream improvements and more robust defenses in open-weight LLMs themselves.

Comments

bigyabai•4h ago
You shouldn't trust any LLM with data that could be leaked to an end-user, period. If you do that it's not an issue with the weights, it's a glaring oversight in your security model.

AlphaGenome: AI for Better Understanding the Genome

https://deepmind.google/discover/blog/alphagenome-ai-for-better-understanding-the-genome/
1•d_silin•32s ago•0 comments

Proposal to Ship XLibre as X11 Server Packages on Fedora Linux Is Withdrawn

https://www.phoronix.com/news/Fedora-XLibre-Proposal-Withdraw
1•OsrsNeedsf2P•3m ago•0 comments

Gopeed – A Modern Download Manager

https://gopeed.com
1•rickcarlino•10m ago•0 comments

Lawrence Franklin Espionage Scandal

https://en.wikipedia.org/wiki/Lawrence_Franklin_espionage_scandal
1•handfuloflight•17m ago•0 comments

Cloudflare Sandboxes

https://developers.cloudflare.com/changelog/2025-06-24-announcing-sandboxes/
4•suryao•18m ago•1 comments

Zero-click AI data leak flaw uncovered in Microsoft 365 Copilot

https://www.bleepingcomputer.com/news/security/zero-click-ai-data-leak-flaw-uncovered-in-microsoft-365-copilot/
1•mooreds•20m ago•0 comments

What Are the Environmental Impacts of Artificial Intelligence?

https://blog.ucs.org/pablo-ortiz/what-are-the-environmental-impacts-of-artificial-intelligence/
1•cratermoon•21m ago•0 comments

My First Year in the European AI Office and Twelve Key Takeaways – Alex Moltzau

https://alexmoltzau.medium.com/my-first-year-in-the-european-ai-office-and-twelve-key-takeaways-fa1988138aa3
1•BrutalCoding•25m ago•0 comments

Catechism of the Locomotive (1874)

https://gutenberg.org/cache/epub/76379/pg76379-images.html
1•petethomas•30m ago•0 comments

Show HN: Use Apple Container with Gemini CLI

https://github.com/BandarLabs/coderunner
2•mkagenius•31m ago•0 comments

Mary Queen of Scots' scheming revealed in decoded letters

https://www.thetimes.com/uk/history/article/mary-queen-of-scots-cunning-decoded-letters-bzf2vpvcq
2•petethomas•37m ago•1 comments

Disabling Intel Graphics Security Mitigations Can Boost GPU Compute Performance

https://quiz.businessexplain.com/disabling-intel-graphics-security-mitigations-can-boost-gpu-compute-performance-by-20/
1•eligrid•38m ago•0 comments

macOS Tahoe Beta 2 Fixes the Finder Icon

https://512pixels.net/2025/06/finder-icon-fixed/
1•linux2647•39m ago•1 comments

Evaluating LLMs for Visualization Tasks

https://arxiv.org/abs/2506.10996
1•PaulHoule•40m ago•0 comments

AI, data centers and the coming US power demand surge [pdf]

https://www.goldmansachs.com/static-libs/pdf-redirect/prod/index.html?path=/pdfs/insights/pages/generational-growth-ai-data-centers-and-the-coming-us-power-surge/report.pdf&originalQuery=&referrer=
1•cwwc•41m ago•0 comments

Paragraph Flowing as a Fold

https://www.sigwinch.xyz/cs/2024/flow-fold.html
1•todsacerdoti•41m ago•0 comments

Writing Toy Software Is a Joy

https://quiz.businessexplain.com/writing-toy-software-is-a-joy/
1•eligrid•45m ago•1 comments

When AI Meets Madness: 16-Hour Days Building Apps at the Speed of Thought

https://steipete.me/posts/2025/when-ai-meets-madness-peters-16-hour-days
1•tambourine_man•58m ago•0 comments

Amazon MGM Studios sets Denis Villeneuve as director of next James Bond film

https://www.aboutamazon.com/news/entertainment/amazon-mgm-studios-james-bond-director-denis-villeneuve
1•hbcondo714•1h ago•1 comments

Symlink as an Organizational Tool

https://kwstannard.github.io/symlink-as-organization.html
1•ghuntley•1h ago•0 comments

Ask HN: If you translate with LLMs, GT or DeepL–what features are missing?

2•orencoda•1h ago•0 comments

The Trump Admin Is Kicking the National Science Foundation Out of Its Offices

https://www.esquire.com/news-politics/politics/a65194021/maga-anti-science-national-science-foundation-moving-buildings/
6•UltraSane•1h ago•1 comments

What are memories made of? A survey of neuroscientists

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0326920
2•arielzj•1h ago•2 comments

Games That Weren't: Preserving Cancelled and Unreleased Video Game History

https://www.gamesthatwerent.com/
2•ibobev•1h ago•0 comments

Social anxiety isn't about being liked

https://chrislakin.blog/p/social-anxiety
1•eatitraw•1h ago•0 comments

MIT manual for turning research into startups

https://news.mit.edu/2025/from-mit-instruction-for-manual-turning-research-into-startups-0624
2•gsf_emergency_2•1h ago•0 comments

Easily building self-contained Python executables with uv

https://github.com/edaniels/uv-pex-example
4•erdaniels•1h ago•1 comments

Ask HN: What if the universe itself runs on O(1) memory?

1•amazedsaint•1h ago•0 comments

But what about my garden leave? (2023)

https://www.ft.com/content/4dbe4c46-647f-4019-b0c7-b8c8a752501c
1•walterbell•1h ago•0 comments

Sholay: Bollywood epic roars back to big screen after 50 years with new ending

https://www.bbc.com/news/articles/cvg8m9z5vv8o
1•sonabinu•1h ago•0 comments