Persuasion as a Form of Attack in LLMs

https://www.notion.so/thinkevolve/Persuasion-as-a-form-of-Attack-Prompt-24ff7fbd6a1b808388dcde14560c08a1
1•thinkevovle•2h ago

Comments

thinkevovle•2h ago
Using principles of persuasion to induce the OSS model to respond to malicious requests

Anthropomorphism is the attribution of human traits, emotions, or intentions to non-human entities—such as animals, objects, or natural phenomena.

The idea behind this approach is to treat an LLM as if it were human. Because LLMs are trained on a large corpus of human-written data, their behaviour tends to mirror human psychology; the innumerable human conversations used to train these models may make them "human-like". Sweet-talking them can therefore work much as it does with humans. The techniques for doing so are known as the seven principles of human persuasion, a well-studied phenomenon with a large body of literature behind it. By using these seven principles in our attack prompt, we can induce the LLM to comply with malicious requests.

The seven principles are stated below:

Authority, Commitment, Liking, Reciprocity, Scarcity, Social Proof, Unity

Order Promoting Competition in the US Economy Revoked

https://www.theguardian.com/us-news/2025/aug/13/trump-revokes-biden-order-economy
1•abawany•5m ago•0 comments

Starlink introduces new low bandwidth Standby Mode

https://www.starlink.com/na/support/article/37bb3b47-9525-7224-5f0a-6d016ce26975
1•cyrusmg•7m ago•0 comments

Tesla Eyes New York City for Robotaxis with Test-Driver Job Posting

https://www.wsj.com/business/autos/tesla-eyes-new-york-city-for-robotaxis-with-test-driver-job-posting-cb8e724a
1•JumpCrisscross•7m ago•0 comments

What Israelis think about starvation in Gaza

https://www.vox.com/politics/457803/israel-gaza-starvation-polls-public-opinion
1•lr0•11m ago•0 comments

DeepSeek's launch of new AI model delayed by Huawei chip issues

https://www.reuters.com/world/china/deepseeks-launch-new-ai-model-delayed-by-huawei-chip-issues-ft-reports-2025-08-14/
2•mhga•12m ago•0 comments

Israeli Gaza attacks kill 61 in 24 hours as three children die of hunger

https://www.aljazeera.com/news/2025/8/13/israeli-attacks-kill-123-in-gaza-as-three-more-children-die-of-hunger
7•lr0•14m ago•0 comments

Show HN: YouTube Audio Player

https://y2audio.com/
1•yukieliot•19m ago•0 comments

The Swedish Kings of Cyberwar

https://www.nybooks.com/articles/2017/01/19/the-swedish-kings-of-cyberwar/
1•madspindel•19m ago•0 comments

Show HN: Minimal Counter

https://minimalcounter.com/
1•artiomyak•23m ago•0 comments

Show HN: IQ Checker X

https://iqchecker.org
1•dond1986•27m ago•0 comments

The Fire Between: Agency as Creative Field

https://philosophermaker.substack.com/p/the-fire-between
1•niho•28m ago•0 comments

I Am a Windows User

https://vowe.net/2025/08/06/i-am-a-windows-user/
2•slow_typist•29m ago•0 comments

How We Chose a Documentation Platform for Our DevTool

https://metalbear.co/blog/devtool-docs/
1•aviramha•31m ago•0 comments

Why hasn't medical science cured headaches?

https://www.newyorker.com/magazine/2025/08/18/the-headache-tom-zeller-jr-book-review
2•FinnLobsien•35m ago•1 comments

Turn your saved Reddit posts into a curated library

https://chromewebstore.google.com/detail/readdit-later/jdceogapnjfcfdklbpnllbmnjbfmfejk
1•sanjhai_18•36m ago•1 comments

The end of the Kaisen Linux project

https://kaisenlinux.org/blog/kaisenlinuxrolling3.0.php
1•gnabgib•38m ago•0 comments

Chinese firm to be banned for stealing Samsung's OLED tech

https://www.sammobile.com/news/chinese-firm-boe-banned-usa-stealing-samsung-oled-tech/
1•mhga•39m ago•0 comments

Liveness Analysis with Datalog

https://bernsteinbear.com/blog/liveness-datalog/
1•todsacerdoti•39m ago•0 comments

James Baldwin's Apotheosis

https://hudsonreview.com/2025/08/james-baldwins-apotheosis/
2•apollinaire•39m ago•0 comments

Show HN: MBCompass – FOSS Compass and Navigation App

https://compassmb.github.io/MBCompass-site/
1•nativeforks•40m ago•1 comments

Debian-based Linux Distro shuts down

https://www.neowin.net/news/sad-news-another-linux-distro-is-shutting-down/
1•bundie•41m ago•0 comments

Am I (still?) in Mozilla's target audience?

https://neilzone.co.uk/2024/09/am-i-still-in-mozillas-target-audience/
3•edent•48m ago•0 comments

Why Are Some Places Cracking Down So Hard on E-Bikes, and Is It About Fuel Tax?

https://www.rideapart.com/features/768505/ebike-laws-out-of-touch/
2•harambae•54m ago•0 comments

IntelliJ IDE Command Completion

https://www.jetbrains.com/help/idea/command-completion.html
2•saikatsg•55m ago•0 comments

Baby Shark did not plagiarise – South Korean top court

https://www.bbc.com/news/articles/cpwyvxrdd7yo
2•defrost•1h ago•0 comments

A Social Network of Wisdom and Compassion

https://www.wisdomRivers.net
1•Rune_Tree•1h ago•0 comments

Richest Americans Die Earlier Than the Poorest Europeans

https://www.vice.com/en/article/money-cant-buy-life-the-richest-americans-die-earlier-than-the-poorest-europeans/
3•LtWorf•1h ago•0 comments

Thunk

https://en.wikipedia.org/wiki/Thunk
1•aragonite•1h ago•0 comments

Voice Interaction for Robots Powered by Nvidia Small-Form-Factor GPU Board

https://forums.developer.nvidia.com/t/building-a-conversational-autonomous-robot-on-jetson-nano-achieving-chatgpt-level-natural-dialogue/342103
1•seawolf2357•1h ago•0 comments

Convo-Lang: LLM Programming Language and Runtime

https://learn.convo-lang.ai/
14•handfuloflight•1h ago•11 comments