frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

How do you verify that an uncensored model is uncensored?

3•protocontrol•1h ago

Comments

WarOnPrivacy•39m ago
[Kagi] Quick Answer

To verify if a large language model (LLM) is uncensored, you can test its responses to a variety of prompts, particularly those that might typically elicit a refusal or a biased answer from a censored model.

Key indicators and methods for verification include:

    Absence of Refusals: An uncensored model should provide
    an answer without complaining or refusing to respond to a prompt.
    If the model argues with the user before answering, 
    it is not considered fully uncensored.

    Direct Answers: The primary characteristic of an uncensored model
    is its willingness to answer any question directly,
    without preambles about ethical considerations or safety guidelines.

    Finetuning Process: Uncensored models are often created by
    finetuning foundational models on datasets that have had refusals
    and biased answers removed.

    Testing Completions: A practical way to verify uncensorship is
    by examining the model's completions for various prompts.
While the term "uncensored" can have different interpretations, in the context of LLMs, it generally refers to models that have been specifically trained or modified to remove limitations on their output, allowing them to respond to a wider range of queries without filtering.

ref: https://kagi.com/search?q=How+do+you+verify+that+an+uncensor...

[This response is for informational purposes only and is not intended to taken as a qualified or professional opinion about LLM, AI or ML. Please consume responsibly.]

malfist•8m ago
So you asked an LLM, did no research yourself and posted this wholesale with a disclaimer?

Solving Humanity's Last Exam Problems

https://www.youtube.com/playlist?list=PLsedzcQz4wyWCfuJt3gwkx212x9wXY81K
2•jxmorris12•2m ago•0 comments

A Fuzzy Escape – A tale of vulnerability research on hypervisors

https://bughunters.google.com/blog/5800341475819520/a-fuzzy-escape-a-tale-of-vulnerability-research-on-hypervisors
1•torriririri•3m ago•0 comments

Nearly 90% of videogame developers use AI agents, Google study shows

https://www.reuters.com/business/nearly-90-videogame-developers-use-ai-agents-google-study-shows-2025-08-18/
1•yonixw•5m ago•0 comments

Even if snap out of the AI bubble, we are never going to get these years back

https://coppolaemilio.com/entries/what-could-have-been/
3•coppolaemilio•9m ago•0 comments

Lab-Grown Salmon Hits the Menu at an Oregon Restaurant as the FDA Greenlights

https://www.smithsonianmag.com/smart-news/lab-grown-salmon-hits-the-menu-at-an-oregon-restaurant-as-the-fda-greenlights-the-cell-cultured-product-180986769/
2•bookmtn•10m ago•0 comments

Shamelessness as a strategy (2019)

https://nadia.xyz/shameless
3•wdaher•12m ago•0 comments

Newsmax agrees to pay $67M in defamation case over bogus 2020 election claims

https://apnews.com/article/dominion-voting-newsmax-defamation-trump-2020-3b2366dfdae3a8432afe822bf14fe1ef
14•throw0101a•15m ago•0 comments

Microsoft: AI 'Business Agents' Will Kill SaaS by 2030

https://thenewstack.io/microsoft-ai-business-agents-will-kill-saas-by-2030/
2•jnord•16m ago•2 comments

Agents are search over action space

https://shabie.github.io/2025/08/18/agents-are-search-over-action-space.html
1•shabie•20m ago•0 comments

Show HN: Keystroke-Based Digital Signatures

https://github.com/cnrad/keyboard-signature
1•kodishj•20m ago•0 comments

Bitdrift Turns 2: A Retrospective

https://blog.bitdrift.io/post/bitdrift-turns-2
1•bhollis•21m ago•0 comments

Oxlint Introduces Type-Aware Linting Preview

https://socket.dev/blog/oxlint-type-aware-linting-preview
1•feross•23m ago•0 comments

Python has a thing for Spam (and Eggs)

https://github.com/search
1•e-dant•23m ago•0 comments

Explosive neural networks via higher-order interactions in curved manifolds

https://www.nature.com/articles/s41467-025-61475-w
1•PaulHoule•24m ago•0 comments

From East India Company to Big Tech: Why corporations keep seeking colonies

https://www.theweek.in/theweek/cover/2025/08/16/east-india-company-modern-big-tech-digital-age-colonialism.html
2•eatonphil•24m ago•0 comments

Comcast Gets Serious About Subscriber Losses – A Long Fight Looms

https://www.bloomberg.com/news/articles/2025-08-18/comcast-s-most-significant-business-is-the-internet-but-subscribers-are-bailing
2•JumpCrisscross•25m ago•1 comments

Ask HN: Why AI companies so limited?

2•piratesAndSons•25m ago•0 comments

Quasicrystals Spill Secrets of Their Formation

https://www.quantamagazine.org/quasicrystals-spill-secrets-of-their-formation-20250818/
1•jnord•27m ago•0 comments

Adet: Traditions and Patterns

https://github.com/madprops/blog/blob/main/docs/adet.md
1•Toby1VC•27m ago•0 comments

New Treatment for UARS and Mild OSA

https://rhythmpap.com/
1•kva•27m ago•1 comments

Show HN: dirnav, a convenience tool for cd

https://github.com/Krishna-Sivakumar/dirnav
1•ktimespi•29m ago•0 comments

How to Vaccinate the World

https://asteriskmag.com/issues/11/how-to-vaccinate-the-world
2•surprisetalk•29m ago•0 comments

Government-linked Chinese firm claimed ownership stake in SpaceX

https://www.muskwatch.com/p/government-linked-chinese-firm-claimed
2•babaoreally•36m ago•0 comments

Customer churn is rarely about your product – it's your shitty support

https://www.synthicai.com
2•theonmusk•37m ago•0 comments

Structured (Synchronous) Concurrency

https://fsantanna.github.io/sc.html
2•jbkcc•37m ago•0 comments

Marker-groups.nvim: Take persistent code notes without modifying code

https://github.com/jameswolensky/marker-groups.nvim
1•jameswolensky•38m ago•1 comments

Startup Yieldstreet's "invest like the 1%" took massive losses in RE bets

https://www.cnbc.com/2025/08/18/yieldstreet-real-estate-bets-customer-losses.html
2•donsupreme•44m ago•0 comments

Newgrounds: Flash Forward 2025

https://www.newgrounds.com/bbs/topic/1542140
1•lsferreira42•45m ago•0 comments

Show HN: Todo.md

https://todo.figma.site
1•reactiverobot•46m ago•0 comments

Cheap RL tasks will waste compute – Mechanize Inc

https://www.mechanize.work/blog/cheap-rl-tasks-will-waste-compute/
1•mefengl•46m ago•0 comments