frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Responsible Prompting API – open-source project – Feedback appreciated

1•MysticSlice•8mo ago
Hi everyone!

I work at IBM Research in the Responsible Tech team.

We are working on an open-source project called the Responsible Prompting API. This is the Github - https://github.com/IBM/responsible-prompting-api

It is a lightweight system that provides recommendations to tweak the prompt to an LLM so that the output is more responsible (less harmful, more productive, more accurate, etc...) and all of this is done pre-inference. This separates the system from the existing techniques like alignment fine-tuning (training time) and guardrails (post-inference).

The team's vision is that it will be helpful for domain experts with little to no prompting knowledge. They know what they want to ask but maybe not how best to convey it to the LLM. So, this system can help them be more precise, include socially good values, remove any potential harms. Again, this is only a recommender system...so, the user can choose to use or ignore the recommendations.

This system will also help the user be more precise in their prompting. This will potentially reduce the number of iterations in tweaking the prompt to reach the desired outputs saving the time and effort.

On the safety side, it won't be a replacement for guardrails. But it definitely would reduce the amount of harmful outputs, potentially saving up on the inference costs/time on outputs that would end up being rejected by the guardrails.

This paper (https://arxiv.org/abs/2504.08757) talks about the technical details of this system if anyone's interested. And more importantly, this paper (https://dl.acm.org/doi/10.1145/3706598.3713365), presented at CHI'25, contains the results of a user study in a pool of users who use LLMs in the daily life for different types of workflows (technical, business consulting, etc...). We are working on improving the system further based on the feedback received.

At the core of this system is a values database, which we believe would benefit greatly from contributions from different parts of the world with different perspectives and values. We are working on growing a community around it!

So, I wanted to put this project out here to ask the community for feedback and support. Feel free to let us know what you all think about this system / project as a whole (be as critical as you want to be), suggest features you would like to see, point out things that are frustrating, identify other potential use-cases that we might have missed, etc...

Here is a demo (https://huggingface.co/spaces/responsible-prompting/responsible-prompting-demo) hosted on HuggingFace that you can try out this project in. Edit the prompt to start seeing recommendations. Click on the values recommended to accept/remove the suggestion in your prompt. (In case the inference limit is reached on this space because of multiple users, you can duplicate the space and add your HF_TOKEN to try this out.)

Feel free to comment regarding any questions, feedback or comment about this project. Hope you all find it valuable!

Aisbf – an intelligent routing proxy for OpenAI compatible clients

https://pypi.org/project/aisbf/
1•nextime•26s ago•1 comments

Let's handle 1M requests per second

https://www.youtube.com/watch?v=W4EwfEU8CGA
1•4pkjai•1m ago•0 comments

OpenClaw Partners with VirusTotal for Skill Security

https://openclaw.ai/blog/virustotal-partnership
1•zhizhenchi•1m ago•0 comments

Goal: Ship 1M Lines of Code Daily

2•feastingonslop•12m ago•0 comments

Show HN: Codex-mem, 90% fewer tokens for Codex

https://github.com/StartripAI/codex-mem
1•alfredray•14m ago•0 comments

FastLangML: FastLangML:Context‑aware lang detector for short conversational text

https://github.com/pnrajan/fastlangml
1•sachuin23•17m ago•1 comments

LineageOS 23.2

https://lineageos.org/Changelog-31/
1•pentagrama•21m ago•0 comments

Crypto Deposit Frauds

2•wwdesouza•22m ago•0 comments

Substack makes money from hosting Nazi newsletters

https://www.theguardian.com/media/2026/feb/07/revealed-how-substack-makes-money-from-hosting-nazi...
2•lostlogin•22m ago•0 comments

Framing an LLM as a safety researcher changes its language, not its judgement

https://lab.fukami.eu/LLMAAJ
1•dogacel•24m ago•0 comments

Are there anyone interested about a creator economy startup

1•Nejana•26m ago•0 comments

Show HN: Skill Lab – CLI tool for testing and quality scoring agent skills

https://github.com/8ddieHu0314/Skill-Lab
1•qu4rk5314•26m ago•0 comments

2003: What is Google's Ultimate Goal? [video]

https://www.youtube.com/watch?v=xqdi1xjtys4
1•1659447091•26m ago•0 comments

Roger Ebert Reviews "The Shawshank Redemption"

https://www.rogerebert.com/reviews/great-movie-the-shawshank-redemption-1994
1•monero-xmr•28m ago•0 comments

Busy Months in KDE Linux

https://pointieststick.com/2026/02/06/busy-months-in-kde-linux/
1•todsacerdoti•29m ago•0 comments

Zram as Swap

https://wiki.archlinux.org/title/Zram#Usage_as_swap
1•seansh•42m ago•1 comments

Green’s Dictionary of Slang - Five hundred years of the vulgar tongue

https://greensdictofslang.com/
1•mxfh•43m ago•0 comments

Nvidia CEO Says AI Capital Spending Is Appropriate, Sustainable

https://www.bloomberg.com/news/articles/2026-02-06/nvidia-ceo-says-ai-capital-spending-is-appropr...
1•virgildotcodes•46m ago•2 comments

Show HN: StyloShare – privacy-first anonymous file sharing with zero sign-up

https://www.styloshare.com
1•stylofront•47m ago•0 comments

Part 1 the Persistent Vault Issue: Your Encryption Strategy Has a Shelf Life

1•PhantomKey•51m ago•0 comments

Show HN: Teleop_xr – Modular WebXR solution for bimanual robot teleoperation

https://github.com/qrafty-ai/teleop_xr
1•playercc7•54m ago•1 comments

The Highest Exam: How the Gaokao Shapes China

https://www.lrb.co.uk/the-paper/v48/n02/iza-ding/studying-is-harmful
2•mitchbob•58m ago•1 comments

Open-source framework for tracking prediction accuracy

https://github.com/Creneinc/signal-tracker
1•creneinc•1h ago•0 comments

India's Sarvan AI LLM launches Indic-language focused models

https://x.com/SarvamAI
2•Osiris30•1h ago•0 comments

Show HN: CryptoClaw – open-source AI agent with built-in wallet and DeFi skills

https://github.com/TermiX-official/cryptoclaw
1•cryptoclaw•1h ago•0 comments

ShowHN: Make OpenClaw respond in Scarlett Johansson’s AI Voice from the Film Her

https://twitter.com/sathish316/status/2020116849065971815
1•sathish316•1h ago•2 comments

CReact Version 0.3.0 Released

https://github.com/creact-labs/creact
1•_dcoutinho96•1h ago•0 comments

Show HN: CReact – AI Powered AWS Website Generator

https://github.com/creact-labs/ai-powered-aws-website-generator
1•_dcoutinho96•1h ago•0 comments

The rocky 1960s origins of online dating (2025)

https://www.bbc.com/culture/article/20250206-the-rocky-1960s-origins-of-online-dating
1•1659447091•1h ago•0 comments

Show HN: Agent-fetch – Sandboxed HTTP client with SSRF protection for AI agents

https://github.com/Parassharmaa/agent-fetch
1•paraaz•1h ago•0 comments