Show HN: Local task classifier and dispatcher on RTX 3080

https://github.com/resilientworkflowsentinel/resilient-workflow-sentinel
8•Shubham_Amb•4h ago
Hi HN, I'm Shubham, a 3D artist who learned to code in college as an I.T. graduate. I know the logic but I'm not an expert; I just wanted to try my hand at AI.

So I built Resilient Workflow Sentinel, an offline AI agent that classifies task urgency (Low, Medium, or High) and dispatches tasks to candidates based on availability. I wanted an offline system that a person can trust to keep sensitive data completely local.
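As a rough illustration of the dispatch step (not the project's actual code), here is a minimal sketch. It assumes each candidate has an availability flag and a count of open tasks; the `Candidate` class, the `dispatch` function, and the "fewest open tasks wins" rule are hypothetical choices made for this example.

```python
from dataclasses import dataclass
from typing import Optional

URGENCY_LEVELS = ("Low", "Medium", "High")  # labels taken from the post


@dataclass
class Candidate:
    name: str
    available: bool
    open_tasks: int = 0  # hypothetical load measure


def dispatch(urgency: str, candidates: list[Candidate]) -> Optional[Candidate]:
    """Assign a task to the available candidate with the fewest open tasks."""
    if urgency not in URGENCY_LEVELS:
        raise ValueError(f"unknown urgency: {urgency!r}")
    free = [c for c in candidates if c.available]
    if not free:
        return None  # nobody is available; the caller decides whether to queue
    chosen = min(free, key=lambda c: c.open_tasks)
    chosen.open_tasks += 1
    return chosen


team = [
    Candidate("alice", available=True, open_tasks=3),
    Candidate("bob", available=True, open_tasks=1),
    Candidate("carol", available=False),
]
assigned = dispatch("High", team)
print(assigned.name if assigned else "unassigned")
```

In this sketch the urgency label is only validated; a fuller version might use it to order a queue before dispatching.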

I did use AI to help write the code, to speed things up and cut down on labor.

It works on an RTX 3080 system (a basic, affordable setup, not heavy AI machinery), and I want to make it reliable without a heavy upgrade. It is a full system that doesn't require Ollama (I am not against it).

In companies, I see tickets raised on Jira and Slack. Currently people, or the manager themselves, have to sort them, either by reading them manually one by one or by sending them to the cloud. The problem is that you can't send everything: there is a lot of sensitive data that companies don't trust the cloud with, which makes it harder, and manually sorting through thousands of tickets is a nightmare.

Now imagine you get every task classified by urgency and distributed, so you can selectively see which tasks are urgent and need immediate attention. Above all, the information never leaves your building, so it stays totally secure. Sending data to an API is not the only issue either: you pay a per-token cost for every task, maybe $100 to $1,000 a month, so running locally can save startups and larger companies a lot of hassle.

There were several biases, such as positional bias and JSON output bias, plus issues with attention. At first I tried just prompting: chain of thought, RISE (evaluate negatives first), negative examples, and positive examples. In some places it struggled with common sense, so I added examples for that. (I later changed the approach.)

Prompting did produce the output and worked well, but it took too long: about 70 to 90 seconds for a single task.

Then I tried batching, and the biases got worse, that is, stronger. The model would always favor Alice, and more of the prompt instructions were ignored.

For JSON output I used constraints so the model can only generate JSON, and if that fails there is also a fallback parser, which I wrote back when I was using prompting only.
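A fallback parser of this kind could look roughly like the sketch below. It is a guess at the idea, not the repository's parser: the function name `parse_model_output` and the `urgency` and `assignee` field names are assumptions made for illustration.

```python
import json
from typing import Optional

VALID_URGENCY = {"Low", "Medium", "High"}  # labels taken from the post


def parse_model_output(text: str) -> Optional[dict]:
    """Pull the first valid JSON object out of free-form model text, or return None."""
    decoder = json.JSONDecoder()
    start = text.find("{")
    while start != -1:
        try:
            # raw_decode parses one JSON value starting at `start` and ignores trailing text.
            obj, _ = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            obj = None
        # "urgency" is a hypothetical field name used for validation here.
        if isinstance(obj, dict) and obj.get("urgency") in VALID_URGENCY:
            return obj
        start = text.find("{", start + 1)
    return None


raw = 'Sure! Here is the result: {"urgency": "High", "assignee": "bob"} Hope that helps.'
print(parse_model_output(raw))
```

Returning `None` instead of raising lets the caller decide whether to retry generation or flag the task for manual review.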

This reduced the time from 90 seconds to roughly 15 to 30 seconds per task. I used steering vectors to correct the attention issues I had seen.

Stack:
Language: Python 3.10
Model: qwen2.5-7b-instruct
Libraries: PyTorch, Hugging Face Transformers (no LangChain, no Ollama)
API: FastAPI
UI: NiceGUI
Hardware: Ryzen 5, 16 GB RAM, RTX 3080

Implementation:

Quantization: The model is loaded in NF4 quantization so that 7B models fit in the 10 GB of VRAM on an RTX 3080, which is also my hardware.
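To see why 4-bit loading matters on a 10 GB card, here is a back-of-the-envelope estimate of weight memory. The parameter count of about 7.6 billion is an assumption for a "7B" model like this one, and the calculation ignores quantization metadata, activations, and the KV cache, so real usage is higher.

```python
def weight_memory_gib(n_params: float, bits_per_param: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    return n_params * bits_per_param / 8 / 2**30


N_PARAMS = 7.6e9   # assumed approximate parameter count, not taken from the post
VRAM_GIB = 10      # RTX 3080 VRAM as stated in the post

fp16 = weight_memory_gib(N_PARAMS, 16)  # half precision: 2 bytes per weight
nf4 = weight_memory_gib(N_PARAMS, 4)    # 4-bit: half a byte per weight

print(f"fp16 weights: {fp16:.1f} GiB, fits in {VRAM_GIB} GiB: {fp16 < VRAM_GIB}")
print(f"nf4 weights:  {nf4:.1f} GiB, fits in {VRAM_GIB} GiB: {nf4 < VRAM_GIB}")
```

With Hugging Face Transformers, 4-bit loading of this kind is typically configured through `BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_quant_type="nf4")`; whether the project uses exactly these settings is an assumption.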

Steering Vectors: Standard prompting wasn't enough. I needed to block or direct certain things at a specific layer of the LLM to make it reliable.
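The post does not say how the steering vectors are built, so the following is only a toy sketch of one common recipe: take the difference between mean activations for "wanted" and "unwanted" behavior at a layer, then add a scaled copy of it to that layer's hidden state. The function names, the 3-dimensional activations, and the `alpha` scale are all made up for illustration.

```python
def mean_vector(rows):
    """Element-wise mean of equal-length activation vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def steering_vector(wanted_acts, unwanted_acts):
    """Direction pointing from the mean 'unwanted' activation to the mean 'wanted' one."""
    wanted, unwanted = mean_vector(wanted_acts), mean_vector(unwanted_acts)
    return [w - u for w, u in zip(wanted, unwanted)]


def apply_steering(hidden, vector, alpha=1.0):
    """Shift one layer's hidden state along the steering direction."""
    return [h + alpha * v for h, v in zip(hidden, vector)]


# Toy 3-dimensional activations standing in for one layer's hidden states.
wanted = [[1.0, 0.0, 2.0], [3.0, 0.0, 2.0]]
unwanted = [[0.0, 1.0, 2.0], [0.0, 3.0, 2.0]]

v = steering_vector(wanted, unwanted)
steered = apply_steering([0.5, 0.5, 0.5], v, alpha=0.5)
print(v, steered)
```

In a PyTorch model the same addition would usually be done inside a forward hook registered on the chosen layer, so it runs on every generation step.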

JSON Constraints: I used constraints to make the model output strict JSON and to stop it from over-explaining. This happens at the logits level, where tokens that are not allowed are blocked.
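Blocking tokens at the logits level can be pictured with a toy example like the one below. This is a simplified sketch with a five-token vocabulary and hypothetical helper names, not the project's constraint code: disallowed tokens get a score of negative infinity, so greedy selection can never choose them.

```python
import math


def mask_logits(logits, allowed_ids):
    """Set every token outside allowed_ids to -inf so it can never be picked."""
    return [x if i in allowed_ids else -math.inf for i, x in enumerate(logits)]


def greedy_pick(logits):
    """Index of the highest-scoring token."""
    return max(range(len(logits)), key=lambda i: logits[i])


# Toy vocabulary; a real tokenizer has tens of thousands of entries.
vocab = ["Sure", "{", '"urgency"', "Here", "}"]
logits = [5.0, 2.0, 1.0, 4.0, 0.5]  # unconstrained, the model prefers "Sure"

# At the start of a JSON object, only "{" is a valid first token.
allowed = {vocab.index("{")}

unconstrained = vocab[greedy_pick(logits)]
constrained = vocab[greedy_pick(mask_logits(logits, allowed))]
print(unconstrained, constrained)
```

A real implementation would update the allowed set after every generated token according to a JSON grammar or schema. In Hugging Face Transformers, a custom `LogitsProcessor` passed to `generate` is one place such masking can be plugged in.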

github : https://github.com/resilientworkflowsentinel/resilient-workf...

Youtube: https://youtu.be/tky3eURLzWo
