frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Simular Agent S hits 72.6% success on 369 real computer tasks (human: 72.36%)

https://os-world.github.io/
1•taro666•48s ago•0 comments

Eng stack for a typical 2-person startup today

1•LunarFrost88•1m ago•0 comments

Show HN: Let ChatGPT create interactive forms and surveys for you

https://youropinion.is/import
1•aarnelaur•2m ago•0 comments

GPT Image 1.5

https://platform.openai.com/docs/models/gpt-image-1.5
1•charlierguo•2m ago•0 comments

Ask HN: Does your LLM respond and work in your personality?

1•taariqserendb•2m ago•0 comments

Ruby and Its Neighbors: Perl

https://noelrappin.com/blog/2025/10/ruby-and-its-neighbors-perl/
1•bolangi•3m ago•1 comments

Androids: The Team That Built the Android Operating System

https://www.audiobooks.com/promotions/promotedBook/601285/androids-the-team-that-built-the-androi...
1•Olshansky•4m ago•1 comments

OpenAI Rolls Back ChatGPT's Model Router System for Most Users

https://www.wired.com/story/openai-router-relaunch-gpt-5-sam-altman/
1•coloneltcb•4m ago•0 comments

A Little Bit Uncomfortable

https://www.allthingsdistributed.com/2025/12/a-little-bit-uncomfortable.html
1•cebert•5m ago•0 comments

Shinzo: Complete observability platform for AI Agents and MCP servers

https://github.com/shinzo-labs/shinzo
1•Olshansky•5m ago•0 comments

Volkswagen to close German plant, a first in their company history

https://www.nytimes.com/2025/12/16/business/volkswagen-dresden-factory-closed.html
1•OgsyedIE•6m ago•0 comments

Arduino UNO Q bridges high-performance computing with real-time control

https://www.arduino.cc/product-uno-q/
1•doener•7m ago•0 comments

Tools for detecting AI generated content

https://nikitanamjoshi.substack.com/p/tools-for-detecting-ai-generated
1•saikatsg•8m ago•0 comments

The Geek Within Ep136: Dan Moore (2023) [video]

https://www.youtube.com/watch?v=1C1q-o6DtPU
1•mooreds•8m ago•0 comments

Determinate Nix 3.0

https://determinate.systems/blog/determinate-nix-30/
2•todsacerdoti•8m ago•0 comments

Show HN: QKV Core – Run 7B LLMs on 4GB VRAM via surgical memory alignment

https://github.com/QKV-Core/QKV-Core
1•broxytr•9m ago•0 comments

Amazon shareholders call for report on AWS use in Gaza and by US ICE

https://www.datacenterdynamics.com/en/news/amazon-shareholders-call-for-report-on-aws-use-in-gaza...
4•giuliomagnifico•10m ago•0 comments

The new ChatGPT Images is here

https://openai.com/index/new-chatgpt-images-is-here/
2•meetpateltech•10m ago•0 comments

Claude was used for a cyber campaign – Im making it open-source

https://www.trysonder.ai
1•quantbagel•11m ago•1 comments

All AI videos are harmful

https://idiallo.com/blog/all-ai-videos-are-harmful
4•firefoxd•12m ago•0 comments

Brief note on hand-drawing and Lie groups

https://nigelvr.github.io/post-2.html
1•nigelvr•12m ago•0 comments

Ask HN: 5th Month of Unemployment and Still No Job

2•TylerJaacks•13m ago•1 comments

Europe Is in Decline. Good.

https://www.nytimes.com/2025/12/16/opinion/europe-decline-economy-china.html
1•mitchbob•15m ago•1 comments

Visualizing your path to FIRE and other financial goals (Free and Open Source)

https://futurestepsfinance.netlify.app
1•jsoclarke•15m ago•1 comments

Exchange Online ActiveSync Device Support Update

https://techcommunity.microsoft.com/blog/exchange/exchange-online-activesync-device-support-updat...
1•croes•15m ago•0 comments

Workbox: JavaScript Libraries for Progressive Web Apps

https://github.com/GoogleChrome/workbox
1•nateb2022•16m ago•0 comments

Show HN: Digital Ski Bootfitter

https://wayfinderboots.com/
2•brucebotsford•17m ago•0 comments

Collaborative Web Drawing / Image Tool

https://www.multipaint.net/
1•MugCostanza•17m ago•0 comments

Automating a Browser with Anthropic's Computer Use to Play Tic-Tac-Toe [video]

https://www.youtube.com/watch?v=dFONF4PqCLw
1•cbromann•17m ago•1 comments

Dell preps price hikes up to 30% citing memory pricing 'out of our control'

https://www.tomshardware.com/laptops/dell-preps-massive-price-hikes-up-to-30-percent-citing-memor...
2•speckx•18m ago•0 comments
Open in hackernews

ArkhamMirror: Airgapped investigation platform with CIA-style hypothesis testing

https://github.com/mantisfury/ArkhamMirror
94•ArkhamMirror•8h ago

Comments

ArkhamMirror•8h ago
I got tired of expensive SaaS tools that want my sensitive documents in their cloud. I built ArkhamMirror to do forensic document analysis 100% locally, free and open source.

What makes this different:

Air-gapped: Zero cloud dependencies. Uses local LLMs via LM Studio (Qwen, etc.)

ACH Methodology: Implements the CIA's "Analysis of Competing Hypotheses" technique which forces you to look for evidence that disproves your theories instead of confirming them

Corpus Integration: Import evidence directly from your documents with source links

Sensitivity Analysis: Shows which evidence is critical, so if it's wrong, would your conclusion change?

The ACH feature just dropped with an 8-step guided workflow, AI assistance at every stage, and PDF/Markdown/JSON export with AI disclosure flags. It's better than what any given 3-lettered agency uses.

Tech stack: Python/Reflex (React frontend), PostgreSQL, Qdrant (vectors), Redis (job queue), PaddleOCR, Spacy NER, BGE-M3 embeddings.

All MIT licensed. Happy to answer questions about the methodology or implementation! Intelligence for anyone.

Links: Repo https://github.com/mantisfury/ArkhamMirror

ACH guide with screenshots at https://github.com/mantisfury/ArkhamMirror/blob/reflex-dev/d...

V__•5h ago
What field are you in, sounds interesting that one would need such a tool?
cess11•5h ago
Description on the repo says it's for journalism, but I build similar rigs that I use for research in companies that have entered bankruptcy proceedings.

Commonly there is a lot of information and it might as well be unstructured, and then I need to get answers quickly because my clients aren't going to pay me for going about it slowly.

ArkhamMirror•5h ago
It's mainly useful for journalism purposes, yes. Audit and compliance uses were also a consideration. It's a unified tool for right now, but I'm working on turning the base of it into the frame and adding individual shards for specialized applications.
ArkhamMirror•5h ago
It's not just for people doing interesting things. It just helps people answer questions about stuff. The stuff can be interesting or boring or dangerous or silly. The last question I tested the ACH tool on was "Did William Shakespeare really author all of the works he was credited for?" - You can use this stuff to research whatever you want. That's the point of it - it's no one's business what you are interested in getting to the bottom of.
daft_pink•3h ago
Ironically, is there a way to try this out in the cloud for people who want this tool who aren’t hyper worried about security?

It looks cool.

ArkhamMirror•1h ago
Thanks, glad to hear it!

Short answer - no, not right now.

However, instead of going through locally hosted docker and local LLMs, you could reroute it wherever you like, but I don't have a cloud option set up at this time.

I'm focused on the developing the local, private applications myself, but nothing is stopping someone from hooking it up to stronger cloud-based stuff if they want.

The good news is that my plans for this include making it more modular, so people have better options for what it does and how powerful it is.

ckbkr10•6h ago
The idea is good. I do think that is going to be the future for high volume data leaks like the Snowden or Epstein files.

I do think though that this approach will become annoying quick:

https://github.com/mantisfury/ArkhamMirror/blob/main/scripts...

ArkhamMirror•5h ago
The cheesy noir persona is for the AI assisted install and that's it. Inside the app, the prompts are strictly business. (They still have roles, but not "characters" or "personas").
Theofrastus•6h ago
It's always interesting to stumble upon a bubble you never heard of.

This is super interesting. I will probably (hopefully?) never need to use it, but interesting nonetheless. It also makes sense to have this type of application airgapped. Journalists need to have near-perfect OPSEC depending on what they are working on.

ArkhamMirror•5h ago
Thanks for the interest! I agree, the less people that need it the better, but I want it to exist just in case.
ArkhamMirror•5h ago
In case it wasn't clear, the ACH update is on the reflex-dev branch -

https://github.com/mantisfury/ArkhamMirror/tree/reflex-dev

Garlef•4h ago
I'm wondering if the ACH Methodology could be used as a general purpose Chain-of-Thought variant.
ArkhamMirror•4h ago
Probably - LLMs definitely benefit from having decision-making frameworks. ACH is a super-widely useful tool, so I don't see why you couldn't tune an AI with it too.
smallerfish•4h ago
A video demo would be useful. I can't really tell how much the application is doing from the screenshots. Is it a tool with some smart guidance, or is it doing deep magic?
ArkhamMirror•4h ago
I didn't think a video would be very exciting. It did feel like deep magic when I tested it though. For the scenario in the screenshots, I provided the question, "Did we really land a man on the moon?" and the null hypothesis "We landed on the moon in 1969", and the low value piece of evidence "My dad told me he saw Stanley Kubrick's moon landing set one time and he never lies." Literally everything else the LLM generated on demand for me based on its existing training data, offline. It gave me hypotheses, challenges, evidence, filled out the matrix, did the calculations, everything.
darkwater•4h ago
And the answer was... ? :)
ArkhamMirror•4h ago
Well, based on the evidence provided against our competing hypotheses, The least problematic hypothesis is that we landed on the moon in 1969. Second least problematic hypothesis was "The Apollo 11 mission was a hoax staged by NASA and the U.S. government for public relations and Cold War propaganda, but the moon landing itself was real — only the public narrative was fabricated." Third least problematic was "The Apollo 11 mission was a real event, but the moon landing was not achieved by humans — it was an automated robotic mission that was misinterpreted or falsely attributed to astronauts due to technical errors or media misreporting." - The winning hypothesis had a score of 0 (lower is better), second place had a score of 6 (out of possible 10 for our evidence set), and third place had a score of 8. There was also a tie for 4th place "It was just a government coverup to protect the firmament. There is no "outer space."" and "The Apollo 11 mission never occurred; all evidence — including photos, video, and lunar rocks — was fabricated in secret laboratories using early 20th-century special effects and staged experiments, possibly by a small group of scientists and engineers working under government contract." - both of these scored 10 out of 10, making them the most problematic. Sorry guys.
stocksinsmocks•1h ago
Yeah, I’m pretty sure you need to sharpen your pencil on this one if the conclusion was that the Apollo program was legitimate.
ArkhamMirror•1h ago
I'm sure if the right evidence were submitted and run against the right hypotheses a different frontrunner could emerge. Remember - this is a tool to help you investigate better and figure out what to look for, not a tool that tells you the answer. It helps you eliminate unlikely answers more than it ever points at the "right" answer, and even the most unlikely answers can still be the "right" ones! Hang in there
afro88•2h ago
> Literally everything else the LLM generated on demand for me based on its existing training data, offline

That's a ton of scope for hallucinations, surely?

ArkhamMirror•1h ago
It would be enough to drive most local LLMs crazy if it tried to generate it all at once or if it was all part of one long session, but it's set up so the LLM doesn't have to produce much at a time. I only batch in small groups (like it will generate only 3 suggestions per request) and the session is refreshed between calls, and the output is generally force structured to fit correctly into the expected format. You can, however, ask for new batches of suggestions or conflicts or evidence more than once. Hallucinations can happen for any LLM use of course, but if they break the expected structure the output is generally thrown out. Even the matrix scoring suggestion - it works on the whole row, but behind the scenes the LLM is asked to return one response in one "chat" session per column, and then they are all entered at the same time once all of them have been individually returned. That way, if the LLM does hallucinate for the score, it outputs a neutral response for that cell and doesn't corrupt any of the neighboring cells.

If you use a smaller model with smaller context, it might be more prone to hallucinations and provide less nuanced suggestions, but the default model seems to be able to handle the jobs pretty well without having to regenerate output very often (it does happen sometimes, but it just means you have to run it again.) Also, depending on the model, you might get less variety or creativity in suggestions. It's definitely not perfect, and it definitely shouldn't be trusted to replace human judgement.

nilamo•3h ago
That logo is like concentric rings of power around Galadriel's seer-pool, looking at... Hogwarts?
ArkhamMirror•3h ago
Supposed to be the Mirror of Galadriel showing Arkham asylum. Just joshing on Palantir Gotham a little bit
sloped•3h ago
This looks interesting, and honestly makes me want to fire up The Roottrees are Dead and see if I can use this to solve the second act.
ArkhamMirror•3h ago
That would be a cool test - let me know if you decide to do it!
ChrisbyMe•1h ago
Interesting tool, do you have some domain knowledge as an analyst or something similar? I've always been curious what research tools analysts are using outside of like, Google.
ArkhamMirror•45m ago
Thank you, I'm glad it's gathered some interest!

I don't have any background as an analyst or anything like that. ACH is a real tool, really used by the CIA, and the existing versions are basically crappy spreadsheets, or not free, or both.

I don't doubt someone with coding skills could do it better, it's just that no one else has stepped up. Probably because there's no profit angle, but that's conjecture on my part.

0xdeadbeefbabe•42m ago
Doesn't ACH also constrain hypothesis generation in certain ways?
ajcp•37m ago
This is very compelling, very nice work!

I really would like to know how good this would be for a corporate Internal Audit workflow/professional.

ArkhamMirror•13m ago
This feature update is all about ACH, but there are several other functions that might also be of use for doing audit or compliance work.

Is there any particular function you had in mind?

ArkhamMirror can also scan your corpus for near duplicates, clusters, can check for signs of people using copy-paste in their work, find designated red flags, regex data, and that sort of thing. It's really generalized for as many use cases as possible at this stage, and I'm about to start working on modularity for specialization soon, so feel free to make suggestions on how you'd want to use it.

gosub100•27m ago
Is this "investigation platform" any different from legal "e-discovery" software products? It's a great accomplishment either way, but I am posting so other people know that lawyers use this stuff all the time and there are many (paid) off the shelf options.
jerlendds•18m ago
Beautiful work and it's always nice to see new projects in these spaces! I'm the creator of OSINTBuddy which is a somewhat similar project if you squint haha. We've just recently finished porting our web app to an electron binary (unreleased) for people who perform sensitive investigations (aka we have encryption at rest via Turso database) and collaboration features will be done via WebRTC + a signalling server.

I'm loving the approach you took to the UI! I had some similar ideas in mind and plan to build narrative reconstruction and timeline view tools too so it's really nice to see how others have done so! I'll definitely be following your work and I shared your project in the OSINTBuddy discord to hopefully get some more eyes on it :)

Great work, I hope you keep at it :)

ArkhamMirror•1m ago
That's awesome, thank you so much for getting more eyeballs on it!

My approach to security so far has been to keep it air-gapped and include a nukeitfromorbit.bat that will do everything but physically destroy your SSD to keep your privacy intact.

The narrative reconstruction tool was pretty fun to make, and it's been impressive in testing, but the real test will be if it actually helps someone in a real investigation.

If you see anything in my project that could help your project, then that's awesome news to me!

I'm definitely going to keep working, and hopefully soon it's going to do some pretty cool stuff. All the best to you and OSINTBuddy

VerifiedReports•5m ago
This looks very interesting. I already have Python and Docker set up the way I want. Will the installer mess with them?