frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Sid Meier's System for Real-Time Music Composition and Synthesis

https://patents.google.com/patent/US5496962A/en
1•GaryBluto•6m ago•1 comments

Show HN: Slop News – HN front page now, but it's all slop

https://dosaygo-studio.github.io/hn-front-page-2035/slop-news
3•keepamovin•7m ago•1 comments

Show HN: Empusa – Visual debugger to catch and resume AI agent retry loops

https://github.com/justin55afdfdsf5ds45f4ds5f45ds4/EmpusaAI
1•justinlord•10m ago•0 comments

Show HN: Bitcoin wallet on NXP SE050 secure element, Tor-only open source

https://github.com/0xdeadbeefnetwork/sigil-web
2•sickthecat•12m ago•1 comments

White House Explores Opening Antitrust Probe on Homebuilders

https://www.bloomberg.com/news/articles/2026-02-06/white-house-explores-opening-antitrust-probe-i...
1•petethomas•12m ago•0 comments

Show HN: MindDraft – AI task app with smart actions and auto expense tracking

https://minddraft.ai
2•imthepk•17m ago•0 comments

How do you estimate AI app development costs accurately?

1•insights123•18m ago•0 comments

Going Through Snowden Documents, Part 5

https://libroot.org/posts/going-through-snowden-documents-part-5/
1•goto1•19m ago•0 comments

Show HN: MCP Server for TradeStation

https://github.com/theelderwand/tradestation-mcp
1•theelderwand•21m ago•0 comments

Canada unveils auto industry plan in latest pivot away from US

https://www.bbc.com/news/articles/cvgd2j80klmo
2•breve•22m ago•1 comments

The essential Reinhold Niebuhr: selected essays and addresses

https://archive.org/details/essentialreinhol0000nieb
1•baxtr•25m ago•0 comments

Rentahuman.ai Turns Humans into On-Demand Labor for AI Agents

https://www.forbes.com/sites/ronschmelzer/2026/02/05/when-ai-agents-start-hiring-humans-rentahuma...
1•tempodox•27m ago•0 comments

StovexGlobal – Compliance Gaps to Note

1•ReviewShield•30m ago•1 comments

Show HN: Afelyon – Turns Jira tickets into production-ready PRs (multi-repo)

https://afelyon.com/
1•AbduNebu•31m ago•0 comments

Trump says America should move on from Epstein – it may not be that easy

https://www.bbc.com/news/articles/cy4gj71z0m0o
5•tempodox•31m ago•2 comments

Tiny Clippy – A native Office Assistant built in Rust and egui

https://github.com/salva-imm/tiny-clippy
1•salvadorda656•36m ago•0 comments

LegalArgumentException: From Courtrooms to Clojure – Sen [video]

https://www.youtube.com/watch?v=cmMQbsOTX-o
1•adityaathalye•39m ago•0 comments

US moves to deport 5-year-old detained in Minnesota

https://www.reuters.com/legal/government/us-moves-deport-5-year-old-detained-minnesota-2026-02-06/
6•petethomas•42m ago•2 comments

If you lose your passport in Austria, head for McDonald's Golden Arches

https://www.cbsnews.com/news/us-embassy-mcdonalds-restaurants-austria-hotline-americans-consular-...
1•thunderbong•46m ago•0 comments

Show HN: Mermaid Formatter – CLI and library to auto-format Mermaid diagrams

https://github.com/chenyanchen/mermaid-formatter
1•astm•1h ago•0 comments

RFCs vs. READMEs: The Evolution of Protocols

https://h3manth.com/scribe/rfcs-vs-readmes/
3•init0•1h ago•1 comments

Kanchipuram Saris and Thinking Machines

https://altermag.com/articles/kanchipuram-saris-and-thinking-machines
1•trojanalert•1h ago•0 comments

Chinese chemical supplier causes global baby formula recall

https://www.reuters.com/business/healthcare-pharmaceuticals/nestle-widens-french-infant-formula-r...
2•fkdk•1h ago•0 comments

I've used AI to write 100% of my code for a year as an engineer

https://old.reddit.com/r/ClaudeCode/comments/1qxvobt/ive_used_ai_to_write_100_of_my_code_for_1_ye...
2•ukuina•1h ago•1 comments

Looking for 4 Autistic Co-Founders for AI Startup (Equity-Based)

1•au-ai-aisl•1h ago•1 comments

AI-native capabilities, a new API Catalog, and updated plans and pricing

https://blog.postman.com/new-capabilities-march-2026/
1•thunderbong•1h ago•0 comments

What changed in tech from 2010 to 2020?

https://www.tedsanders.com/what-changed-in-tech-from-2010-to-2020/
3•endorphine•1h ago•0 comments

From Human Ergonomics to Agent Ergonomics

https://wesmckinney.com/blog/agent-ergonomics/
1•Anon84•1h ago•0 comments

Advanced Inertial Reference Sphere

https://en.wikipedia.org/wiki/Advanced_Inertial_Reference_Sphere
1•cyanf•1h ago•0 comments

Toyota Developing a Console-Grade, Open-Source Game Engine with Flutter and Dart

https://www.phoronix.com/news/Fluorite-Toyota-Game-Engine
2•computer23•1h ago•0 comments
Open in hackernews

Meta Segment Anything Model 3

https://ai.meta.com/blog/segment-anything-model-3/?_fb_noscript=1
178•alcinos•2mo ago

Comments

trevorhlynn•2mo ago
This was front page for a while last week

https://news.ycombinator.com/item?id=45982073

stronglikedan•2mo ago
what is old is new again
dang•2mo ago
Thanks! Macroexpanded:

Meta Segment Anything Model 3 - https://news.ycombinator.com/item?id=45982073 - Nov 2025 (133 comments)

p.s. This was lobbed onto the frontpage by the second-chance pool (https://news.ycombinator.com/item?id=26998308) and I need to make sure we don't end up with duplicate threads that way.

Workaccount2•2mo ago
I do a test on multimodal LLMs where I show them a dog with 5 legs, and ask them to count how many legs the dog has. So far none of them can do it. They all say "4 legs".

Segment anything however was able to segment all 5 dog legs when prompted to. Which means that meta is doing something else under the hood here, and may lend itself to a very powerful future LLM.

Right now some of the biggest complaints people have with LLMs stems from their incompetence processing visual data. Maybe meta is onto something here.

jampekka•2mo ago
Segmentation doesn't need to count legs. I'd guess something like YOLO could segment 5 legged dogs too.
chompychop•2mo ago
YOLO is not a segmentation model.
jampekka•2mo ago
https://docs.ultralytics.com/tasks/segment/
lucasban•2mo ago
I thought it was a joke about YAML
chompychop•2mo ago
Thanks! TIL there's a class of segmentation models with the YOLO naming scheme.
Der_Einzige•2mo ago
Lol you obviously haven't seen what cheats for FPS games look like in the last 3 years.

https://github.com/Babyhamsta/Aimmy

PunchTornado•2mo ago
I doubt that gemini 3 cannot do it.
nerdsniper•2mo ago
You don’t need segmentation to count legs. Object detection can do that. DeepLabCut from 2020 perhaps.
the_duke•2mo ago
Side question: what are the current top goto open models for image captioning and building image embeddings dbs, with somewhat reasonable hardware requirements?
NitpickLawyer•2mo ago
Try any of the qwen3-vl models. They have 8, 4 and 2B models in this family.
Glemkloksdjf•2mo ago
I would suggest YOLO. Depending on your domain, you might also finetune these models. Its relativly easy as they are not big LLMs but either image classification or bounding boxes.

I would recommend bounding boxes.

smallerize•2mo ago
Which YOLO?
Glemkloksdjf•2mo ago
Any current one. they are easy to use and you can just benchmark them yourself.

I'm using small and medum.

Also the code for using it is very short and easy to use. You can also use ChatGPT to generate small exepriments to see what fits your case better

throwaway314155•2mo ago
There aren’t any YOLO models for captioning and the other models aren’t robust enough to make for good embedding models.
Glemkloksdjf•2mo ago
You can get labels out of the classifier and bounding box models.

They are super fast.

Its just an alternative i'm mentioning. I would assume a person knowing a little bit of that domain.

Otherwise the first option would be CLIP i assume. llm-vl is just super slow and compute intensive.

jabron•2mo ago
What do you mean "bounding boxes"? They were talking about captions and embeddings, so a vision language model is required.
Glemkloksdjf•2mo ago
I suggested YOLO and non llm-vl as a lot faster alternative.

Of course CLIP would be otherwise the other option than a big llm-vl one.

daemonologist•2mo ago
For pure image embedding, I find DINOv3 to be quite good. For multimodal embedding, maybe RzenEmbed. For captioning I would use a regular multimodal LLM, Qwen 3 or Gemma 3 or something, if your compute budget allows.
vessenes•2mo ago
Released last week. Looks like all the weights are now out and published. Don’t sleep on the SAM 3D series — it’s seriously impressive. They have a human pose model which actually rigs and keeps multiple humans in a scene with objects, all from one 2D photo (!), and their straight object 3D model is by far the best I’ve played with - it got a really very good lamp with translucency and woven gems in usable shape in under 15 seconds.
nl•2mo ago
https://ai.meta.com/blog/sam-3d/ for those interested.
Fraterkes•2mo ago
Are those the actual wireframes they're showing in the demos on that page? As in, do the produced models have "normal" topology? Or are they still just kinda blobby with a ton of polygons
seanw265•2mo ago
I haven’t tried it myself, but if you’re asking specifically about the human models, the article says they’re not generating raw meshes from scratch. They extract the skeleton, shape, and pose from the input and feed that into their HMR system [0], which is a parametric human model with clean topology.

So the human results should have a clean mesh. But that’s separate from whatever pipeline they use for non-human objects.

[0]: https://github.com/facebookresearch/MHR

daemonologist•2mo ago
For the objects I believe they're displaying Gaussian splats in the demo, but the model itself can also produce a proper mesh. The human poses are meshes (it's posing and adjusting a pre-defined parametric model).
vessenes•2mo ago
I’ve only used the playground. But I think they are actual meshes - they don’t have any of the weird splat noise at the edge of the objects, and they do not seem to show similar lighting artifacts to a typical splat rendering.
Qwuke•2mo ago
Between this and DINOv3, Meta is doing a lot for the SOTA even if Llama 4 came up short compared to the Chinese models.
visioninmyblood•2mo ago
you can download them at https://github.com/facebookresearch/sam3. for 3d https://github.com/facebookresearch/sam-3d-objects
retinaros•2mo ago
I looked quickly but it does not generate a 3d model file right?
phkahler•2mo ago
Which (if any) of these models could run on a RaspberryPi for object recognition at several FPS?
enoch2090•2mo ago
Surprisingly, SAM3 works bad on engineering drawings while SAM2 kinda works, and VLMs like Qwen3-VL works as well
retinaros•2mo ago
yeah I tried too. Im trying a fine tuning on PIDs.
enoch2090•2mo ago
Looking forward to your progress! Just checked the paper and it says the underlying backbone is still DETR. My guess would be that SAM3 uses more video frames during the training process and caused the dilution of sparse engineering-paper-like data.
zubiaur•2mo ago
Had good luck with Gemini 2.5, SAM3 failed miserably with PIDs.
shashanoid•2mo ago
Miss the old segment anything page, used it a lot. This UI I found very complex to use
bradyriddle•2mo ago
Same.

Checkout https://github.com/MiscellaneousStuff/meta-sam-demo

It's a rip of the previous sam playground. I use it for a bunch of things.

Sam 3 is incredible. I'm surprised it's not getting more attention.

stronglikedan•2mo ago
> I'm surprised it's not getting more attention.

Remember, it's not the idea, it's the marketing!

colkassad•2mo ago
Been waiting days to get approval to download this from huggingface. What's up with that?
knicholes•2mo ago
I was approved within about 10 minutes for "Segment Anything 3"
observationist•2mo ago
Alternative downloads exist. You can find torrents, and match checksums against the HF downloads, but there are also mirrors and clones right there in HF which you can download without even having to log in.
colkassad•2mo ago
Thanks, got it and it's working wonders for my use case.
tschellenbach•2mo ago
same here, didn't get approval
cheesecompiler•2mo ago
This would be convenient for post-production and editing of video, e.g. to aid colour grading in Davinci Resolve. Currently a lot of manual labour goes into tracking and hand-masking in grading.
aliljet•2mo ago
I wonder how effective this is medical scenarios? Segmenting organs and tumors in cat scans or MRIs?
maelito•2mo ago
I wonder if this can be used to track an object's speed. E.g. a vehicle on a road. It would need to recognize shapes, e.g. car model or average size of a bike, to guess a speed.
vanjoe•2mo ago
For a long time I've wanted to use something like this to remove advertisements from hockey games.The moving ads on the boards are really annoying. Maybe I'll get around to actually doing that one of these days.