Gaussian Splatting Meets ROS2

https://github.com/shadygm/ROSplat
61•shadygm•9mo ago

Comments

arijun•9mo ago
This page is pretty light on the what and why. I gather it’s using ROS (which I had to look up to confirm means robot operating system) to render Gaussian splatting. And that’s faster than a dedicated GPU renderer? Doesn’t ROS add overhead in the form of message passing?
inhumantsar•9mo ago
it's for visualizing a robot's camera data in 3d space
shadygm•9mo ago
Hey! Great question, and thanks for taking a look!

The main idea behind ROSplat is to make it easier to send and visualize Gaussians over the network, especially in robotics applications. For instance, imagine you're running a SLAM algorithm on a mobile robot and generating Gaussians as part of the mapping or localization process. With ROSplat, you can stream those Gaussians via ROS messages and visualize them live on another machine. It’s mostly a visualization tool that uses ROS for communication, making it accessible and convenient for robotics engineers and researchers already working within that ecosystem.

Just to clarify, ROSplat isn’t aiming to be faster than state-of-the-art rendering methods. The actual rendering is done with OpenGL, not ROS, so there’s no performance claim there. ROS is just used for the messaging, which does introduce a bit of overhead, but the benefit is in the ease of integration and live data sharing in robotics setups.

Also, I wrote a simple technical report explaining some things in more detail, you can find it in the repo!

Hope that clears things up a bit!
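
To make the "stream those Gaussians via ROS messages" idea concrete, here is a minimal sketch of packing Gaussians into a flat byte payload of the kind a message field could carry. The 14-float layout, the `Gaussian` class, and the function names are assumptions for illustration only; they are not ROSplat's actual message definition, and no ROS API is used.

```python
import struct
from dataclasses import dataclass

# Hypothetical per-Gaussian layout (an assumption, not ROSplat's format):
# position xyz, scale xyz, rotation quaternion wxyz, opacity, RGB color.
_FMT = "<14f"                  # 14 little-endian float32 values
_SIZE = struct.calcsize(_FMT)  # 56 bytes per Gaussian


@dataclass
class Gaussian:
    position: tuple  # (x, y, z)
    scale: tuple     # (sx, sy, sz)
    rotation: tuple  # quaternion (w, x, y, z)
    opacity: float
    color: tuple     # (r, g, b)


def pack_gaussians(gaussians):
    """Serialize a list of Gaussians into one contiguous byte payload."""
    out = bytearray()
    for g in gaussians:
        out += struct.pack(
            _FMT, *g.position, *g.scale, *g.rotation, g.opacity, *g.color
        )
    return bytes(out)


def unpack_gaussians(payload):
    """Rebuild the Gaussians on the receiving (visualizer) side."""
    if len(payload) % _SIZE != 0:
        raise ValueError("payload length is not a multiple of the record size")
    return [
        Gaussian(v[0:3], v[3:6], v[6:10], v[10], v[11:14])
        for v in struct.iter_unpack(_FMT, payload)
    ]
```

In a real setup the payload would travel inside whatever message type the publisher and the viewer agree on; the point is only that each Gaussian is a small fixed-size record.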

hirako2000•9mo ago
Confused here despite the detailed explanation of the use case.

Today generating a static point cloud with gaussians involves:

- an offline, far-from-realtime process to generate spatial information from 2D captures. LiDAR captures may help but don't drastically cut down this heavy step.
- a "train" step to generate Gaussian information from the 2D captures and geospatial data.

Unless I'm already referring to an antique flow, or my RTX GPU is too consumer grade, how would all of this perform on embedded systems well enough to make fast communication of Gaussians relevant?

shadygm•9mo ago
There are some algorithms, such as Photo-SLAM and Gaussian Splatting SLAM (although far heavier and slower), that show it is indeed possible to estimate position and generate Gaussians in real time. These are definitely still the early days for these techniques tho.

The offline method still generates significantly higher resolution scenes of course, but as time goes on, real-time Gaussian Splatting will become more common and will be close to offline methods.

This means that in the near future, we will be able to generate highly realistic scenes using Gaussian Splats on a smart edge + mobile robot in real-time and pass the splats via ROS onto another device running ROSplat (or other) and perform the visualisation there.

hirako2000•9mo ago
OK. Thanks for your projections.

I generate on GPU. I can barely fit a large scene in 12GB of memory, and it takes many hours to train the Gaussians for 30k steps.

I'm sure the tech will evolve, hardware too. We are just 5y away.

I respect you open sourcing your work, it is innovative. Feels like a trophy splash, I suggest putting a link to something substantial, perhaps a page explaining where the tech will land and how this project fits that future, rather than a link to some LinkedIn.

shadygm•9mo ago
Hey, I appreciate the feedback.

I did not put a LinkedIn link in the post or repo, but I totally get your point about wanting something more substantial to explain the bigger picture.

A lot of the motivation and reasoning behind the project is already included in the technical report PDF attached in the repository, I tried to make it as self-contained as possible for those curious about the background and use cases.

That said, if I find some time, I’ll definitely consider putting together a separate page to outline where I think this kind of tool fits into the broader future of GS and robotics.

Thanks again!

somethingsome•9mo ago
I'm very curious about that. My average training run with ~25-30 high-quality cameras takes around 20 minutes and a few GB of memory on a single GPU. What is the size of your large-scale scenes? I see many possible optimizations to lower the memory footprint and the training time.
markisus•9mo ago
I have done a recent proof of concept to generate Gaussian splats from depth cameras in real-time. The intended application is for robotics and teleoperation. I made a post on reddit [1] a while back if you're interested.

I believe the quality of realtime Gaussian splatting will improve with time. The OP's project could help ROS2 users take advantage of those new techniques. Someone might need to make a Gaussian splat video codec to bring down the bandwidth cost of streaming Gaussians.

Another application could be for visualizing your robot inside a pre-built map, or for providing visual models for known objects that the robot needs to interact with. Photometric losses could then be used to optimize the poses of these known objects.

[1] https://www.reddit.com/r/GaussianSplatting/comments/1iyz4si/...
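
For a feel of the bandwidth cost mentioned above, here is a back-of-the-envelope sketch. It assumes the common 3DGS parameterization (position, scale, rotation quaternion, opacity, spherical-harmonic color) and that every Gaussian is resent on each update; the function names and the example numbers (100k Gaussians, 10 updates per second) are illustrative assumptions, not measurements from ROSplat or any other project.

```python
def bytes_per_gaussian(sh_degree=3, bytes_per_value=4):
    """Size of one Gaussian in the common 3DGS parameterization.

    3 position + 3 scale + 4 rotation + 1 opacity values, plus
    3 * (sh_degree + 1)**2 spherical-harmonic color coefficients.
    """
    sh_values = 3 * (sh_degree + 1) ** 2
    return (3 + 3 + 4 + 1 + sh_values) * bytes_per_value


def stream_bandwidth_mbit(num_gaussians, updates_per_second, **layout):
    """Raw bandwidth in Mbit/s if every Gaussian is resent on each update."""
    total_bytes = num_gaussians * bytes_per_gaussian(**layout) * updates_per_second
    return total_bytes * 8 / 1e6


# Full float32 Gaussians with degree-3 spherical harmonics.
print(bytes_per_gaussian())                    # 236
print(stream_bandwidth_mbit(100_000, 10))      # 1888.0

# Base color only (degree 0) stored as 16-bit values.
print(stream_bandwidth_mbit(100_000, 10, sh_degree=0, bytes_per_value=2))  # 224.0
```

Even this crude model shows why dropping higher-order color terms, quantizing, or sending only changed Gaussians matters before anything like a proper codec exists.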

jimmySixDOF•9mo ago
So I upload a pre-baked GSplat of the ground state physical space, presumably there is some kind of calibration, then I can navigate the ROS device spatially using the GSplat to reflect position details instead of, or in addition to, actual camera feeds ? Or are they producing the splats somehow on the ROS device with limited camera poses ? Whatever the case may be, I still think the human controller side is where Splats are more useful so add a VR headset into the loop and I think this could open up real opportunities for example spatial minimaps, decoupled points of view, etc.
shadygm•9mo ago
Thanks for taking a look!

Just to clarify, ROSplat isn’t generating the Gaussians, it’s not a SLAM algorithm or a reconstruction tool. It’s purely a visualizer that uses ROS for message passing. The idea is that if you already have a system producing Gaussians (either live or precomputed), ROSplat lets you stream and view them in real time (as the ROS messages arrive).

So in your example, yes, you could upload a pre-baked GSplat, calibrate it to the robot’s frame, and use it for navigation or visualization. Or, if your ROS device is running something like SLAM, it can publish Gaussians as it goes. In both cases, ROSplat is just making them available for visualization, nothing more.

And I completely agree with you on your last comment. VR Gaussians are the way to go, I know that a company Varjo is currently working on them. Not sure if there's anything else that's available tho :/
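
As a rough illustration of what "calibrate it to the robot's frame" involves, here is a minimal sketch that applies a rigid transform to a single Gaussian: the mean moves as `R p + t` and the covariance rotates as `R Σ Rᵀ`. The helper names and the yaw-only rotation are assumptions chosen to keep the example small; how the transform itself is estimated is outside this sketch, and none of this is ROSplat code.

```python
import math


def yaw_matrix(yaw):
    """3x3 rotation about the z axis by `yaw` radians."""
    c, s = math.cos(yaw), math.sin(yaw)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
        for i in range(3)
    ]


def transpose(a):
    return [list(row) for row in zip(*a)]


def transform_gaussian(mean, cov, rotation, translation):
    """Map one Gaussian from the splat's frame into the robot's frame."""
    new_mean = [
        sum(rotation[i][k] * mean[k] for k in range(3)) + translation[i]
        for i in range(3)
    ]
    # Covariance transforms as R * cov * R^T; translation does not affect it.
    new_cov = matmul(matmul(rotation, cov), transpose(rotation))
    return new_mean, new_cov


# A Gaussian elongated along x, rotated 90 degrees about z and lifted by 1.
mean, cov = transform_gaussian(
    [1.0, 0.0, 0.0],
    [[4.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]],
    yaw_matrix(math.pi / 2),
    [0.0, 0.0, 1.0],
)
# mean is now approximately [0, 1, 1] and the long axis points along y.
```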

dheera•9mo ago
I've actually been pondering using Gaussian splats for localization, I think it could be done. The idea would be looking for the pose that minimizes the MSE in density (rather than feature points or RGB similarity which are both vulnerable to lighting changes)
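
A toy version of that idea, under heavy simplifying assumptions: the map is a handful of 1D Gaussians, the "pose" is a single scalar position, and the search is a brute-force scan over candidates. All names here are hypothetical, and a real system would optimize a full 6-DoF pose over 3D splats rather than scan a grid.

```python
import math


def density(x, gaussians):
    """Sum of unnormalized 1D Gaussians (center, sigma, weight) at x."""
    return sum(w * math.exp(-0.5 * ((x - c) / s) ** 2) for c, s, w in gaussians)


def localize(observed, sample_offsets, gaussians, candidates):
    """Pick the candidate position whose predicted densities best match.

    `observed[i]` is the density measured at `sample_offsets[i]` relative to
    the unknown position; the score is the mean squared error in density.
    """
    def mse(pos):
        errors = [
            (density(pos + off, gaussians) - obs) ** 2
            for off, obs in zip(sample_offsets, observed)
        ]
        return sum(errors) / len(errors)

    return min(candidates, key=mse)


# Map with two blobs; the sensor sits at x = 1.2 and samples around itself.
world = [(0.0, 0.5, 1.0), (3.0, 1.0, 0.5)]
offsets = [-1.0, -0.5, 0.0, 0.5, 1.0]
measured = [density(1.2 + off, world) for off in offsets]

estimate = localize(measured, offsets, world, [i / 10 for i in range(-20, 41)])
# estimate recovers the true position, 1.2
```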
jimmySixDOF•9mo ago
Varjo are good at whatever they do, but also check out @gracia_vr [1]: they focus on Splats in XR. PlayCanvas also has SuperSplat, which lets you view 3DGS in immersive mode [2].

[1] https://www.gracia.ai/ [2] https://github.com/playcanvas/supersplat

gitroom•9mo ago
Nice, these back and forths always remind me how much cool stuff is brewing behind the scenes. Tbh I'd love seeing more live demos of things like this, helps my brain get what's really happening.
shadygm•9mo ago
Yeah I agree, lack of visuals sometimes makes it hard to tell what's happening when a field moves as fast as it does in GS. There's a github page called Awesome3DGS [1] that is updated whenever there is a new paper in GS. It helped me a lot when I was getting started.

Most papers also have their own project page that showcases their contributions or demo their project as well (:

[1] https://github.com/MrNeRF/awesome-3D-gaussian-splatting
