frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: EgoExo Forge: Data and Utilities Needed for Ego and Exo Human Data

https://pablovela5620-egoexo-forge-viewer.hf.space
2•pablovelagomez•6h ago
Introducing EgoExo Forge - built on top of Rerun, Gradio and Huggingface Hub (I’ll be in San Francisco July 21–29 — if you’re into robotics, egocentric AI, large-scale data collection, or just want to chat, email me at pablovela5620@gmail.com!)

In my opinion, large-scale, diverse, and high-quality data is still the largest bottleneck for generalized robotics deployment. I believe that some version of imitation learning from human examples will be the most scalable + clean way to train humanoid robots (similar to what Tesla did for Full Self driving). Teleop is too expensive to collect a large enough dataset in a reasonable manner, so passive collection via egocentric (and in certain cases, exocentric) views feels like the right bet.

Over the past few months, I've been trying to build out the scaffolding for this and using Rerun as my underlying infrastructure. Data being collected needs to be easily inspectable + time series and rerun provides the right tooling for this.

My goal is to first build out a ground truth representative dataset from already existing open source data, generate some reasonable baselines, and then go out and collect my own data that adheres to the defined schema.

Starting with open-source datasets

1. EgoDex from Apple 2. HOCap from Nvidia and the University of Texas at Dallas 3. Assembly101 from Meta

All these different datasets have different sensor configurations + annotations, so my goal with egoexo-forge is to have one consistent labeling scheme + data layout. I built a data pipeline that aligns all of the different datasets in one general schema assuming the COCO133 keypoint layout that allows for exo+ego, ego only, or exo only

Since the scaffolding is already there, it becomes MUCH easier to add other datasets. So the next ones that I'll be including are HD-EPIC kitchens dataset, HOT3D, and finally my own personal iPhone + insta360 go collection method.

Once I have a diverse variety of datasets, I'll double down on what I believe to be the key algorithms required to make useful data for imitation learning

1. Camera Pose estimation via SLAM/SFM for ego perspective (and automatic calibration for exo) 2. Human pose estimation for both egocentric + exocentric views 3. Metric 3D reconstruction + object tracking

I'll be setting up reasonable open-source baselines for each of these to validate that these datasets work, and then finally try to use the generated datasets for some imitation learning via the pi0-lerobot repo I've been working on.

I plan on making a blog post + providing more info on all of this in the near future so stay tuned

Escaping Groupthink

https://www.thetransmitter.org/animal-behavior/escaping-groupthink-what-animals-behavioral-quirks-reveal-about-the-brain/
1•wjb3•1m ago•0 comments

Southeast Asia's Last Culinary Frontier: The 17,000 Islands of Indonesia

https://www.youtube.com/watch?v=dr3Hsa8Fam4
1•bane•7m ago•0 comments

Show HN: Buzz0.com – Daily curated Show HN posts

https://buzz0.com/
1•Airyisland•7m ago•0 comments

Doing More Is Often Easier

https://www.raptitude.com/2025/04/doing-more-is-often-easier/
2•_vaporwave_•12m ago•0 comments

Civilian hackers in China's military cyber strategy

https://margin.re/mobilizing-cyber-power-the-growing-role-of-cyber-militias-in-chinas-network-warfare-force-structure-2/
3•aaronsdevera•14m ago•0 comments

Deep Dive into Rails Database Connection Pools

https://www.prateekcodes.dev/rails-database-connection-pooling-explained/
1•prateekkish•15m ago•0 comments

Arguing About Woodworking More Popular Hobby Than Woodworking (2013)

http://www.closegrain.com/2013/04/arguing-about-woodworking-more-popular.html
2•ecliptik•18m ago•0 comments

TikTok prepares US app with its own algorithm and user data

https://www.reuters.com/world/china/tiktok-prepares-us-app-with-its-own-algorithm-user-data-2025-07-09/
1•mfiguiere•18m ago•0 comments

Your Prize for Saving Time at Work with AI: More Work

https://www.wsj.com/lifestyle/careers/ai-work-free-time-51c8c92a
10•petethomas•28m ago•4 comments

The case for building operator interfaces before AI agents

https://www.henrypray.com/writings/the-only-saas-feature-you-should-be-building
2•henrypray•32m ago•0 comments

Type-C To Type-C Scented Cable 48in

https://www.fivebelow.com/products/up-tech-type-c-to-type-c-scented-cable-48in-9184770
1•rendx•32m ago•0 comments

Show HN: ColorConJ – Explore Spanish color names by letter

https://colorconj.com/
1•lur0913•33m ago•0 comments

Eval AI jobs new market for Mercor

https://www.gardinercolin.com/p/marketplace-memo-13
1•predogger•34m ago•0 comments

In search of more efficient learning algorithms, researchers look to infants

https://www.thetransmitter.org/neuroai/the-babylm-challenge-in-search-of-more-efficient-learning-algorithms-researchers-look-to-infants/
3•domofutu•37m ago•0 comments

HIV-1 latency reversal via ectopic expression of a viral antisense transcript

https://www.science.org/doi/10.1126/sciadv.adu8014
2•PaulHoule•38m ago•0 comments

Our Missing Pieces

https://docs.google.com/document/d/1-KSIE89xHnipRBm8T6BRbxEQb5_byr5CwkB-S7XIwjQ/edit?tab=t.0
1•jger15•38m ago•0 comments

Claude Code OAuth Authentication Fails - "OAuth account information not found

https://github.com/anthropics/claude-code/issues/1484
1•rakken•40m ago•0 comments

CatchIdeas – Find High-Traffic Keywords for Product and Content Ideas

https://catchideas.com
1•labubulive•40m ago•0 comments

Fact Sheet: Autism Prevalence

https://www.thetransmitter.org/spectrum/prevalence-autism-u-s-remains-steady-new-data-suggest/
2•domofutu•42m ago•0 comments

No Tax on Overtime Calculator

https://notaxonovertimecalculators.org/
1•dond1986•42m ago•0 comments

V0 Platform API now in beta

https://vercel.com/changelog/v0-platform-api-now-in-beta
1•tzury•43m ago•0 comments

Research suggests electricity markets are using suboptimal pricing

https://arxiv.org/abs/2507.06035
1•cfata•44m ago•1 comments

Thoughts on Motivation and My 40-Year Career

https://charity.wtf/2025/07/09/thoughts-on-motivation-and-my-40-year-career/
3•zdw•45m ago•0 comments

Learning in living mice defies classic synaptic plasticity rule

https://www.thetransmitter.org/learning/learning-in-living-mice-defies-classic-synaptic-plasticity-rule/
2•domofutu•46m ago•0 comments

Doctest is a new C++ testing framework

https://github.com/doctest/doctest
2•BiraIgnacio•49m ago•0 comments

Most people who buy your game won't play it

https://howtomarketagame.com/2025/06/03/most-people-who-buy-your-game-wont-play-it/
1•walterbell•54m ago•0 comments

The #1 Reason Your GenAI Project Will Fail in Production

https://www.mlwhiz.com/p/from-prototype-to-production-mlops
1•ai_unwrapped•59m ago•0 comments

Andreessen Horowitz Leaves Delaware for Nevada, Tells Startups to Follow

https://www.bloomberg.com/news/articles/2025-07-09/andreessen-horowitz-leaves-delaware-for-nevada-tells-startups-to-follow
5•pilingual•1h ago•0 comments

Concorde – The 24 Hour World (1973) [video]

https://archive.org/details/concorde-the-24-hour-world
1•petethomas•1h ago•0 comments

Bug report forms powered by AI – No more duplicates, spam or lackluster reports

https://bugspot.dev
1•PaulPlay•1h ago•1 comments