We’re seeing a lot of robotic trials happening in private warehouses and on private test ranges at pretty rapid scale
Beyond that the methods for transfer learning behavior cloning behavior authoring are very robust so that I can get joint angles directly from a human via instrumentation through vision or even commodity sensors which captured trajectories that can be immediately applied to robotic joint positions.
The real challenge is actually capturing demonstration recordings from humans because it’s the hardest thing to instrument. The core task is instrumenting data capture of existing human tasks that are not done through machines, such that they can transfer to machines.
This is easiest done with existing human operated robots because the instrumentation is free, so data can go directly into real2sim2real pipelines.
There might seem counterintuitive but most of the actual technical bits and bites are already there it’s re-orienting the economic and logistical process of labor execution that is the major challenge.
I will say though, I’m seeing less and less barriers there as time goes on. Employers really want to not have to hand human employees
I think technology development can be faster thanks to better AI systems like VLA models, but I do think the time to real deployment will be long.
My pet issue is that the dexterity of the hands is still really poor. A human hand is incredible with what it can do.
I think between the general manipulation tasks, world understanding, and more these systems are still a long ways out for widespread use, though I wouldn’t be surprised if they find niche uses near term.
Vision for cars already includes object detection, and the better that is, the better robot object detection gets. The same for "human ran out on road" would work for "walking in house, small human is now in front of me, stop!".
I wonder how much of the one will port to the other. A house has paths aka "roads", inside and out. Places the robot may walk, and not. So path navigation is a thing too. Maybe 'getting around' is mostly solved, while of course other challenges are still there.
Sort of replying to others in this part, the reason people are all hung up on humanform, is that our entire world is made for humans. Whether stairs, doors, sidewalks, doorknobs, cupboards, or even space to walk in a small kitchen... it's all made to work with human shape and size.
(Yes, while there is wheelchair access mandated, that doesn't extend to the inside of every home, and all the spaces in homes, and even then everything we have is designed to be operated by fingers/arms/hands.)
So if you solve humanform, the robot can go anywhere and manipulate/do anything a human can. That means no change to the environment when you get one. Right or wrong, that's why everyone is after humanform.
They realized just how much of what an autonomous vehicle needs to do to navigate real world roads is similar to what an advanced robot would need to do to operate in real world environments. If they could get anywhere close to solving FSD, it would be an "in" on advanced robotics too.
The triumph of LLMs then made it glaringly obvious that the kind of advanced decision-making that you would need to power truly universal robots is no longer in the realm of science fiction, so a lot more companies followed.
It's useful, don't get me wrong, but when Waymo can handle Cairo and Rome, I'll consider it a solved problem.
Edit: Stated more explicitly: the human world is the way it is because of many reasons and can't always be changed naively (it's not like nobody in Cairo has thought about improving the traffic situation, or architects haven't thought about the ease of cleaning different flooring material). Robots which are general purpose with respect to their human-like capabilities must necessarily also accept a world in which humans live.
A self-driving car only has to do one thing - drive. It's also got a stable wheeled base and only a couple of degrees of freedom - got to steer and regulate it's speed.
Even if the only thing you wanted the humanoid robot to do is drive your car, it'd be massively harder for it since it's got all those degrees of freedom, will be bouncing around in the drivers seat, and presumably doesn't even know how to drive.
If the humanoid is more than a gimick - meant to be general purpose, then it needs an AGI brain and ability to learn for itself. It's not going to be learning in a simulator like your FSD car - it's be learning on the road like your teenage kid.
A generic example, fridges could easily last 40 to 50 years without maintenance. They wouldn't be all that more expensive either. Volvo, and the B-52 bomber program showed this, with Volvo having some models unchanged for 20 years. The B-52 has been in service longer than most people have been alive.
Each time an early wear or failure point is found in the B-52, it is documented, fixed, and rolled out to all B-52s. Their ancient, but more reliable than newer bombers and require less maintenance.
We could do this for everything. Design a fridge, and after 10 years collect the failures and see how they broke. Keep selling the same fridge, the same parts, and eventually it's a rock.
We don't do this, companies don't do this, because it's not best for profit.
So my point is robot maintenance could be minor, and if it was purely a lease model, would remain minor... because the company would profit from lower overall maintenance costs.
Lastly, compare a robot to a car driving 100s of thousands of KM. I've driven new cars to 150000km with almost no failure of any kind (except brakes. tires). So maybe not as bad as thought.
B-52s require regular inspections and maintenance just like any other aircraft. A fridge is less complicated, but it's still a machine. Even my grandfather's clock needed some work done every couple decades, and it didn't contain refrigerant, a compressor, fans, or have to deal with condensation.
A couple grand for gearbox rebuilds every few years, replacement vacuum cups or worn hard tooling as needed, troubleshoot electrical issues as they arise... and your quarter million robot cell ($60k of that is the robot, most of the rest is NRE labor) will only need one human instead of eight to spit out parts every 60 seconds for the next decade.
Unless you think the humanoid robots are going to wear out significantly faster than existing robots, wear and tear costs are negligible.
With tight process controls, turning a work cell that has multiple humans doing manual labor for material handling, fastening, inspection, labeling, etc. into one intelligent human keeping the automation well adjusted is a solved problem. Eliminating that last human - the one that makes decisions instead of moves materials - with a humanoid robot is going to take decades.
In many instances with repairing electronics and home appliances labour is the greatest cost, not the material. Sometimes it's as simple as replacing a 50 cent washer to repair something, or perhaps squirt some lube here or there regularly to prevent something from breaking down.
If it's the same for robot maintenance then robots being able to fix themselves and each other will change the equation on ownership tremendously.
Imagine if everyone had a domestic robot and if it broke down their neighbour's robot could repair it. That would be an extremely user friendly and cheap way to deal with the problem.
"Replace tiny parts" option - Which parts is the manufacturer making available for purchase and what does the supply chain look like for that? What tools are needed to do the disassembly, part installation, and re-assembly? Can a humanoid robot out in the real world replicate the clean room conditions in which delicate components were assembled then sealed inside some compartment so dust can never get to them? Are we going to put heat guns and soldering irons in the fingertips of every humanoid robot to support self repair? There's going to be problems that can't be resolved with the kinds of tools available in the average household.
"Replace modules / components" option - Having to buy a whole new hand when you really wanted to replace a single finger joint impacts the value proposition of self repair, it's not a 50 cent washer it's a $1000 pre-assembled component. The repair is now definitely doable in the field, at least.
You might also be assuming humanoid robot manufacturers would not work specifically against self-repair. They make more money if you buy a new robot, or you pay them to fix your broken robot. Maybe "fix this other robot" ends up on a list of forbidden tasks the robot will always refuse to do...
No reason a robot can't work in a dark cave flooded with radon, and that is going to be cheap real estate.
I would get a roomba but it can't do enough fine detail to be worth it.
We could have chosen that scale of investment in battery technology, certainly, but the finance guys said that it wasn't profitable to invest that much in it. China was willing to invest into this field for national security reasons (the threat was of the USN cutting off oil traffic in the Indian Ocean, well beyond the range of the Chinese military). Semiconductors themselves were the result of (US) national security investment- the Minuteman ICBM guidance system was the first large scale use of IC semiconductors.
Given how quickly battery technology has advanced- and how profitable it has been to build these factories- I think the evidence is that the finance guys were wrong about the profitability and importance of battery research/construction, and the national security investment jump-started the whole thing, just like with semiconductors.
The biggest bottlenecks are raw ingredients, power, and factories. Once the automated manufacturing flywheel gets started, units can be produced very rapidly. Specialized machines produce low-level components, while more generalized machines assemble higher-level components as well as products like themselves and other robots.
People don't factor a human's total compensation beyond an hourly wage.
Machines don't need as much breathing room as humans.
Machines can work a 6-day, 16-hour schedule.
And that's just not the case yet?
In general, you can get a dedicated machine for most human tasks that is easily 10-1000x productivity if you have a few million in capital. There are tasks on the margin where human flexibility and dexterity that having a human operate a $10k sewing machine is going to be very very hard to replace.
I'm not against the concept and I agree the manufacturing can be scaled. There just isn't a product yet.
Humanoid robots feel like they're decades away for being something people would want.
Roombas and lawn robots are all extremely popular.
I'm curious if a cleaning service wasn't an option, would the Roomba be worth the saved time compared to doing the cleaning yourself?
I also own a Roomba, but I don't have a cleaning service so my options are either do 100% of the cleaning or let the Roomba do its thing and manually take care of the difference.
For me it's just one less room I have to sweep.
Roomba is pretty mediocre at a single job it's kind of able to do.
Humanoid robots _potentially_, _hypothetically_ can do anything that human can do because they are designed for environment, tools and equipment that we designed for our bodies.
chmod775•2h ago
I'm not surprised at all they're struggling to find buyers.
Daneel_•2h ago
If they’re humanoid then they can already use tools, equipment, and access methods we already use for ourselves.
What part of the vision doesn’t make sense?
chmod775•2h ago
When these things can make a burger without help I'll change mind, but right now they're not even close to that. Everything I've seen so far makes them look like clumsy pieces of junk. I haven't even seen one make a sandwich without a human having to prepare every step for them so they could then perform "cutting motion" or "stack ingredients" (painfully slowly and shaking like a geriatric).
TheDudeMan•2h ago
elzbardico•2h ago
chmod775•2h ago
These aren't. They're not even ten percent there. I don't get why you'd try to mass-produce and market them.
Tesla is going to have proper autonomous driving in their consumer vehicles before they make one useful humanoid robot.
sjsdaiuasgdia•2h ago
The recent clip posted by Marc Benioff was...painful. It took a few seconds to reply to a simple greeting. Its next bit of speech in response to a query of where to get a Coke has a weird moment where it seems like it interrupts itself. Optimus offers to take Benioff to the kitchen to get a Coke. Optimus acknowledges Benioff's affirmative response, but just stands there. Then you hear Musk in the background muttering that Optimus is "paranoid" about the space. Benioff backs up a few feet. Optimus slowly turns then begins shuffling forward. Is it headed to the kitchen? Who knows!
The reaction to that should not be "OMG I cannot wait to pay you $200-$500k for one of these!" It should be "You want HOW MUCH for THIS? Are you nuts?"
xandrius•2h ago
- No pain
- No breaks
- No protesting/strikes
- No rises needed
- No happiness to take care of
All things business find annoying.
numpad0•1h ago
iamleppert•1h ago
They could release a software update and disable your entire workforce unless you agreed to pay more money. They could slow your workforce down to prop up a competitor, etc.
whiplash451•40m ago
Your current cloud provider can absolutely "release a software update and disable your entire workforce unless you agreed to pay more money". The reason why they don't is quite obvious (competition).
sandworm101•2h ago
https://youtu.be/U2sN5g6wOBU
chmod775•2h ago
That was my entire point!
zerotolerance•2h ago
Generalist robotics are all about minimizing or at least front loading some portion of retooling cost, minimizing overhead associated with safety and compliance, and being able to capitalize what would have otherwise been human opex. Those pressures aren't going anywhere.
schwartzworld•2h ago
What does them being humanoid have to do with this? There are other form factors that could get to 80% but might be simpler to implement.
BurningFrog•2h ago
If that ends up being a dominant or niche part of the robot market is way too early to predict.
ACCount37•2h ago
Can that "other form factor" climb stairs? Or operate existing power tools? Or get into a generic car to get transported to a new workplace? Or get teleoperated by a human with mocap gloves?
Non-humanoid robots don't get simplicity for free. They have to trade off capabilities to get there.
Nzen•1h ago
[0] https://www.youtube.com/watch?v=mHmmySGdaoM
ACCount37•2h ago
Less efficiently, sure, but for the manufacturing, logistics, maintenance? The economies of scale are immense.
The reason why we weren't doing exactly that back in the 80s isn't that universal humanoid robots somehow weren't desirable. It's that for a universal humanoid hardware to be useful, you need a fairly universal AI to back it.
That "universal AI" was nowhere to be seen back in the 80s, or the 90s, or the 00s. Now, we finally have a good idea of how to build the kind of AI required for it.
moffkalast•2h ago
Like, Digit costs a quarter mil and is rated for 10 thousand hours. It can stack boxes. For that price you can turn every box in your warehouse into an AGV and they'll last you forever.
0xbadcafebee•2h ago
It's impossible to do this in a general way. This could theoretically be scalable (produce the robot and have 10,000 companies all develop their own specialization routines), but the hardware (both the parts as well as neural interface) needs to be as capable as a human body, which isn't even remotely true. The physical robot will always limit what skills it can learn, on top of the difficulty of programming the skill.
I think we're hundreds of years away from making a robot that's as capable as a human. We would get there faster with synthetics or cyborgs. Create a human body without a brain, use Neuralink to operate it. Until then, specialized robots are the only thing that will scale to 10,000 skills.
ACCount37•1h ago
Currently, dedicated robotics datasets are pathetic - in both the raw size and domain diversity - compared to what we have for generative AIs in domains like text, sound, video or images. So adding any more data helps a lot.
If you trained a robot to fully strip down a specific e-scooter model - whether for repair, remanufacturing or recycling - that training data would then help with any similar tasks. As well as a variety of seemingly unrelated tasks that also require manual dexterity, manipulation and spatial reasoning.
Those "9999 specializations" all overlap in obvious and subtle ways - and they feed little bits of skills and adaptations to each other. Which is why a lot of the robotic companies are itching to start pushing the units out there as soon as they are able to perform some useful tasks. They want that real world training data.
Earw0rm•1h ago
It's basically a robot arm with mobility at that point, and if you need more than one, just have more than one robot wheel into place. There's no particular reason to have two arms.. one, or three, or five are all sensible numbers. Heck, a chassis supporting a variable number of arms and other appendages (sensors and so on) is plausible, and the control system looks more like an ant-colony mind than a human one.
Which is a long-winded way of saying, there's no particular reason to link embodiment and cognition at the individual arm level in a factory scenario.
ACCount37•1h ago
What remains is all the weird and awkward automation-resistant tasks where "just get a human to do it" is still easier and cheaper than redesigning everything to maybe get old school automation to handle them.
This is the kind of niche humanoid robots are currently aiming at. It's no coincidence that at least 3 companies trying to develop humanoid robots have ties to vehicle manufacturers.
turnsout•1h ago
numpad0•1h ago
This just occurred to me: do standard industry robotic arms not fit that description perfectly? They're not specialized for any particular task, the only customizable parameters are the size and the end effector.
They can move around car bodies or seats, or pick up an airbrush. They can probably be installed with a five-fingered hand, or onto a giant human torso, should such tools somehow made sense for some applications. They feel like the generalist robot that meets most of the expectations for the hypothetical factory humanoids, sans being a humanoid. I mean, I get it, but aren't those existing bots just what "the vision" calls for?
HarHarVeryFunny•1h ago
The premise itself seems bogus though - there's plenty of tasks such as traditional assembly line and conveyor belt automation where a stable robot bolted to the floor, with a wired power source and custom manipulators is going to be a much better option.
For a mobile robot stability and reliability are key, and it's hard to see how a humanoid robot would be anything other than a massive downgrade for applications like Amazon's warehouse robots, hospital drug delivery robots, mall security robots, robot vacuum cleaners, etc. Wheels for the win.
OTOH there's the dream/hype of a domestic robot doing all your household chores, where humanoid form might actually be a plus, but at this point that's a pipe dream, and I seriously doubt many people really want C-3PO in the kitchen washing the dishes even if he is managing to do it without breaking anything or short-circuiting himself. It's like a 60's vision of the future, with people in flying cars or living on mars. No product-market fit.
stackedinserter•1h ago