We'll know this works when it starts replacing Amazon pickers in quantity. Amazon has been trying to automate that for years, with many demos and contests. So far, nothing can quickly and reliably take random products out of one bin and put them in another. Amazon's robotic systems move larger containers and shelves of bins around, but do not yet pick individual items.
You raise a great point. And the Amazon picking staff are onshore in wealthy countries. I guess the minimum wage paid by Amazon is around 15 USD per hour.
I wonder: Is the task of automating this work primaryly difficult in vision or dexterity (motion)? Or maybe they are equally difficult for different reasons.
If we're talking about picking objects at random from one bin and putting it in another, I don't need my eyes to do that. Proprioception (shape and location) and touch (texture) are enough to do that.
Probably both vision and dexterity, and the first mistake we make as roboticists/engineers might be to distinguish the two like they're separate problems to solve or that a solution exists where the two live a separate life.
https://rodneybrooks.com/why-todays-humanoids-wont-learn-dex...
Agreed. The solution will likely be some vision foundation model that directly sends controls to the robot ("move here, grab, move there"), trained by Amazon with RL to integrate collision avoidance, object detection, grasping point detection, grasp verification etc.
There's a lot more money being thrown at this than in previous years. Seems to be growing beyond corporate R&D labs and university research towards startups trying to productize it.
I've seen multiple articles about robotic claws. This one made the rounds previously https://www.firgelli.com/pages/humanoid-robot-actuators
>A humanoid robot takes roughly 5,000 steps per hour. Each step sends a shock of 2–3× body weight through the leg actuators—forces that would be fine occasionally, but become destructive when repeated thousands of times without pause.
As someone who comes from the world of running and knee problems, I feel this misses the issue. Normal walking should not produce these kinds of shocks unless your gait is really jumpy or otherwise screwed up. You only start to see these forces when running and that's where technique becomes important even for humans if you want to prevent damage to your joints over long distances. But at least for walking I suppose that a fully articulated humanoid with all the degrees of freedom of human gait should be mostly a control problem, not a mechanical engineering one.
The force an impulse generates on a contact depends on the speed of deceleration. It's just F=m*a
Slow deceleration leads to low forces. If you have a contact event with a hard substance, like a rigid metal for accurate kinematics, the deceleration to zero upon a contact has to happen instantly. Meaning the deceleration is incredibly high, resulting in extremely high forces for a few milliseconds.
Human bodies are made out of a flexible and impact resistant material: water. When a contact event happens, your body deforms, which means that the deceleration happens over a longer time frame with less force. Not just that, your muscles also have a certain amount of flexibility in them and basically zero internal inertia. All the inertia is in the limb as a whole, whereas for a robot there is a spinning motor and gearbox that needs to slow down as well.
You could solve this as a control problem by adding series elastic actuators, which means you need to change your mechanical design.
The human body goes further than that too, when you're out jogging - as your foot approaches the ground for a stride, you slow the velocity of your foot downwards towards the ground so there's less of a sudden deceleration.
Imagine when you throw a tennis ball high in the sky, and then you catch it on your racket without bouncing by matching it's velocity, your feet do the same thing with the ground on a smaller scale.
Maybe this workforce is useful not because of it's direct output, but for it's mere existence : look politian, I'm creating jobs !
As others say, not necessarily. The breakeven point for jobs like Amazon may be quite low (or high? I mean simply "not yet there").
I'd say that we'll know it works when robots with those hands start turning out on the Russo-Ukrainian frontline en masse, because it is there where the lack of manpower has the most pressing and brutal consequences, and cannot be mitigated by usual peacetime incentives (e.g. better benefits).
That frontline has already sucked in all the automatization innovations of the last decade, as long as they proved themselves in combat.
> We'll know this works when it starts replacing Amazon pickers in quantity.
That doesn’t follow. There are plenty of tasks that can be fully and reliably automated but aren’t, for the simple reason that human labor is dirt cheap compared to advanced robotics.
I disagreed, then re-read your post, then re-read the OP, and now I've come full circle to apologize; I think you make a fair point.
I work at a biotech. We spent who knows how much time and money trying to develop a 'lab technician bot' to automate one of our critical assays. Turns out, a 6-figure machine still isn't as economical as my coworker Y, one of the veteran lab-technicians. Sure she takes the occasional sick day but even at our volume (and we do industrial-level, multiple clients batched into a single assay pass) it won't be economical to replace her for a very long time (if we even reach that scale).
But that does follow. The economics working is not some outside factor. If the robot “could do the task” but would cost more than paying a human to do the same task then the robot “does not work”. It is frequently because the robot would be too slow, or not reliable enough, or could only handle certain types of items. But ultimately all of these boil down to cost.
We have seen lab demoes of robotic manipulation for decades. The reason why they stay in the lab (when they do) and don’t become ubiquitous is because they are not good enough. In other words they don’t work. The economics and “does it work” is not two separate concerns but one and the same.
It's a continuum, not binary. The same robot that doesn't financially "work" for replacing a manual scavenger sorting garbage in an African slum might be quite cost-effective sorting recycling in Switzerland, and would likely have a niche regardless of price if used to (say) sort biohazardous or radioactive materials. And there are already millions of robots out there assembling cars etc.
What is the point of humanoid “general” robots then? We already have pretty reliable ways to make and train humans. Humans are cheaper and better than robots. I could imagine robots for some specialised tasks where you don’t want to use a human for eg security reasons, but you don’t need general purpose robots for that
technology gets cheaper over time. If they were always going to cost the amount they do now, you might have a point. But they'll eventually get much cheaper.
Robots are good at things that are "simple" but where human precision is not good enough, or where people would get bored and make mistakes.
If robots ever do get cheaper than humans for it, though?
In natural ecosystems, nobody beats the apex predator directly, and nobody beats the hyperspecialized niche critter at their own game. The new species has some advantage that’s different than what is there.
If a humanoid robot is slower dumber human that is expensive, requires power, can’t get wet, falls over, and doesn’t understand stairs. Is not sleeping and being radiation tolerant enough of an advantage to be worth it?
The nature comparison doesn't work on a fundamental level because you're only getting a fraction of the human's power based on how much they're happy to sell.
You forgot a big one in your description of the hypothetical advantages:
No free will
[dead]
They already are, the problem with humanoid robots is that people think that adding legs to the robot will somehow fundamentally make it more intelligent.
People see a robot arm attached to a stationary platform and understand it requires integration work to perform a single task.
But when those same people see a humanoid robot, they think they can just talk to it like a real human and it will do what you told it to do.
They don't think about the fact that the humanoid robot has to be programmed exactly the same way the stationary robot arm has to be programmed and that programming the legs in addition to the arms is a much more challenging problem.
Robots can be optimized for tasks and if they are, their benefits are greater. When cars replaced the horse, it was because they didn’t poop, and because a car designed only for transport would not suddenly have a heart attack and stop working.
Funnily enough, cars have their own way of pooping and dying of a heart attack.
Cars can stop working suddenly in many many ways, for many reasons.
A friend who works at Amazon made the same point: "We don't really need robots in the FCs urgently [other than the Kivas], because it turns out you can just pay people $17/hour"
I was thinking this week that AI token costs are probably going to get so expensive soon that bright spark CEOs are going to realise “why am I paying for such expensive coding agents when I can pay people from the third world to code!?!” and announce outsourcing like it’s some kind of stunning and innovative revelation.
C-suite has been saying this for 30+ years. They never tire of it. Ask yourself: At this point in time, why aren't all programmers working from low cost jurisdictions?> when I can pay people from the third worldI think you didn’t grok the hidden punchline - this is the stage after they’ve replaced all their third world coders with AI agents, until one day a C-suiter has the revelation that humans are cheaper and better, and the company then starts toting its humanistic credentials all over LinkedIn.
Mechanical picking has been too slow. It's not a problem with the robot mechanics. Here's 300 picks/minute from 2012.[1] The parts are all the same, so the vision problem is simple.
But picking arbitrary objects from fulfillment bins is still running at a few picks per minute.[2] As the speed picks up, humans become less necessary.
That's the point of the test condition. When running a robot becomes more economical than paying full-scale humans $17/h, something important about robot abilities will have changed.
I dunno, I worked in an Amazon Warehouse for a year part-time (and a couple of weeks full-time when in-between jobs) --- on one occasion, I pulled up to a bin full of non-descript cardboard boxes near where a group of trainees were working their way through, grabbed one box, spun it around for the six-sided box check, scanned it, confirming it was the right one, and before I could move on to my next pick, a trainee asked, "How did you know that was the right box?", which required a several minute explanation of how the item description and the slight differentiations of the boxes led to that conclusion.
The big win would be training the folks doing stowing to not create such situations and to put markedly different things in each rainbow bin.
This would be a more convincing take if reasoning LLMs didn't already exist. Given the growth in capability over the last few years alone nothing about your description "several minute explanation of how the item description and the slight differentiations of the boxes" seems beyond an artificial intelligence to solve by the time humanoid robots would be ready to physically traverse a warehouse.
Your last point is also interesting given perhaps a robot is more amenable to such instruction, thus creating cascading savings. Each human has to be trained, and could be individually a failure. Robot can essentially copy its "brain" to its others.
Or likely more accurately, download the latest brain trained from all the robot's aggregate experiences from the amazon hivemind hq
The "Markedly Different things" in each bin was a big Amazon Warehouse advance in warehousing. Traditionally - things that were "alike" were put on shelves/bins - but (according to Amazon) it was far more efficient for pickers (at least back in the day - may have changed since then) to have random things on shelves located near each other to allow for equal access to popular items by pickers.