More

smeeth · 2026-02-06T18:12:49 1770401569

I always understood this to be why Tesla started working on humanoid robots

smt88 · 2026-02-06T19:42:59 1770406979

They started working on humanoid robots because Musk always has to have the next moonshot, trillion-dollar idea to promise "in 3 years" to keep the stock price high.

As soon as Waymo's massive robotaxi lead became undeniable, he pivoted to from robotaxis to humanoid robots.

senordevnyc · 2026-02-06T20:04:04 1770408244

Yeah, that and running Grok on a trillion GPUs in space lol

ACCount37 · 2026-02-06T19:04:58 1770404698

Pretty much. They banked on "if we can solve FSD, we can partially solve humanoid robot autonomy, because both are robots operating in poorly structured real world environments".

jasondigitized · 2026-02-06T22:07:36 1770415656

I don't want a humanoid robot. I want a purpose built robot.

simondotau · 2026-02-06T22:53:06 1770418386

Obviously both will exist and compete with each other on the margins. The thing to appreciate is that our physical world is already built like an API for adult humans. Swinging doors, stairs, cupboards, benchtops. If you want a robot to traverse the space and be useful for more than one task, the humanoid form makes sense.

The key question is whether general purpose robots can outcompete on sheer economies of scale alone.

dham · 2026-02-07T19:28:30 1770492510

It's called a dishwasher, washing machine, and dryer. Plus like robomowers, vaccums etc.

monocasa · 2026-02-07T00:18:11 1770423491

I mean, I would take a robot to handle all of my housework.

Purpose built, that probably takes the form of a humanoid robot since all of tasks it needs to do were previously designed for humanoids.

rswail · 2026-02-07T06:52:13 1770447133

Vacuuming and mopping are not inherently "designed" for humans.

Dusting with a single extensible and multiple degrees of freedom arm would be much more maneuverable than a human arm.

Loading and unloading washing machines or dryers or doign the same for dishes and cutlery in a dishwasher is not inherently designed for humans.

If anything, selling an integrated "housekeeping" system that fits into an existing laundry and combines features would be a much better approach.

monocasa · 2026-02-08T07:15:02 1770534902

I agree that each would be made slightly better with a more integrated system. But you could handle all of them in my hundred year old house with the form factor it was designed for: a humanoid. Probably pretty soon here for cheaper than each could be handled separately by more integrated systems.

rswail · 2026-02-09T06:17:12 1770617832

For new builds, a laundry/utility room that includes the dishwashing and other "housekeeping" facilities is a no-brainer when there is a custom robot built to use those facilities as well as maneuver around the rest of the house.

For old/retrofit renovations it also makes sense, but otherwise, yes, a human-form robot makes sense.

The question is which is a better investment for any robot manufacturer in 2026?

rswail · 2026-02-07T06:48:24 1770446904

The drop in demand for Tesla's clapped out model range would have meant embarrassing factory closures, so now they're being closed to start manufacturing a completely different product. Bait and switch for Tesla investors.

I wonder how long they'll be closed for "modifications" and whether the Optimus Prime robot factories will go into production before the "Trump Kennedy Center" is reopened after its "renovations".

Fricken · 2026-02-06T22:25:25 1770416725

It's so they can stick a Tesla logo on a bunch of chinese tech and call it innovation.

smeeth · 2026-02-03T18:12:13 1770142333

Just reading your description, it sounds like there are two variables:

1. Prompt adherence: how well the models follow your stated strategy

2. Decision quality: how well models do on judgment calls that aren’t explicitly in the strategy

Candidly, since you haven’t shared the strategy, there’s no way for me to evaluate either (1) or (2). A model’s performance could be coming from the quality of your strategy, the model itself, or an interaction between the two, and I can’t disentangle that from what you’ve provided.

So as presented, the benchmark is basically useless to me for evaluating models (not because it’s pointless overall, but because I can’t tell what it’s actually measuring without seeing the strategy).

porttipasi · 2026-02-03T18:27:22 1770143242

That's a fair point. You're right that without seeing the strategy, you can't fully disentangle what drives the differences.

But the strategy itself isn't really the point. Since every model gets the exact same prompt and the exact same market data, the only variable is the model. So relative performance differences are real regardless of what the strategy contains. If Model A consistently outperforms Model B under identical conditions, that tells you something meaningful about the model.

And honestly, that blend of prompt adherence and decision quality is how people actually use LLMs in practice. You give it instructions and context, and you care about the result.

You're right though that the strategy being private limits what outsiders can evaluate. It's something I'm thinking about.

smeeth · 2026-02-03T18:41:10 1770144070

> Model A consistently outperforms Model B under identical conditions, that tells you something meaningful about the model.

Not really! Sorry to harp on this, but there are two ways one model could outperform another:

1) It adheres to your strategy better

2) It improvises

If the prompt was "maximize money, here's inspiration" improvising is fine. If the prompt was "implement the strategy," improvising is failure.

Right now you have a leaderboard; you don’t yet have a benchmark, because you can’t tell whether high P&L reflects correctness.

porttipasi · 2026-02-03T19:09:59 1770145799

To be more specific: the prompt defines a trading philosophy and tells models what to look for in the charts. But the actual read and the decision is entirely on the model. Using your framing — it's closer to "here's inspiration, now maximize money" than "implement this exact strategy." Which means improvisation within that framework is exactly what's being measured.

But yeah, it's closer to a leaderboard right now.

smeeth · 2026-01-05T15:22:12 1767626532

I suspect this sort of thing starts to happen when UX decision-making gets decentralized. No single god-king would allow six or more different new icons; the lack of uniformity is obviously nonsensical to anybody, but not necessarily to a disorganized collective of anybodies.

smeeth · 2025-09-10T15:14:39 1757517279

If your "impossible" designs are manufactured by non-exclusive suppliers it isn't much of a moat.

smeeth · 2025-09-09T15:40:41 1757432441

Hate to break it to you, but many kids actually do better away from their parents than with them.

It's extremely sad, but a consistent finding in early childhood education is that the children who thrive most in daycares tend to come from the least advantaged backgrounds.

So a policy of paying parents to stay home would mostly benefit kids who are already well off.

yardstick · 2025-09-09T15:43:57 1757432637

Kids are social and like playing and learning from other kids. Daycare lets them do just that. It’s a great thing and every toddler I’ve met who wasn’t in daycare was behind in something. Especially verbal skills.

Plus daycare allows women to continue their career progression. It’s soo important. Not every woman wants to end their career as a mother to a young kid. Daycare enables successful women to thrive and still have families.

declan_roberts · 2025-09-09T16:26:11 1757435171

"Why do you want a thriving career?"

"So I can provide for my family"

"Why do you want to provide for your family?"

"So my children can have happy and fulfilling lives"

"What makes your young children feel happy?"

"Spending time with me"

A strong parent-child relationship is the biggest determination of life-long child happiness even into old age.

https://pmc.ncbi.nlm.nih.gov/articles/PMC4784487/

yardstick · 2025-09-11T01:14:16 1757553256

You can have a strong parent-child relationship while still using daycare.

Also people work due to other reasons unrelated to providing for their family. Individuals are allowed to have lives outside their kids.

garciasn · 2025-09-09T15:59:23 1757433563

Your anecdote is just that. All of it is highly dependent on the child, their environment, and the 'educator'. Please don't make assumptions based on your limited exposure; it's not helpful.

Spivak · 2025-09-09T16:09:50 1757434190

Your "it depends" argument is that some kids aren't social, don't like playing with other kids, are better off not having exposure to social interaction with peers and practice talking.

If this is the criticism then it's a glowing endorsement of daycare and school.

garciasn · 2025-09-09T16:13:06 1757434386

No; it depends on the 'educator'. A daycare that doesn't have kids interacting in a positive way could be just as detrimental as a parent that doesn't socialize their children externally to the home.

xp84 · 2025-09-09T15:50:44 1757433044

I'm just gonna throw this out here: Well-off kids who barely know their workaholic parents have different but equally bad issues for society, than the poor kids do.

Those poor kids have learning deficits. The "well-off" kids often have morality deficits.

A mom or dad raising them properly might help them more than being Student #642 in a government childcare facility.

This isn't an argument against childcare. My children attended preschool for 3 years before Kindergarten. But I'd rather that people got equal support to have a stay-at-home parent so that people can choose.

smeeth · 2025-09-09T16:16:46 1757434606

Do you have any evidence for that?

From what I’ve seen, the research leans the other way. For example:

Children from more advantaged families were actually more likely to view unfair distribution as unfair, while poorer children were more likely to accept it. [0]

Mother’s work hours show no link to childhood behavioral problems, it’s schedule flexibility that matters. [1]

For working-class families, more father work hours correlated with fewer behavioral problems.[2]

The idea that “well-off kids” end up with morality deficits because their parents work a lot doesn’t seem to hold up.

[0] https://onlinelibrary.wiley.com/doi/10.1111/desc.13230

[1] https://pmc.ncbi.nlm.nih.gov/articles/PMC9119633/

[2] https://pmc.ncbi.nlm.nih.gov/articles/PMC7021583/

nevir · 2025-09-09T15:55:09 1757433309

Like all things: the extremes are never good, and it's all about getting a healthy balance.

- Kids need lots of time with their parents

- Kids need lots of time around other kids

You can do that by sending them to daycare, and ALSO spending lots of time with them when they're home.

You can also do that by taking time off work, and then taking your kid(s) to places with other kids.

Both work; and it depends on your context which works for you.

ujkhsjkdhf234 · 2025-09-09T15:55:10 1757433310

You aren't wrong but calling it being "Student #642 in a government childcare facility" the wrong way of looking at it. Children grow up best when they are allowed to play with other children. Modern society robs kids of that and helicopter parents are bad for society.

xp84 · 2025-09-09T16:23:30 1757435010

I agree with you vigorously on both those points. I am skeptical however that NM will be able to create a lot of healthy, play-based environments for so many kids.

The market already has incentives to create them -- a ton of good places have waiting lists nationwide, showing unmet demand even at the current price. This suggests the price will need to go higher to attract enough people to do this job. It seems their "$12,000 value" estimate is based on an optimistic belief that they will be buying childcare for their citizens at current prices. When they realize there aren't that many slots available at current rates of pay, will they be okay significantly increasing the costs of the program?

So, my expectations for these facilities are very low and that's a big part of my concern.

ujkhsjkdhf234 · 2025-09-12T15:17:02 1757690222

I don't know where you live but where I live, the cost of daycare is extremely high and there is a waitlist on most places.

HatchedLake721 · 2025-09-09T16:19:02 1757434742

> Hate to break it to you, but many kids actually do better away from their parents than with them.

Is this based on something?

There's research left and right shows that children under 36 months at group nurseries are linked to increased aggression, anxiety, lower emotional skills, elevated cortisol (stress hormone), which is associated with long-term health and developmental risks.

Infants and children do better with one-to-one care at home by their parents and familiar faces, rather than strangers in a group setting.

cvoss · 2025-09-09T16:22:28 1757434948

Perhaps there is something about the environment of an economically disadvantaged household that could be improved by a stipend which allows at least one parent the breathing room to dedicate full time attention to the child instead of a job (or multiple jobs). I don't think the findings you mentioned cut against that idea at all.

I hear you saying the benefit of dedicated caregiving for children mostly helps families with less economic advantage. I'd agree with that, and suggest that OP's proposal capitalizes on exactly that. I'm not convinced of what may be implied in your argument that low-earners make for bad parents and that children should be separated more from their parents for their own good. Let the internal dynamics of a family be solved first, before saying we need to separate parents from children more.

Moreover, those with more economic advantage are unlikely to take a stipend in exchange for staying home. That's not a good deal when keeping the job pays so much that they can afford to pay for childcare.

It is precisely those with less advantage who will take the deal.

So I don't agree with your prediction that such a stipend mostly benefits those who are already well off.

bko · 2025-09-09T16:41:48 1757436108

> that the children who thrive most in daycares tend to come from the least advantaged backgrounds.

So the children that do well in daycare comes from poor homes? So kids from rich home don't do well in daycare?

Every interaction I've ever had says the opposite. The disruptive bully at school usually comes from a broken home.

MisterTea · 2025-09-09T15:58:19 1757433499

My daycare was called preschool. It allowed my mother to focus on my infant brother during the day while I was literally two blocks away running around, coloring and learning shapes. Show and tell was my favorite.

Minor49er · 2025-09-09T15:48:54 1757432934

> Hate to break it to you, but many kids actually do better away from their parents than with them.

How so?

smeeth · 2025-09-09T15:51:10 1757433070

The most obvious example is the children of addicts. It’s hard to imagine a kid is better off stuck at home with druggie parents than spending the day in daycare.

declan_roberts · 2025-09-09T16:31:41 1757435501

A good example of bottom quintile policy. Because the bottom quintile has a better outcome with a certain approach, it becomes standard care for everyone else.

Once you see it, you'll see it everywhere.

smeeth · 2025-09-09T17:35:17 1757439317

…so?

A realistic stay-at-home subsidy would max out around $30k. Your proposal only meaningfully shifts incentives for the bottom income quintile. For everyone else:

- Upper-income families can already afford to choose whatever setup they want.

- Middle-income families couldn’t take it because it’d mean too steep a drop in income.

So the alternative you proposed economically benefits the bottom quintile while leaving their kids worse off. For everyone else, it probably either doesn't matter or gives them cash they don't need as much.

Minor49er · 2025-09-09T16:43:33 1757436213

Anyone would be better off being away from addicts though

smeeth · 2025-09-05T17:39:47 1757093987

Another day, another person not getting discounted cash flow.

Models trained in 2025 don’t ship until 2026/7. That means the $3bn in 2025 training costs show up as expense now, while the revenue comes later. Treating that as a straight loss is just confused.

OAI’s projected $5bn 2025 loss is mostly training spend. If you don’t separate that out with future revenues, you’re misreading the business.

And yes, inference gross margins are positive. No idea why the author pretends they aren’t.

smeeth · 2025-09-02T10:11:25 1756807885

As far as analogies go I prefer approximate database

smeeth · 2025-08-14T14:19:52 1755181192

I've been annoyed for a while people don't use a common parameter weight/compute budget for benchmarking papers.

That said, it does make it easier to claim progress...

pizza · 2025-08-14T17:23:23 1755192203

https://github.com/KellerJordan/modded-nanogpt is pretty great in that respect

godelski · 2025-08-14T23:48:31 1755215311

As a researcher, I can totally agree, but at the same time this isn't super straight forward. Things get weird because you can't just translate from one GPU to another. There isn't a clean calculation for that. There's also other issues like parallelism. Sure, your model is stable with a batch size of 8192 but that's across 1 node, it might not be stable with that batch across 2 nodes. This is a real frustrating part and honestly I don't think most people even are aware such issues exist.

Right now I'm just happy when people are including parameter, GMACs (or FLOPs), and throughput. I always include those and the GPUs I used. I also frequently include more information in the appendix but frankly when I include it in the front matter the paper is more likely to be rejected.

I can tell you why this isn't happening though. There's a common belief that scale is all you need. Which turns into "fuck the GPU poor". I've published works where my model is 100x smaller (with higher throughput, and far lower training costs), and the responses from reviewers tend to be along the lines "why isn't it better?" or "why not just distill or prune a large model?" There's this weird behavior that makes the black box stay a black box. I mean Yi Tay famously said "Fuck theorists" on twitter

smeeth · 2025-08-05T20:04:40 1754424280

Their FOSS local inference service didn't go anywhere.

This isn't Anaconda, they didn't do a bait and switch to screw their core users. It isn't sinful for devs to try and earn a living.

kermatt · 2025-08-05T20:30:02 1754425802

Another perspective:

If you earn a living using something someone else built, and expect them not to earn a living, your paycheck has a limited lifetime.

“Someone” in this context could be a person, a team, or a corporate entity. Free may be temporary.

blitzar · 2025-08-05T20:09:59 1754424599

Yet. Their FOSS local inference service hasn't go anywhere ... yet.

dcreater · 2025-08-05T22:29:16 1754432956

You can build this and go build something else as well. You don't need to morph the thing you built. That's underhanded

smeeth · 2025-07-02T17:03:28 1751475808

It's worth noting this is the exact argument people used against adopting electric calculators.

thoughtpeddler · 2025-07-02T17:47:22 1751478442

Calculators are a very narrow form of intelligence as compared to the general-purpose intelligence that LLMs are. The muscle/steroid analogy from this same discussion thread is apt here. Calculators enhanced and replaced just one 'muscle', so the argument against them would be like "ya but do we really need this one muscle anymore?", whereas with LLMs the argument is "do we really even need a body at all anymore?" (if extrapolated out several more years into the future).

smeeth · 2025-07-02T21:17:34 1751491054

You don't need the analogy. If you have a tool that does a job for you your capacity to do the job degrades alongside other associated skills.

Tools that do many things and tools that do a small number of things are still tools.

> "do we really even need a body at all anymore?"

It's a legitimate question. What's so special about the body and why do we need to have one? Would life be better or worse without bodies?

Deep down I think everyone's answer has more to do with spirituality than anything else. There isn't a single objectively correct response.

conception · 2025-07-03T01:48:09 1751507289

They weren’t wrong. There are lots of cognitive and conceptual benefits to slide rules.