Hacker News new | past | comments | ask | show | jobs | submit
Surely a company as large as Microsoft is actively attempting to build their own models. They couldn't possibly have expected to stake the future of their software development on the conditions of a third party company?
Okay, but what if you're not Microsofts size and don't have and R&D budget large enough to fund development of your own models and tools?

This is a warning to any company, not building their own AI, that AI assisted development could become really expensive really fast and most likely won't pay off. What Microsoft is suggesting is that the current price is to high, but it's still not high enough for e.g. Anthropic to be profitable, or AI coding tools are only as good as the developers using them. So you can't meaningfully do layoffs by replacing the developers with AIs, because the cost is to high.

How does Microsoft plan to fix CoPilot, so that the cost will be so much lower than Claude, that budget overruns won't be a problem for their own customer?

I expect in the next year or so, we'll stop seeing headlines like "Anthropic buys $15b of compute from SpaceX" and we'll start seeing headlines like "Uber's AI department licenses GPT 6.2 as the foundation for their internal model," or something like that.

Smaller companies will have departments that distill larger models into something more specifically manageable and useful for them. At least, that's my personal prediction :)

loading story #48241378
loading story #48246460
Giving your workforce Claude is like giving everyone in the USPS a Ferrari.

There may be a spot of “good enough to pay for and make a profit” that exists.

MSFT and Apple are taking the same approach.

The frontier model space costs 1000x as much to develop as the small language models, and is only 1.5 years ahead.

Factually, the frontier models have not paid for themselves. So, if you're MSFT and Apple, you don't need to run in a race where even the winner loses massively.

You can try to train models 1.5 years behind that are highly likely to be profitable, given your market position.

The average person is lagging behind what AI is capable of by 3+ years anyway...

So you can save 1000x on training and 10x on inference and just use SOTA small models.

Why spend $5B training a model that's for sure not going to make $5B (after inference costs) when you can spend $5M building one that WILL make far more than that after inference costs?

> attempting to build their own models.

At one point there were rumours that they'd do that. They also have the rigts to oAI models for a few more years still, so they could always use that but apparently they're also compute starved (like anyone else).

MSFT does have a frontier AI Lab. My friend works there. I don’t know what they’re doing. But MSFT is one of like 5 entities that actually have the talent and physical infrastructure to compete in model-building.
Curb Your Enthusiasm theme starts playing.
i was thinking more arrested development but that works as well