Hacker News new | past | comments | ask | show | jobs | submit

Microsoft starts canceling Claude Code licenses

https://www.theverge.com/tech/930447/microsoft-claude-code-discontinued-notepad
The comments I see recommending selective use of cheaper models doesn't match the reality I experience working in the industry. I have the constant threat hanging over my head of being fired if I don't churn out code quickly enough. I'm not willing to gamble with my livelyhood by using a less effective model.

Saving money on tokens isn't something that's rewarded during performance reviews; particularly because it's difficult to quantify how much you saved versus hypothetically using a more expensive model.

loading story #48248328
> I have the constant threat hanging over my head of being fired if I don't churn out code quickly enough.

And the tragedy is that this isn't sustainable, and we all involved deeply in tech know this. There is eventually going to be a big reality check the companies will have to pay, because you can't force creativity and quality, not even with AI, because actual intelligence lies with us at least for now and for the foreseeable future. However when the rope eventually snaps these executives at best will fall upwards, with big severance bonuses and a list of "contributions" we have to be grateful for. We are the ones that will suffer through the next big layoffs.

loading story #48248663
loading story #48249424
loading story #48248671
loading story #48248117
loading story #48248235
loading story #48248320
If you have such toxic environment, run.
loading story #48247646
Maybe once we get universal income we can start recommending this. Until then the individual isn't to blame when the only option to keep providing is to keep grinding in a toxic environment.

But I'd agree that everyone can start planning a career shift that'll span a few months to some years in order to seek better working conditions. Passively accepting all work degradation because that's life and money is needed is partly responsible for the current situation too.

Where to, that's the question. The economy is in the gutters and the replace-people-with-AI craze is making the issue even worse.
loading story #48247403
loading story #48247511
loading story #48247937
And open positions are simply because someone decided to run from that place
loading story #48247029
From reading the article. They offered their developers both Claude code and Copilot.

What they wanted was for them to use both and feedback which was better.

The developers voted with their feet and didn’t use Copilot.

What Microsoft were hoping was that the opposite would happen...

For months, Employees had the option to choose claude code or copilot. Now they dont.

Underlying model choice still has no restrictions. Opus 4.6 is by far the most popular. there's still big $$$ bills going anthropic's way.

loading story #48246064
loading story #48246257
Most of us never had the option for work to pay for Claude Code -- some internal orgs did this. That being said I had a personal Claude Code subscription for a bit.

Honestly I find GitHub Copilot CLI (and now also the new GitHub Copilot app) quite decent. I mostly use it with Opus 4.7, or rarely with GPT-5.5. The VSCode extension is ok, but CLI or app are the better experience IMO.

I wish I could understand the appeal of using Claude Code inside VScode rather than Copilot. I feel like I'm missing something obvious.
loading story #48245802
loading story #48245791
loading story #48245792
loading story #48245809
loading story #48245737
loading story #48245746
There's definitely a way to use Claude code that is token conscious.

I've tried throwing unsupervised agentic software factory workflows against the wall, and they burned through my tokens like nobody's business but didn't produce much.

Supervised, human-in-the-loop process on the other hand is much more productive but doesn't consume nearly as much. Maybe that's why everyone's pushing agentic approaches so much.

Yeah. Claude does good work but reviewing it all properly takes quite a bit of time. It got to the point I started having trouble maxing out my weekly allocation.

Dealt with that by going all out and making an agentic parallel code review skill. Basically an infinite TODO list generator. Now I'm definitely getting 100% of the usage I paid for. It really burns tokens like nobody's business, and catches a lot of issues while at it. I've been looping this review/fix process every week. It's dramatically reduced the amount of stuff I need to pay attention to during my human review sessions.

loading story #48246564
The current thinking is automated agents is what turns this from an industry in the tens of billions to a multi trillion dollar one. So yes you are right on the money, agents stimulate demand for this thing they've built.
"The bureaucracy is expanding to meet the needs of the expanding bureaucracy"
loading story #48246836
I didn't know that one. Loosely said to be Oscar Wilde.
loading story #48245315
[flagged]
loading story #48240981
I think it's great. People at a broad scale are getting first hand experience with resource management. It's a fairly cheap way of doing it too (in contrast to: learning this by managing humans) and we can all benefit from the skill transfer.
At the enterprise level though, its going to be hard to want to use a service in which costs are not predictable, and keeping those costs under control requires employee training.
You can put a limit on token spend and provide training (and even pre-configured workflows) on how to limit token spend.

Like the other commenter said: cloud spend can also spin out of control if you don't pay attention, yet we've found ways to keep it under control (training, guardrails, limits, transparancy).

loading story #48246306
>...use a service in which costs are not predictable, and keeping those costs under control requires employee training.

Isn't this a (mildly exaggerated) description of AWS, which is a very successful service?

loading story #48245485
Am I losing my mind, aren't there multiple headlines each day about companies penalizing employees for not using AI enough?
loading story #48241376
loading story #48241772
To be fair, the cost of software development has always been fairly unpredictable. What may be different is that the cost used to be roughly proportional to man-hours spent, while now the number of agents running in parallel may be less predictable.
loading story #48241157
loading story #48240949
There's no fucking training to mitigate a slot machine.
loading story #48246050
loading story #48246047
loading story #48245335
> There's definitely a way to use Claude code that is token conscious.

By buying a subscription and dealing with the limits, using claude code and paying per token seems like the fast lane to the poor house.

> There's definitely a way to use Claude code that is token conscious.

Colleague used Sonnet 4.6 on some pretty normal agentic coding tasks through AWS Bedrock to keep the data in the EU, 100 EUR usage in a single day. In comparison, the Mistral subscription costs about 20 EUR per month and we tested that for similar tasks it was okay, the usage got to around 10% of that monthly limit in a single day. Or Anthropic's own Max (5x) plan where you get way, way more tokens to do with as you please.

I feel like the sweet spot is having a monthly subscription with any of the providers (you're subsidized a bunch), but if you have to pay per tokens, now I'd just look in the direction of what tasks DeepSeek would be okay for, sadly probably not in the situation above. For a startup, though...

On the other hand, this feels a bit hypocritical:

> It was part of an effort to get project managers, designers, and other employees to experiment with coding for the first time, and sources tell me that Claude Code has proved very popular inside Microsoft over the past six months.

They're gonna say that the future is all AI... until they get the bill.

loading story #48246706
I was trying to get a better sense of the time cost quality matrix of these, so I threw together a quick eval of Sonnet 4.6, Mistral's dev model, and Opus 4.7 (figuring it's what you'd use if you were on Max).

The results for a function implementation and test of levenshtein distance in js are pretty similar but Mistral is 30x cheaper than Opus 4.7 and 4x faster than Sonnet 4.6.

https://5m6qnuhyde.evvl.io/

loading story #48245625
loading story #48242334
{"deleted":true,"id":48241718,"parent":48241363,"time":1779484164,"type":"comment"}
> They're gonna say that the future is all AI... until they get the bill.

I mean, the will continue to say so, they just want to be the ones being paid for the service, not anthropic :)

My experience as well... I've only hit Antrhopic's 5hr threshold a few times, and two of them was within a half hour of the window. Also, all three times I'd already accomplished a LOT.

I tend to work with the agent, and observe what's going on as well as review/test and work through results/changes. I spend a lot more time planning tasks/features than the execution, even using the agent as part of planning and pre-documentation. It works really well. I don't think people burning through the 5hr allotment in under an hour are actually reviewing/QC/QA the results of what they're doing in any meaningful way, and likely producing as much garbage as good (slop).

I'm really curious as to HOW the MS employees were using the agents as much as what they were doing.

I suspect subscription limits are quite a bit higher than the equivalent tokens their dollar cost could purchase. I similarly feel like I can get a lot done with a $20/mo Claude Pro subscriptions, but also can easily spend $10-20/day at API pricing with similar usage.
loading story #48240573
loading story #48240048
loading story #48240459
I get 98.6% cache hits on Claude code. Short of drastic arch changes it’s hard to imagine it getting much better.
98.6% cache hits doesn't distinguish an efficient workflow from an overly chatty linear agent repeatedly reusing the same context. Plus, it says nothing directly that the process has good useful progress per token.
loading story #48241022
You pay for cache hits on every turn and even with the newest architectures longer context is slower/more energy intensive. Constructing concise turns that reuse prefix and stop when the new context is no longer useful help, as does pushing generation down into cheaper models while using stronger models for verification.
---- Before it was:

Me: We need to do this this that.

Claude: <random stuff that approximates human outout>

Me: Are you sure?

Claude: Well actually there is a bug <more random stuff that looks right this time>

----- Now it is:

Me: We need to do this this that.

Claude: <random stuff that approximates human outout>

Claude: Let me consult the advisor on that.

Claude: advisor came up with some advice, adjusting according to that. <more random stuff that looks right this time>

yeah, by using codex
So, snippet from the article says the following:

> I understand that Microsoft is planning to remove most of its Claude Code licenses and push many of its developers to use Copilot CLI instead. While Claude Code has been a popular addition, it has also undermined Microsoft’s new GitHub Copilot CLI coding tool — a command line version of GitHub Copilot that runs outside of development apps like Visual Studio Code.

And people here are interpreting this as related mainly to the Claude burning too much tokens too quickly and suggesting Microsoft should rather use SomeOtherLLM©?

Is this Hacker News or rather Marketing Wars?

So "Microsoft chooses to eat its own dogfood" is a more accurate title?
I don’t think people read the article, I didn’t until I saw your comment. The article feels like clickbait tbh.
Por que no los dos?

Eso mensaje de hijo de Carlos

loading story #48241805
Feels about right.

I've launched an internal demo of Claude Code and Deepseek on the same day and we burned through our monthly allowance for Claude in just over a week, with more than a half of that budget being spent in one day. With DS people are unable to go through that same amount of money in a month, not even close.

With that Claude feels like an expensive toy, while DS is a shovel, purely because developers do not feel like they are eating into a precious resource while using it. Also it does not feel like there is much of a difference in capability between Claude and DS-pro. DS-pro and flash do feel like sonnet/opus and haiku, but flash is still very-very capable.

I rage canceled Claude today.

After 2 weeks of Claude getting progressively worse and worse, today was the final straw.

I don't care if they have a phone app. The model is COMPLETE garbage after you subscribe long enough and they think they've "got you".

I can't code on my phone if the model literally moves in the wrong direction and does the opposite of what I tell it to. If I wanted to make my code worse, I'd just randomly commit garbage. I don't need a mobile app for that.

loading story #48242632
loading story #48242640
Considered Gemini?
loading story #48241217
loading story #48241287
loading story #48249657
loading story #48247460
My experience is, Claude Code burns way more tokens compared to other agents, probably to ensure high levels of perceived quality, which is, most of the times not worth the bloat for the user. The bloat works for Anthropic as an advertisement at the cost of your tokens.
its kind of weird tho, jensen also said we should be burning tons of tokens as well... 'perceived quality' cant be the only reason these ceos pushing token usage so hard can it?
Thus does kind of beg the question: If developers are being laid off because AI is better/faster/cheaper or makes all their people 10x or whatever fig leaf, what happens if the required tooling ends up being more expensive? From the investor’s point of view is the drag of employee costs better or worse than a ballooning expense item?
More expensive is a difficult calculation: faster can sometimes warrant the higher cost, if it means you can go faster to market. Also, LLMs work 24x7, and can be scaled up and down as needed. Faster to off board an LLM than to fire an employee (especially here in Europe). So, even if AI is more expensive than a developer, from TCO and ROI perspective it can still make business sense.
They lay people off and look good in front of investors. Then they hire people, talk about "growth", and once again look good in front of investors.

This would never fly if stock market was rational. But it never is.

loading story #48245518
I suppose if it all works out it'll end up way more expensive than the employees the models displaced ever were. These kinds of technologies usually end up as an oligopoly at best, and those players will have a wide moat by then, and the things these models build will be tweaked such that no other model or human being can realistically work on them anymore, and then they can price gouge everyone to the brink of unprofitability.
loading story #48241052
loading story #48245599
> If developers are being laid off because AI is better/faster/cheaper

This is, in my opinion, tripe. SWEs are being laid off because of post-Covid over-hiring. The only evidence for labour destruction is in junior hires. But not because anyone is being fired, but because entry-level jobs are being cannibalised.

loading story #48246116
I suspect AI would have to get drastically more expensive before it starts looking worse than payroll. If one developer using Claude Code can effectively substitute for 2 developers, you are already coming out ahead at current API pricing assuming very heavy usage, your cost is going to be ~1.5x developer (factoring in beyond salary - benefits, PTO, the other overhead that comes with having employees).

So you're getting 2 for the price of 1.5. Scale that up to 500 devs at a big company and it's a big chunk of change saved on payroll.

Keeping your headcount or hiring humans instead, AI would have to start to cost upwards of $15k/month/developer or more before it costs more than hiring. You're looking at about 4 billion tokens per month before humans start to break even or are cheaper.

loading story #48240765
loading story #48241197
"AI" is just a cover for laying ppl off and saving cost. But the pendulum will swing the the other way and the companies will realise that knowledgeable ppl are still required to generate and utilize the generated code. No serious company can run with vibe-coded apps generated by laymen.
There is no profit, expense, revenue. Those don't matter. Only thing that matters is stock price goes up, and laying off makes stock price go up. When laying off make stock price go down, then laying off stop.
loading story #48240856
{"deleted":true,"id":48241230,"parent":48239700,"time":1779481822,"type":"comment"}
loading story #48248519
I’ve been quite content with CoPilot’s $10/mo plan. Still offers access to Claude models (limited tokens) but has no time limits like the $20 Claude plan, so no interruptions in work flow. I use one of the free models for the more pedestrian tasks then sic Claude on the particularly thorny problems. Works very well for me.
I'm not sure if you are referring to the old or new plan?

Github Copilot offered probably the best value and was IMO underappreciated for a long time; I've been an annual subscriber since day 1.

The changes announced a few days ago completely revoke that value proposition, I doubt I'll continue with it.

loading story #48246264
Can even buy more premium tokens for more Claude use, which I have done once. But most of the time the tokens included in the plan are sufficient.
Cancellation effective June 30. This was a _pilot_ launched in December that accidentally consumed their 2026 yearly target spend on AI!

I expect the r/LocalLLaMA guys to be going nuts about this news.

From the article

> It was part of an effort to get project managers, designers, and other employees to experiment with coding for the first time.

I suspect they weren't as efficient as they could be with token use either. Sounds like they were trying to encourage non-developers to vibe code stuff

loading story #48241007
The title is somewhat bait. It reads like MSFT is using less AI, while in fact it's just a force swap to Copilot.

Arguably, Copilot is GPT 5? Not sure what the CLI offers behind the covers.

Employees (at least on my team) get access to the Claude models as well when using Copilot CLI.
Copilot is the name for the harness / wrapper of MSFT products

The CLI can swap to whatever model (/models) based on your subscriptions.

The copilots on desktop or Office Apps are likely just GPT5 nano or other tiny models with cheap inference

I disagree. As someone who just got a new Windows laptop with Copilot baked(forced) in I've tested Copilot a lot.

It. is. so. bad.

It feels like it's at least 1-2 years behind the current top models.

loading story #48245804
loading story #48246301
I'm surprised they even had them in a first place. Doesn't Microsoft have a deep partnership with OpenAI? Aren't all Copilot things powered by various GPT models? I would assume the two companies have barter agreements of sorts.
They do have agreements, but they aren't exclusive, and Microsoft and Open AI have had a rather public falling out over the last year.
loading story #48246739
loading story #48248192
Well, that's the inevitable outcome of token-maxxing :shrugs:
Lots of these places measure employee token use with managers having dashboards. It seems like performative code production rather than making anything useful.

Speed without judgement always compounds badly.

loading story #48248741
I switched to OpenRouter and OpenCode a while ago. It is much cheaper, much much cheaper, and A LOT more reliable. Particulary Gemini was a piece of trash when it came to uptime
If you properly keep documents, architecture, and decision records, token consumption can be pretty less. Iam managing everything with two codex plus sub. Repo size is 300 k loc ( backend).
I switched from Claude code to the GitHub copilot app recently. Since our repositories are hosted on GitHub I find the copilot app better integrated for the PR workflow with PR management available in the app. I don’t think I miss any of the features of Claude code I never thought I would make the switch but copilot upped the game.

Also it became very hard to convince management to keep both Claude code and GitHub Copilot enterprise licenses.

I think whats funny is that employees were most likely already covering the cost for these tools because they are useful. Companies didn't believe employees were using these tools and now have forced their usage and no longer have the costs subsidized.

Similarly companies seem to reward high token usage as a sign of someone willing to play ball with AI and again have forced higher costs on themselves for people reward hacking or using tokens out of spite.

There is no world where I can put my company’s data through an external site without their express consent and security sign off. I suspect at most companies there’s zero path for people to have been paying for it themselves.
loading story #48241087
My guess is that at most companies, employees are prohibited from doing this, but not prevented.
That's very interesting to reconcile with the fact that not too far, Amazon employees feel incentivized to use as many tokens as possible.
"incentivize to use as many tokens as possible" = "Upper management knows people dont like change so we are forcing them to come up with ways to use this thing". It does not mean that management will encourage wastefulness in the future, and it also doesnt mean that token usage from now wont be reviewed in the future. Whats to stop them from dinging your performance in november because you wasted a hundred thousand on tokens with nothing to show for it?
loading story #48241437
loading story #48247860
Surely a company as large as Microsoft is actively attempting to build their own models. They couldn't possibly have expected to stake the future of their software development on the conditions of a third party company?
Okay, but what if you're not Microsofts size and don't have and R&D budget large enough to fund development of your own models and tools?

This is a warning to any company, not building their own AI, that AI assisted development could become really expensive really fast and most likely won't pay off. What Microsoft is suggesting is that the current price is to high, but it's still not high enough for e.g. Anthropic to be profitable, or AI coding tools are only as good as the developers using them. So you can't meaningfully do layoffs by replacing the developers with AIs, because the cost is to high.

How does Microsoft plan to fix CoPilot, so that the cost will be so much lower than Claude, that budget overruns won't be a problem for their own customer?

loading story #48239921
loading story #48241135
> attempting to build their own models.

At one point there were rumours that they'd do that. They also have the rigts to oAI models for a few more years still, so they could always use that but apparently they're also compute starved (like anyone else).

loading story #48241152
MSFT and Apple are taking the same approach.

The frontier model space costs 1000x as much to develop as the small language models, and is only 1.5 years ahead.

Factually, the frontier models have not paid for themselves. So, if you're MSFT and Apple, you don't need to run in a race where even the winner loses massively.

You can try to train models 1.5 years behind that are highly likely to be profitable, given your market position.

The average person is lagging behind what AI is capable of by 3+ years anyway...

So you can save 1000x on training and 10x on inference and just use SOTA small models.

Why spend $5B training a model that's for sure not going to make $5B (after inference costs) when you can spend $5M building one that WILL make far more than that after inference costs?

Curb Your Enthusiasm theme starts playing.
loading story #48243498
loading story #48248735
The way coding agent work is fantastically wasteful. All the megabytes of code are processed over and over and over, sometimes withing just one session.

There are papers describing KV cache precomputation for commonly used documents (e.g. KVLink), but, of course, it's not a priority for model providers: they'd rather sell you more tokens, also they would rather get to AGI/ASI first than optimize usage of existing models...

Claude code gets >98% KV cache hits. It’s not reprocessing unless you let the cache go cold (5 minutes, which is annoyingly short).
loading story #48241265
loading story #48241119
loading story #48245431
{"deleted":true,"id":48245307,"parent":48239145,"time":1779518587,"type":"comment"}
Microsoft should host DeepseekV4 internally for its developers. And you're welcome.
Microsoft does self host claude and gpt for GHCP
This is the smartest solution to do, to self host the model locally on premise.
loading story #48241070
What's the point of eating your own dog food when the only thing you are doing is reselling other people's dog food? Microsoft don't have any competing LLM.
Tokens aren’t that much of an issue when your not evaluated on the usage
They got DeepSeek on Azure, would cut costs by 10x … if they ran it on Huawei
I think tech companies are doing layoffs partly because they need to cover AI operating expenses.
I think so too, otherwise why wouldn't you put that (purported) increased capacity/output into improving your existing products or creating new ones, with the headcount that you already have?
loading story #48247449
I switched from Anthropic to OpenAI after spending ~$40K in equivalent token costs using Claude over 3 months.

I found Opus 4.7 to be slow and wasteful with token usage. It's shocking how inefficient it is with tasks like bash tool usage and web searching, delegating them to a dozen subagents only to get stuck and never return until you esc and intervene. That, in addition to all of the broken tooling Anthropic built in to limit token usage like the broken monitoring tool made managing Claude a chore. I was happy to pay $200/month for Opus 4.5 when they had more capacity, but 4.7 felt like a huge step back and no longer worth the price and inconvenience.

I remember an OpenAI employee comment on the GPT5.5 release post about how they specifically geared it towards long-horizon tasks and its been a breathe of fresh air in that regard. I have five two-week long sessions going right now and there's been no degradation in performance or efficiency. It's much better at carrying rules/learnings forward even in long-running sessions and grounding/refreshing itself in verified facts when it loses context.

Its funny because in two weeks I've gotten way more done with GPT5.5 with way fewer tokens and way less handholding. I think this goes to show how important tooling and the harness is and how a capable model like Opus 4.7 can be severely handicapped by bad product decisions.

Being able to mange context over long running sessions is a function of the harness, not the model. Are you using Claude Code with GPT5.5? Codex? piclaw? They’ll all have different context management strategies to let you keep going when you would otherwise have filled up context and be forced to stop.
loading story #48247842
loading story #48248346
My impression is they're being cancelled in favor of full internal adoption of Copilot CLI, which has got much better over the past few months.
I'm also a big fan of Copilot CLI, especially after demoing it to a coworker who liked Claude Code.
To be fair, Microsoft dogfooding something for once would be great.
loading story #48246990
This feels like these kind of bad incentive problems we always here about on here ... Like bugs and vipers.
Doesn't MS have the compute to run GPT 5.5 for all its employees?
This is an AI generated summary of a blog post (https://www.thelowdownblog.com/2026/05/microsoft-cancels-int...) which is a summary of an AI generated article (https://blazetrends.com/microsoft-cancels-claude-code-pilot-...) which is a summary of another AI generated article (https://www.themodelwire.com/article/microsoft-starts-cancel...) which is a summary of an article from The Verge (https://www.theverge.com/tech/930447/microsoft-claude-code-d...). I guess it would be better to link the Verge article instead.
The absolute state of the Hacker News main page in 2026. Thank you for taking your time to put it all together.
2nd link doesn't work. That would be a neat tool, to find the original article and see how many levels of AI summary it has gone through, a game of AI telephone!
loading story #48239456
My bad. I had trouble finding the original source when I googled for it and grabbed a link. I was originally shown a screenshot of a x.com post.
loading story #48239224
Man, maybe it's time for me to give the verge a subscription. There the only ones actually doing any journalism here and a bunch of AI blogs skimming off the top.
boy i'm leaving the internet. sun is shining. was a good time here while it lasted.
The artificial centipede.
i swear i'm going to start an amish community and internet where we forbid any technological development past 2019

call me a luddite, i'll be wearing it as a badge of honor

Welp, this is the future we live in now
loading story #48246762
AI slop ruined a story about AI? This thread is a story about itself.
Microsoft poorly manages token use of most expensive models in a pilot. Then they use that failure to advertise/position their own Github Copilot agents to procurement teams, over the now widely validated Claude Code-based agents.

At least Codex is trying to win validation on merit.