The Wintel Era Is Over

Over the past two days, several giants posted almost the same sentence at almost the same time: "A new era of PC." NVIDIA posted it. Windows posted it. Arm followed. Behind the line was a set of coordinates pointing to Taipei. Tech companies declare new eras every day. That is not interesting. What is interesting is that they rarely do it together. What is even more interesting is that this time, they are all talking about the PC.

An old thing that had lost its spotlight to mobile for more than a decade suddenly returned to the center of the table.

Many people's first reaction was: AI PCs are coming again.

I think that is too shallow.

The PC is not becoming a hot topic again because it suddenly became sexy, nor because the giants want to give laptops a new marketing name. The real reason is more practical, and more uncomfortable:

Cloud AI is too expensive.

Over the past two years, companies buying AI often ended up realizing they were not buying efficiency. They were working for model vendors. Tokens kept burning. Bills kept rising. Bosses kept asking about ROI. Teams kept tuning prompts. In the end, the only thing that looked more futuristic was the bill. Productivity was still stuck in the past.

Especially with Agentic Coding.

It has already proven useful. Writing code, changing code, running tests, reading repos, doing migrations, cleaning up technical debt. These are not tricks. They are real needs. The problem is that once something is truly useful, usage explodes. Once usage explodes, token cost becomes a tax.

The paradox of AI is this: the more useful it is, the more expensive it becomes; the more automated it is, the less controllable it becomes.

That is why the PC is back.

Not the old PC used to open Excel, browse the web, and plug into a monitor. A new thing:

A personal AI compute node.

The Wintel era is over.

Not because Intel will collapse tomorrow. Not because x86 will suddenly leave the stage. An architecture that ruled the PC for forty years will not be buried by a few tweets.

But the value anchor of the PC has changed.

In the past, the default answer for a PC was Windows + Intel/x86. Next, the core question for a PC will become: can this machine handle enough local intelligence work at a lower cost?

Whoever can answer that question stands at the center of the new era.

Why Is The PC Hot Again?

For more than a decade, the PC has felt like an old friend left behind by the times.

It is still there. We still use it every day. Work depends on it. Code is written on it. Documents are edited on it. Meetings happen on it. Financial reports are read on it.

But it is no longer sexy.

What is sexy is mobile, apps, Feed, short video, super apps, always-online everything. The PC is more like an office desk: important, but not imaginative.

Because for more than a decade, the most valuable computing left the PC.

Storage moved to the cloud. Applications moved to SaaS. Consumption moved to mobile. Collaboration moved to the browser. Compute became something you buy on demand. The PC slowly became a terminal, a screen, a keyboard, a browser container.

It did not need to be that powerful.

The valuable things were in the cloud.

That logic worked in the past. Companies bought SaaS. Users opened browsers. Data lived in the cloud. Collaboration happened in the cloud. The local machine only needed to be smooth enough not to get in the way.

So PCs became cheaper, and more boring.

But AI reversed this.

AI is not just another SaaS. The core cost of AI is not the page, the account, permissions, or the database. It is every inference. Every prompt, every context expansion, every tool call, every detour an Agent takes in the background, is token spend.

The PC lost imagination in the past because local compute no longer determined productivity.

Local compute matters again because intelligence itself has become productivity.

That is why the PC is back at the table.

It is not a PC revival.

It is the cost structure of cloud AI pushing the PC back.

The SaaS Math Does Not Apply To AI

Many people misread AI because they are still using a SaaS framework to understand it.

What is SaaS?

You buy a seat and pay monthly. The more users use it, the happier the vendor is. The software has already been written, server costs are relatively controllable, and marginal costs are diluted by scale.

For companies, SaaS is also easy to calculate.

One person pays 30 dollars a month. You can roughly estimate how much time it saves, how many processes it removes, and how much manual coordination it replaces. Even if the estimate is imperfect, at least the bill is stable.

AI is not like that.

AI billing is not seat-based. It is usage-based. You are not paying to "own a capability." You are paying for "each use of that capability."

The more you use it, the higher the bill.

The more successful the automation, the more frequent the calls.

The harder the Agent works, the faster tokens burn.

This is the opposite of the SaaS economic model.

The best part of SaaS is that marginal cost is hidden by the vendor. Companies buy certainty: fixed subscription, continuous usage, and better economics as usage rises.

The worst part of token pricing is that marginal cost is thrown back in the company's face. Companies buy uncertainty: nobody knows how much they will spend today, how much they will spend tomorrow, or how many steps an Agent took in the background.

SaaS sells efficiency certainty. Tokens sell cost uncertainty.

That is why many AI projects produce awkward expressions once they land.

The demo is stunning.

The POC looks promising.

Everyone in the weekly meeting feels the future has arrived.

Then the bill arrives.

Once the bill arrives, many things start to look wrong.

It turns out a simple task made the Agent read more than a dozen files, open dozens of context rounds, call several tools, and reflect on itself several times in the middle. It looked very intelligent. The bill was also very intelligent.

More awkwardly, the result was not stable.

Human employees may be expensive, but at least the cost is fixed. How much an engineer costs per month is basically clear. An Agent is different. It is like an intern paid by mileage: the harder it runs, the more expensive it becomes, and it may not know when it has gone off track.

So the first-principles question for enterprise AI is not "is the model strong enough?"

It is:

Does this make economic sense?

Agentic Coding Breaks The Contradiction Open

Why does this contradiction show up first in Agentic Coding?

Because coding is one of the few AI use cases that has already proven useful.

Using AI to write ad copy is often just a bonus. Using AI to summarize meetings may be useful, or may never be read. Using AI to generate images is more often entertainment or an assistant inside a design workflow.

Agentic Coding is different.

Code is structured. Feedback is clear. Tasks are high-frequency. Value is measurable. It can read repos, change code, generate diffs, run tests, explain errors, do migrations, and clean up technical debt.

It is not "possibly useful."

It is actually useful.

That is also what makes it dangerous.

If something is useless, nobody burns much money on it. The things that truly burn money are always useful things.

Once Agentic Coding enters the workflow, it becomes a necessity. Engineers want to use it every day. Teams want to use it every day. Bosses also want everyone to use it more, because it really does work.

But the way it works is expensive.

Writing one line of code is not expensive. Understanding a repo is expensive.

Changing one function is not expensive. Figuring out where to change is expensive.

Generating a diff is not expensive. Repeatedly verifying it, running tests, fixing errors, and running tests again is expensive.

The core consumption of Agentic Coding is often not in the final lines of code. It is in all the exploration, reading, reasoning, trial and error, rollback, and retry before that.

This is very similar to how humans write code.

The difference is that human exploration cost is already packaged into salary. An Agent's exploration cost appears line by line on the token bill.

That is why Agentic Coding will become a necessity while forcing companies to rethink their cost structure.

Agentic Coding proves that AI is useful, and also proves that all-cloud AI is too expensive.

That sounds contradictory, but it is not.

Precisely because it is useful, it becomes high-frequency.

Precisely because it is high-frequency, it cannot stay expensive forever.

The Value Of Local LLMs Is Not Being Smarter, But Being Cheaper

When people discuss local LLMs, they often ask one question:

Can a local model beat Claude? Can it beat GPT? Can it beat Gemini?

That is the wrong question.

In the short to medium term, local LLMs do not need to beat frontier models.

They only need to absorb enough low- and medium-complexity tasks.

That is enough.

What companies really need is not to call the strongest model for every request, but to stratify intelligence.

Simple tasks go to the local model.

Medium tasks go to an internal company model.

Complex tasks, key decisions, and hard reasoning go to a cloud frontier model.

That is a normal cost structure.

Many teams are using AI in an extravagant way today. Completion, summarization, search, simple refactoring, script writing, log explanation, test generation. Everything goes to the most expensive cloud model.

That is like delivering takeout in first class.

It can be done.

The math is wrong.

The value of local LLMs is not turning every computer into AGI. Their real value is absorbing a large volume of tasks that are not worth calling a frontier model for, leaving cloud tokens for the places where they are truly needed.

The value of local LLMs is not being smarter. It is being cheaper.

More precisely, it is having a healthier cost structure.

Buying tokens is opex: continuous spend, pay per use.

Buying hardware is capex: one-time spend, with depreciation. Spread over three or five years, as long as usage frequency is high enough, local compute becomes cheaper than cloud tokens.

This may not be that sensitive for individual users.

But companies are very sensitive to it.

One engineer calls an Agent dozens of times a day. A whole team calls it thousands of times a day. Add CI, code review, automated testing, knowledge-base Q&A, document generation, and log analysis, and token spend quickly stops being "small money."

When AI moves from toy to infrastructure, cost structure moves from "experience issue" to "life-or-death issue."

So local LLMs are not a belief system.

They are accounting.

The PC Becomes A Compute Asset Again

This is where the PC becomes important again.

Phones are important, of course. But phones are more like attention devices. They handle messages, consumption, photos, payments, social interaction, and instant response.

The PC is different.

The PC is a productivity device. It has a keyboard, a large screen, a file system, an IDE, a terminal, a browser, enterprise software, local data, development environments, repos, scripts, logs, and permission boundaries.

If an Agent is going to do real work, it is not swiping around on a phone.

It needs to read files, change code, run tests, look things up, call tools, connect to enterprise systems, and execute tasks across applications.

These things naturally happen on the PC.

Phones consume intelligence. PCs produce intelligence.

That is why an AI PC cannot be understood merely as "a laptop with an extra NPU."

That is too narrow.

The real new PC is not an extra Copilot key, nor a system assistant that can chat. It is a local intelligent work node.

It needs to handle part of model inference.

It needs to process private context.

It needs to finish low-latency tasks locally.

When cloud tokens are too expensive, it needs to intercept enough intelligent work.

When necessary, it routes complex tasks to the cloud.

In other words, the PC is no longer just a terminal for accessing cloud services.

The PC becomes a compute asset again.

That is what is really worth watching behind "A new era of PC."

Not that the PC is suddenly going back to the center of the 2000s.

But that AI's cost structure is forcing part of computing back from the cloud to local machines.

Wintel Is Just The Name Of The Old Order

Intel is still at the table. x86 will certainly not disappear overnight. If the only goal is running local models, CPU architecture itself may not even be the most important variable.

What really matters is memory bandwidth, GPU/NPU throughput, software stack, model scheduling, power, thermals, developer ecosystem, and the stratified experience across cloud / local / edge.

Intel has cards.

AMD has cards.

Qualcomm has cards.

Apple has already proven another path can work.

But the word "Wintel" still matters.

Because it is not a technical term. It is the name of the old PC order.

The old default understanding of the PC was simple:

Windows + x86 + OEM + local applications + cloud SaaS.

Users did not need to understand that combination when buying a computer. The industry arranged it for them. Intel defined the hardware rhythm. Microsoft defined the operating system. OEMs shipped the machines. NVIDIA and AMD added weight in graphics or high-performance scenarios.

That order ran for decades.

Its core question was: can this machine run Windows software smoothly?

But the core question for the next PC has changed:

Can this machine run enough local intelligence tasks at a lower cost?

These are questions from two different eras.

For the first question, Wintel was the answer.

For the second question, Wintel is not necessarily the answer.

That is what it means for the Wintel era to be over.

Not that Intel disappears.

Not that Windows disappears.

Not that x86 disappears.

But that the default understanding of "PC = Windows + x86" is no longer enough to explain the next generation of PCs.

Who Is Calling For The New PC?

Now look back at those "A new era of PC" posts, and it gets interesting.

Why is NVIDIA saying it?

Because it cannot stay only in the data center.

If all AI happens in the cloud, NVIDIA has already won one round. But if companies start pushing a large amount of inference down to local machines for token efficiency, NVIDIA must enter the PC.

It needs to bring GPU, CUDA, local inference, developer ecosystem, and AI runtime back to personal devices.

It is not just selling chips.

It is selling the intelligence foundation of the next PC.

Why is Microsoft saying it?

Because Windows is the entry point for enterprise workflows.

If AI Agents are really going to land, Windows cannot remain a shell for opening cloud services. It has to become the operating system for local intelligent workflows: able to call models, manage permissions, connect applications, understand files, and route between local and cloud.

What Microsoft fears most is not that the PC market stops growing.

It fears that the workflow entry point of the AI era no longer belongs to Windows.

Why is Arm saying it?

Because local intelligence needs performance per watt.

Apple Silicon has already taught the market that laptops can be both powerful and efficient. If the Windows camp wants to catch up in AI PCs, it cannot keep relying only on the traditional path.

This time Arm is not saying "I can run Windows."

It is saying: the hardware form of the next PC no longer has to revolve around the old architecture.

So who is still using old language to talk about new PCs?

Many traditional players are still talking about CPU, frequency, core count, benchmarks, AI TOPS, and product-line updates.

These all matter, but they are not enough.

The real question for the new era is not "are the specs stronger?" It is "is the cost structure better?"

Can this machine burn fewer tokens?

Can it keep low-value requests local?

Can it make companies pay less tax to model vendors?

Can it turn Agentic Coding from an expensive toy into everyday infrastructure?

Whoever can answer those questions deserves to talk about the new PC.

The Cloud AI Tax

Over the past two years, AI companies have told many grand narratives.

AGI, agent, copilot, automation, super intelligence.

These words are sexy.

But companies eventually face another table: the bill.

That table is not sexy, but it is real.

When all intelligence is priced through cloud tokens, AI becomes a new tax. Every step you automate, you pay a tax. Every time you let an Agent think one more round, you pay a tax. Every time you connect a workflow, you pay a tax.

This is not to say model vendors are bad.

Training models, deploying models, and maintaining inference clusters are expensive. If someone provides a capability, of course they should charge for it.

But companies cannot build their automation forever on intelligence that someone else bills per call.

Especially for high-frequency, necessary, low- and medium-complexity tasks.

Once all these tasks move to the cloud, a company's cost structure increasingly looks like working for model vendors. Business growth may not be obvious. Bill growth definitely will be.

So local LLMs are not just a technical path.

They are bargaining power.

When a company has no local intelligence capability, every request can only go to the cloud. The model vendor names the price.

When a company has local intelligence capability, it at least has a choice.

Simple tasks run locally.

Sensitive data runs locally.

High-frequency tasks run locally.

Complex tasks go to the cloud.

This is not full replacement. It is redistribution.

The greatest value of local intelligence is giving companies back part of their cost control.

That is also why the PC becomes an asset again.

In the past, buying a high-performance PC often looked like a consumer electronics upgrade.

Now, buying an AI PC that can run local models looks more like buying a small compute node. It is not for making fans spin louder, nor for making spec sheets prettier. It is for continuously reducing token spend over the next few years.

Companies will do that math.

A New Era Is Not Created By A Launch Event

Of course, saying "A new era of PC" does not mean the new era has truly arrived.

There are still many traps.

Are local models good enough?

Can Windows make the local AI runtime smooth?

Will developers adapt?

Can enterprise IT manage these local models?

How should security and permissions work?

How should model updates work?

How should local and cloud routing work?

Who schedules the NPU, GPU, and CPU?

None of these questions is simple.

And the biggest problem of the PC camp has not changed: it is not Apple.

Apple can decide chips, operating system, developer tools, hardware form factor, and user experience as one company. Windows PCs are an alliance. Microsoft, NVIDIA, Arm, Intel, AMD, Qualcomm, OEMs, and developers all want to win, and each controls only one part.

The advantage of an alliance is openness.

The disadvantage of an alliance is that nobody is fully in charge.

So whether this new PC narrative works does not depend on a launch event, or on a few tweets. It depends on whether it can hide the complexity.

Users do not care what architecture you use.

Companies do not care how many TOPS you have.

Engineers also do not want to study every day which task should use which model, runtime, or backend.

Everyone cares about one thing:

Can the work be done more cheaply?

If the answer is yes, the new PC is real.

If the answer is no, it is just another round of AI PC marketing.

The Unspoken Name After PC

So when several giants simultaneously say "A new era of PC," I do not think the point is PC.

The point is new era.

In the old era, the PC was the default combination of Windows + x86, the terminal for accessing SaaS and cloud services, the office device that kept working quietly after mobile took away the spotlight.

In the new era, the PC may become something else:

A personal AI compute node.

It does not necessarily handle the strongest reasoning, but it must handle the highest-frequency reasoning.

It does not necessarily replace cloud models, but it must reduce dependence on cloud tokens.

It does not necessarily give everyone AGI, but it must prevent companies from paying model vendors a tax on every automation.

The Wintel era is over, not because Intel has failed.

It is because the core question for the next PC is no longer "can it run Windows?"

It is:

Can it afford to run intelligence?

For the past forty years, the default answer for the PC was Wintel.

Over the next few years, that default answer will be taken apart and recombined. Windows is still here. Intel is still here. x86 is still here. OEMs are still here.

But the era in which everyone played around the same hardware rhythm is over.

So the sharp part of "A new era of PC" is not "new era."

It is the unspoken question after PC:

When tokens become expensive enough that everyone starts buying local compute again, who still gets to define the PC?