Disclosure: My company, Tirias Research, has consulted for IBM, Nvidia, and other companies mentioned in this article.
Growth drivers
The surge in generative AI adoption was more dramatic than even the most optimistic projections could predict. Then, a fresh spike in demand emerged in September 2024 with the launch of ChatGPT-o1, the first broadly implemented "reasoning" model. Unlike earlier versions, it didn’t just provide answers; it worked through questions, delivering more considered, logical, and nuanced responses. This kind of reasoning required significantly more behind-the-scenes "reasoning tokens" per session, leading to a sharp increase in token generation. Furthermore, user engagement levels soared. By the close of 2024, time spent generating content using generative and reasoning AI models had grown by over 22 times compared to the prior year.
Innovation also accelerated at an unprecedented pace. From the introduction of transformers in 2017 to ChatGPT-1 in 2022 and reasoning models in 2024, the timeline of breakthroughs continues to shrink. Sophisticated model architectures like mixture-of-experts (MoE) enable more efficient reasoning while keeping active parameter usage low. Open-source models such as Meta’s Llama series challenge proprietary solutions by offering lighter, faster alternatives that can run locally on laptops and smartphones. Meanwhile, optimizations like sparse attention and conditional computing are producing more efficient models like DeepSeek R1 (launched in 2025), which originally used only 37 billion active parameters per token, compared to Llama’s 405 billion or over 1 trillion in some closed models.
Token demand by the numbers
Tirias Research anticipates continued growth in user numbers, visit frequency, time spent, and AI-generated content. Additionally, with agentic APIs becoming available in 2025, AI agents will begin autonomously linking AI models together, forming thoughts, executing tasks, and interacting with other services. Human prompting will no longer be the sole catalyst for AI activity once autonomous agents start generating usage independently. Consequently, the annual rate of token generation is projected to jump from 677 trillion in 2024 to 2,092 trillion by the end of 2025 and reach 77,000 trillion (77 quadrillion) by the end of 2030.
Simon Solotko, Senior Analyst at Tirias Research, states: "The AI ecosystem is under extraordinary pressure. Multimodal capabilities, user demand, and agentic and multimedia workflows are progressing so rapidly that even improvements in compute hardware and software won’t be enough to counteract the rise in demand."
A forecast snapshot from 2028 shows that the use of AI assistants and agents will likely be dominated by a few providers. On the infrastructure side, however, API-accessed AI models are expected to power a broad array of business and consumer applications by embedding AI capabilities into customer-facing service providers.
The industry may shift toward a natural monopoly similar to Google's dominance in internet searches. With its early entry via ChatGPT and strong brand recognition, OpenAI currently leads the market in AI models and token generation. Whether OpenAI maintains its edge remains unclear.
Future Trends
Larger models will keep increasing in size and complexity, surpassing hardware advancements. The biggest models already exceed the memory capacity of any single accelerator, requiring clusters of GPUs and entire racks to process workloads. However, progress in distillation and efficiency will help scale down to smaller, specialized models. The arrival of DeepSeek marked a major leap in model efficiency, redefining performance benchmarks.
AI Agents will become widespread. Industry leaders, including Nvidia’s Jensen Huang and IBM’s Arvind Krishna, envision every employee working alongside multiple AI agents. Some agents will reside in machines, others in virtual environments, and still others in physical robots. AI agents will also start collaborating with one another.
Competition in AI will intensify. As models mature, differentiation isn't just about size or speed anymore—it now involves a broader set of factors. Services are embedding AI models into workflows, APIs, and interactive applications, driving toward full automation of tasks and entertainment. At the same time, cost constraints are pushing every player to adopt cutting-edge methods for faster training, better inference, and reduced computational costs. This competition extends beyond the enterprise—AI is now influencing global politics as nations race to innovate.
Moreover, AI will continue evolving. By the end of the decade, AI-generated images and video may surpass text as the main form of AI-generated content and the primary driver of future compute needs. A significant portion of this content could be created on edge devices. Media generation combined with autonomous AI agents and machines will usher in the next phase of AI development.
Final Thoughts
Unlike previous technology adoption cycles, generative AI shows no signs of slowing. Rapid advances in both capability and efficiency are fueling increased demand. As agentic AI expands beyond human interaction, the number of "users" of generative AI will grow exponentially.
I will explore the rising AI demand for images, video, autonomous agents, and autonomous machines, along with the global infrastructure requirements and total cost of operation (TCO) of generative AI in upcoming articles.
The above is the detailed content of The Unstoppable Growth Of Generative AI (AI Outlook Part 1). For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undress AI Tool
Undress images for free

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

Google’s NotebookLM is a smart AI note-taking tool powered by Gemini 2.5, which excels at summarizing documents. However, it still has limitations in tool use, like source caps, cloud dependence, and the recent “Discover” feature

Here are ten compelling trends reshaping the enterprise AI landscape.Rising Financial Commitment to LLMsOrganizations are significantly increasing their investments in LLMs, with 72% expecting their spending to rise this year. Currently, nearly 40% a

Investing is booming, but capital alone isn’t enough. With valuations rising and distinctiveness fading, investors in AI-focused venture funds must make a key decision: Buy, build, or partner to gain an edge? Here’s how to evaluate each option—and pr

Disclosure: My company, Tirias Research, has consulted for IBM, Nvidia, and other companies mentioned in this article.Growth driversThe surge in generative AI adoption was more dramatic than even the most optimistic projections could predict. Then, a

Those days are numbered, thanks to AI. Search traffic for businesses like travel site Kayak and edtech company Chegg is declining, partly because 60% of searches on sites like Google aren’t resulting in users clicking any links, according to one stud

The gap between widespread adoption and emotional preparedness reveals something essential about how humans are engaging with their growing array of digital companions. We are entering a phase of coexistence where algorithms weave into our daily live

Let’s talk about it. This analysis of an innovative AI breakthrough is part of my ongoing Forbes column coverage on the latest in AI, including identifying and explaining various impactful AI complexities (see the link here). Heading Toward AGI And

Let’s take a closer look at what I found most significant — and how Cisco might build upon its current efforts to further realize its ambitions.(Note: Cisco is an advisory client of my firm, Moor Insights & Strategy.)Focusing On Agentic AI And Cu
