Goodie

LLM Data Wars Timeline: Deals, Restrictions & Platform Power Plays (2023-2026)

Track how data access battles are reshaping AI answers, and see the key deals, restrictions, lawsuits, and platform shifts driving fragmented AI visibility.
Julia Olivas
March 9, 2026
Last Updated: February 2026

The LLM data wars didn’t start with a single lawsuit or licensing deal. They emerged gradually, as platforms realized that the data fueling AI systems was no longer just content, but lucrative, strategic infrastructure.

This timeline tracks key moments when access to LLM training data shifted: API restrictions, licensing agreements, legal enforcement, and the rise of platform-native AI.

This is a living resource: we’ll update this post as new developments occur and the landscape continues to evolve.

How to Read This Timeline

  • Each entry includes:
    • What happened
    • Why it mattered
  • Older entries are preserved for historical context; new updates are added as they occur

For a deeper analysis of why these shifts are fragmenting AI answers, see our companion piece: The LLM Data Wars: Why AI Answers Are Fragmenting.

LLM Data Wars Timeline at a Glance

Graphic timeline of the LLM data wars between 2023 and 2026.

2023: The Gates Start Closing

2024: Licensing, Lawsuits & Regulation

Graphic depicting the licensing and regulatory changes that happened in the LLM space in 2024.

2025: Consolidation & Enforcement

2026: Ongoing

  • January 2026: OpenAI announces testing of advertisements in ChatGPT for free users
  • January 2026: Havas unveils AVA, a global LLM portal offering access to GPT-5, Claude Opus 4.5, and Gemini 3 (rolling out Spring 2026)
  • January 2026: Yahoo expands Scout, its AI-powered discovery and search experience, across Yahoo properties
  • New licensing deals, lawsuits, platform AI launches, and regulatory actions will be added as they occur

2026: Key Projections & Industry Trends

By 2026, AI search has clearly moved beyond experimentation and into core commercial infrastructure. AI-driven search traffic grew 155.6% in 2025, and projections estimate it could drive as much as $750 billion in US revenue by 2028. However, that growth isn’t evenly distributed.

ChatGPT continues to dominate overall AI search traffic, accounting for roughly 84.1% of trackable volume, but that dominance varies significantly by industry. At the same time, enterprise-facing systems like Microsoft Copilot saw explosive adoption, growing 25× over the course of 2025, a sign that productivity and workflow-integrated AI is emerging as a parallel discovery layer, not a secondary one.

Adoption is also fragmenting sharply by vertical. Legal services saw roughly 15× growth in AI search, events-related use cases grew 20×, and insurance adoption surged nearly 90× year over year. These shifts reinforce a broader pattern: AI discovery is no longer general-purpose.

AI is becoming industry-shaped, with different models, data sources, and interfaces dominating different domains. Importantly, this shift isn’t just about traffic volume. AI-referred visits converted at rates 2-3× higher than traditional channels, suggesting that AI answers are shaping decisions before users ever reach owned properties.

At the same time, competitive differentiation among AI platforms increasingly hinges on licensed data depth, not just retrieval or reasoning quality. Perplexity AI expanded partnerships with financial data providers, including Benzinga, FactSet, Morningstar, and Quartr, underscoring how access to proprietary, high-trust datasets is becoming a prerequisite for relevance in certain categories.

Taken together, these trends point to a future where AI discovery is both massively influential and deeply fragmented, shaped less by a single “best model” and more by who controls the data, the interface, and the economic incentives underneath.

Graphic showing the closed intelligence loop of AI.

Why Monitoring AI Visibility Matters Now

As the LLM data wars intensify, visibility becomes harder to reason about from the outside. Access rules differ by platform. Licensing deals are often private. Model behaviors shift without public notice. And AI answers increasingly reflect which data ecosystems a brand appears in, not just how well it ranks on the open web. 

This timeline shows how quickly the ground is moving. It also highlights a deeper challenge: brands, publishers, and content teams can no longer assume they know how (or where) they’re being represented in AI answers.

In a fragmented AI landscape, situational awareness matters. Teams need to know:

  • Which AI systems surface their brand
  • How they’re being described
  • Which sources and datasets are shaping those answers
  • Where gaps or distortions are emerging over time

Tools like Goodie are designed for exactly this moment, helping teams monitor AI visibility across models and platforms as access rules, partnerships, and data flows continue to shift. Not to game the system, but to understand it.

Because when data access determines visibility, visibility itself needs to be observable.

Conclusion: What the Timeline Makes Clear

Taken together, these events show that the LLM data wars aren’t a one-time disruption, but a structural shift in how intelligence is built, governed, and monetized.

What was once open infrastructure is now negotiated access. What once powered general-purpose models is increasingly locked behind platform boundaries. As a result, AI answers are fragmenting into ecosystem-specific views shaped by licensing, policy, and economics.

This timeline will keep evolving as new deals, lawsuits, platform AI launches, and regulations emerge. But the core dynamic is already clear: control over data increasingly determines control over answers.

For brands, publishers, and product teams, the challenge is understanding how these shifts affect visibility and representation inside AI systems. As discovery moves upstream, knowing where and how you appear in AI-generated answers becomes table stakes.

That’s why tools like Goodie exist: to make AI visibility observable as the ecosystem continues to fragment.

We’ll continue updating this timeline as the data wars unfold, because in an AI-shaped web, understanding what changed and what it means is half the battle.

LLM Data Wars Timeline: FAQs

What are the LLM data wars?

The LLM data wars refer to the growing conflict over who can access, license, and control the data used to train large language models. As platforms restrict scraping, sign exclusive licensing deals, and build their own AI tools, access to training data has become a competitive and economic lever, not a given.

Why are platforms restricting AI training data now?

As AI systems became commercially valuable, platforms realized their data was strategic infrastructure. Restricting access allows platforms to:

  • Monetize data through licensing
  • Protect competitive advantage
  • Build platform-native AI experiences
  • Control how their ecosystems are interpreted by AI

How do data restrictions affect AI answers?

AI models reflect what they were allowed to learn from. When data access differs across platforms and partnerships, models develop different strengths, blind spots, and defaults. The result is fragmentation: the same question can produce different answers depending on the AI system you use.

Is this mainly a legal or technical issue?

It’s both, but the long-term impact is economic and structural. Legal actions and regulations enforce boundaries, while technical and architectural choices determine how models adapt. Together, they reshape how intelligence is built and surfaced at scale.

What does this mean for brands and publishers?

Visibility is no longer guaranteed by rankings alone. Brands and publishers can be well-known in one AI ecosystem and invisible in another, depending on where their data appears and how it’s licensed. This creates new risks around omission, misrepresentation, and lost influence in AI-driven discovery.

© Goodie 2025
All Rights Reserved