The New York Times initiated legal proceedings against Microsoft, alleging systematic use of proprietary news articles to train AI models powering Copilot and other Microsoft products. The lawsuit seeks damages and an injunction preventing further unauthorized use of Times content.
The case centers on a fundamental tension in generative AI: training on internet-scale data requires massive textual inputs, but much of the internet contains copyrighted material. Microsoft's legal defense argues 'fair use,' but the Times counters that commercial AI model training fundamentally differs from journalistic fair use principles.
This lawsuit joins similar actions by other publishers. The outcome could establish precedent for how AI companies must license or compensate for copyrighted training data—a decision with profound implications for model architectures, costs, and the future of AI development economics.