A shadow industry of data middlemen is turning publishers’ work into fuel for AI agents—and forcing media companies to rethink what ‘outputs’ really mean
There are several dimensions to the ongoing legal war between the media industry and AI companies over copyright, and one of the major ones is the question of outputs. Which is to say: Scraping content without permission may be detestable, but if the party doing the scraping isn’t doing anything with it that would compete with the content creator, it’s difficult to prove harm. And many legal proceedings, especially civil claims, depend on showing the actions were harmful.
