The post Reddit Sues Perplexity AI, Alleging ‘Industrial-Scale’ Data Theft appeared on BitcoinEthereumNews.com. In brief Social media platform Reddit has sued Perplexity AI, accusing the firm of an “industrial-scale” scheme to scrape its user-generated content. Reddit alleges billions of search pages were scraped through tools that bypassed its and Google’s protections. The lawsuit names Perplexity, SerpApi, Oxylabs, and AWM Proxy as defendants. Social media platform Reddit has sued Perplexity AI in federal court on Wednesday, alleging that the artificial intelligence company and its data partners orchestrated an “ industrial-scale” scheme to scrape the platform’s user-generated content. Reddit alleges that the other defendants: SerpApi, Oxylabs, and AWM Proxy, developed and sold tools specifically designed to break security measures protecting its content, enabling the large-scale scraping of Reddit data from search results. The tools were allegedly built with the intention of bypassing two layers of protection: first, by evading Reddit’s own anti-scraping systems, and second, by circumventing Google’s controls to extract Reddit content directly from its search engine results. The data companies operated as “data-scraping service providers” and “circumvented Google’s technological control measures and automatedly accessed, without authorization, almost three billion search engine results pages,” a copy of the lawsuit reads. Reddit claims Perplexity used data from the three firms for its answer engine even after receiving a cease-and-desist letter in May 2024. A representative from Perplexity responded and shared a full response, posted on Reddit. Perplexity intentionally posted its response on Reddit “to illustrate a simple point: it’s a public Reddit link accessible to anyone, yet by the logic of Reddit’s lawsuit, if you refer to it in any way, they just might sue you too,” the representative told Decrypt. Perplexity described the lawsuit as “a sad example of what happens when public data becomes a big part of a public company’s business model.” “Reddit thinks that’s their right. But it is the opposite… The post Reddit Sues Perplexity AI, Alleging ‘Industrial-Scale’ Data Theft appeared on BitcoinEthereumNews.com. In brief Social media platform Reddit has sued Perplexity AI, accusing the firm of an “industrial-scale” scheme to scrape its user-generated content. Reddit alleges billions of search pages were scraped through tools that bypassed its and Google’s protections. The lawsuit names Perplexity, SerpApi, Oxylabs, and AWM Proxy as defendants. Social media platform Reddit has sued Perplexity AI in federal court on Wednesday, alleging that the artificial intelligence company and its data partners orchestrated an “ industrial-scale” scheme to scrape the platform’s user-generated content. Reddit alleges that the other defendants: SerpApi, Oxylabs, and AWM Proxy, developed and sold tools specifically designed to break security measures protecting its content, enabling the large-scale scraping of Reddit data from search results. The tools were allegedly built with the intention of bypassing two layers of protection: first, by evading Reddit’s own anti-scraping systems, and second, by circumventing Google’s controls to extract Reddit content directly from its search engine results. The data companies operated as “data-scraping service providers” and “circumvented Google’s technological control measures and automatedly accessed, without authorization, almost three billion search engine results pages,” a copy of the lawsuit reads. Reddit claims Perplexity used data from the three firms for its answer engine even after receiving a cease-and-desist letter in May 2024. A representative from Perplexity responded and shared a full response, posted on Reddit. Perplexity intentionally posted its response on Reddit “to illustrate a simple point: it’s a public Reddit link accessible to anyone, yet by the logic of Reddit’s lawsuit, if you refer to it in any way, they just might sue you too,” the representative told Decrypt. Perplexity described the lawsuit as “a sad example of what happens when public data becomes a big part of a public company’s business model.” “Reddit thinks that’s their right. But it is the opposite…

Reddit Sues Perplexity AI, Alleging ‘Industrial-Scale’ Data Theft

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

In brief

  • Social media platform Reddit has sued Perplexity AI, accusing the firm of an “industrial-scale” scheme to scrape its user-generated content.
  • Reddit alleges billions of search pages were scraped through tools that bypassed its and Google’s protections.
  • The lawsuit names Perplexity, SerpApi, Oxylabs, and AWM Proxy as defendants.

Social media platform Reddit has sued Perplexity AI in federal court on Wednesday, alleging that the artificial intelligence company and its data partners orchestrated an “ industrial-scale” scheme to scrape the platform’s user-generated content.

Reddit alleges that the other defendants: SerpApi, Oxylabs, and AWM Proxy, developed and sold tools specifically designed to break security measures protecting its content, enabling the large-scale scraping of Reddit data from search results.

The tools were allegedly built with the intention of bypassing two layers of protection: first, by evading Reddit’s own anti-scraping systems, and second, by circumventing Google’s controls to extract Reddit content directly from its search engine results.

The data companies operated as “data-scraping service providers” and “circumvented Google’s technological control measures and automatedly accessed, without authorization, almost three billion search engine results pages,” a copy of the lawsuit reads.

Reddit claims Perplexity used data from the three firms for its answer engine even after receiving a cease-and-desist letter in May 2024.

A representative from Perplexity responded and shared a full response, posted on Reddit.

Perplexity intentionally posted its response on Reddit “to illustrate a simple point: it’s a public Reddit link accessible to anyone, yet by the logic of Reddit’s lawsuit, if you refer to it in any way, they just might sue you too,” the representative told Decrypt.

Perplexity described the lawsuit as “a sad example of what happens when public data becomes a big part of a public company’s business model.”

“Reddit thinks that’s their right. But it is the opposite of an open internet,” Perplexity stated.

A representative from SerpApi told Decrypt they did not receive “any communication or service from Reddit” on the matter, adding that they “strongly disagree with Reddit’s allegations” and intend to seek legal recourse.

“No company should claim ownership of public data that does not belong to them. It is possible that it is just an attempt to sell the same public data at an inflated price,” Denas Grybauskas, chief governance and strategy officer at Oxylabs, told Decrypt in an emailed statement.

Reddit similarly “made no attempt to speak” with Oxylabs, Grybauskas said.

Decrypt has reached out to Reddit, Google, and AWM Proxy for comment and will update this article should they respond.

A legal tangle

In cases like this, courts would need to look first at whether the terms of service from platforms like Reddit “explicitly addresses AI training, data scraping, and commercial use,” Andrew Rossow, public affairs attorney and director of strategic partnerships at video search and content intelligence platform Oriane, told Decrypt.

If a user agreed to terms that “grant the platform a broad, perpetual, royalty-free license to their content,” that license “generally governs the relationship between the user and the platform,” Rossow explained.

But it doesn’t “automatically grant the AI company a license” to do the same, unless the terms permitted the platform “to sublicense or sell the data for that purpose,” he added.

Courts would then have to “distinguish between the user’s copyright in their expression (the text of the post) and the use of the content for data mining (extracting patterns, facts, and language models),” he explained.

Still, the supposed “knowledge” behind an LLM (large-language model) “is the product of millions of users’ time, effort, and creative expression,” Rossow argued.

“Treating this human-generated content as a free, raw, undifferentiated resource is a form of labor exploitation that devalues online contributions,” Rossow opined, adding that AI companies need to “respect digital citizenship and community norms,” given how these are “the implicit and explicit rules of the digital public spaces they ingest.”

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

Source: https://decrypt.co/345613/reddit-sues-perplexity-ai-alleging-industrial-scale-data-theft

Market Opportunity
null Logo
null Price(null)
--
----
USD
null (null) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Unlimit Appoints Irene Skrynova as CEO, Global Payments

Unlimit Appoints Irene Skrynova as CEO, Global Payments

Unlimit announced the appointment of Irene Skrynova as CEO, Global Payments, as the company accelerates its evolution into a global financial infrastructure platform
Share
ffnews2026/03/12 18:17
Analyst Predicts ‘Uptober’ Rally for BTC Regardless of FOMC Decision

Analyst Predicts ‘Uptober’ Rally for BTC Regardless of FOMC Decision

The post Analyst Predicts ‘Uptober’ Rally for BTC Regardless of FOMC Decision appeared on BitcoinEthereumNews.com. Bitcoin traded at $116,236 as of 14:04 UTC on Sept. 17, up about 1% in the past 24 hours, holding above a key level as markets await the Federal Reserve’s policy announcement. Analysts’ comments Dean Crypto Trades noted on X that bitcoin is only about 7% above its post-election local peak, while the S&P 500 has risen 9% and gold has surged 36% during the same period. He said bitcoin has compressed more than those assets, making it likely to lead the next larger move, though it could form a “lower high” before extending further. He added that ether could join in once it breaks $5,000 and enters price discovery. Lark Davis pointed to bitcoin’s history around September FOMC meetings, saying every September decision since 2020 — except during the 2022 bear market — has preceded a strong rally. He stressed that the pattern is less about the Fed’s rate choice itself and more about seasonal dynamics, arguing that bitcoin tends to thrive in this period heading into “Uptober.” CoinDesk Research’s technical analysis According to CoinDesk Research’s technical analysis data model, bitcoin rose about 0.9% during the Sept. 16–17 analysis window, climbing from $115,461 to $116,520. BTC reached a session high of $117,317 at 07:00 UTC on Sept. 17 before consolidating. Following that peak, bitcoin tested the $116,400–$116,600 range multiple times, confirming it as a short-term support zone. In the final hour of the session, between 11:39 and 12:38 UTC, BTC attempted a breakout: prices moved narrowly between $116,351 and $116,376 before spiking to $116,551 at 12:34 on higher volume. This confirmed a consolidation-breakout pattern, though the gains were modest. Overall, bitcoin remains firm above $116,000, with support around $116,400 and resistance near $117,300. Latest 24-hour and one-month chart analysis The latest 24-hour CoinDesk Data chart, ending 14:04 UTC on…
Share
BitcoinEthereumNews2025/09/18 12:42
UiPath (PATH) Stock Slides 5% Despite Crushing Earnings on Every Metric

UiPath (PATH) Stock Slides 5% Despite Crushing Earnings on Every Metric

TLDR UiPath beat Q4 estimates with EPS of $0.30 vs $0.26 expected, and revenue of $481M vs $465M expected The stock fell more than 5% in premarket trading despite
Share
Coincentral2026/03/12 18:09