T
traeai
登录
返回首页
AI Will(@FinanceYF5)

Meta被曝非法下载80TB盗版书籍用于AI训练

8.7Score
Meta被曝非法下载80TB盗版书籍用于AI训练

TL;DR · AI 摘要

Meta被证实非法下载超80TB盗版书籍数据用于AI模型训练,与2010年Aaron Swartz事件形成强烈对比。

核心要点

  • Meta下载至少81.7TB数据,含35.7TB来自Z-Library和LibGen。
  • Aaron Swartz因下载70GB JSTOR资料被起诉,面临35年监禁。
  • Meta员工使用公司设备进行torrent下载,引发伦理争议。

结构提纲

按章节快速跳转。

  1. §Meta数据下载丑闻

    Meta被证实通过Anna's Archive等平台非法下载超过80TB盗版书籍数据用于AI模型训练。

  2. Meta至少下载了81.7TB数据,其中35.7TB来自Z-LibraryLibGen,另有80.6TB直接来自LibGen。

  3. 2010年Aaron Swartz因下载70GB JSTOR资料被起诉,面临100万美元罚款和35年监禁,最终自杀。

  4. Meta员工邮件显示其使用公司笔记本电脑进行大规模数据torrent,引发公众对数据获取合法性的质疑。

  5. 该事件暴露了大型科技公司在AI训练中依赖非法或灰色数据源的系统性风险。

思维导图

用一张图看清主题之间的关系。

查看大纲文本(无障碍 / 无 JS 友好)
  • Meta AI数据伦理争议
    • 数据获取方式
      • 非法torrent下载
      • 通过Anna's Archive等影子图书馆
    • 数据规模
      • 81.7TB总数据量
      • 35.7TB来自Z-Library和LibGen
    • 历史对照案例
      • Aaron Swartz事件(2010)
      • 70GB JSTOR下载,刑事追诉
    • 企业责任与伦理
      • 使用公司设备进行非授权下载
      • 员工邮件公开引发舆论压力

金句 / Highlights

值得收藏与分享的关键句。

#AI伦理#数据合规#Meta#开源#版权
打开原文

AI Will

@FinanceYF5

源:

Quote

Mushtaq Bilal, PhD

@MushtaqBilalPhD

May 1

Meta illegaly downloaded 80+ terabytes of books from LibGen, Anna's Archive, and Z-library to train their AI models. Aaron Swartz downloaded 70 GBs of articles from JSTOR (0.0875% of Meta) in 2010. Faced $1 million in fine and 35 years in jail. Took his own life in 2013.

Image 1: "Last month, Meta admitted to torrenting a controversial large dataset known as LibGen, which includes tens of millions of pirated books. But details around the torrenting were murky until yesterday, when Meta's unredacted emails were made public for the first time. The new evidence showed that Meta torrented "at least 81.7 terabytes of data across multiple shadow libraries through the site Anna’s Archive, including at least 35.7 terabytes of data from Z-Library and LibGen," the authors' court filing said. And "Meta also previously torrented 80.6 terabytes of data from LibGen.""

read image description

Image 2: “Torrenting from a corporate laptop doesn’t feel right”: Meta emails unsealed

read image description

Image 3: A photo of Aaron Swartz (1986-2013) when he was 19.

read image description

5:30 AM · May 8, 2026

6,359 Views

AI 可能会生成不准确的信息,请核实重要内容