Set as Homepage - Add to Favorites

精品东京热,精品动漫无码,精品动漫一区,精品动漫一区二区,精品动漫一区二区三区,精品二三四区,精品福利导航,精品福利導航。

【trang web xxx】Meta’s Llama has memorized huge portions of Harry Potter

Meta's Llama model has memorized Harry Potter and trang web xxxthe Sorcerer's Stoneso well that it can reproduce verbatim excerpts from 42 percent of the book, according to a new study.

Researchers from Stanford, Cornell, and West Virginia University analyzed dozens of books from the now-infamous Books3 dataset, a collection of pirated books used to train Meta's Llama models. Books3 is also at the center of a copyright infringement lawsuit against Meta,Kadrey v. Meta Platforms, Inc.The study's authors say their findings could have major implications for AI companies facing similar lawsuits.

According to the research paper, the Llama 3.1 model "memorizes some books, like Harry Potterand 1984, almost entirely." Specifically, the study found that Llama 3.1 has memorized 42 percent of the first Harry Potter book so well that it can reproduce verbatim excerpts at least 50 percent of the time. Overall, Llama 3.1 could reproduce excerpts from 91 percent of the book, though not as consistently.


You May Also Like

"The extent of verbatim memorization of books from the Books3 dataset is more significant than previously described," said the paper. But the researchers also discovered that "memorization varies widely from model to model and from book to book within each model, as well as varying in different parts of individual books." For example, the study estimated that Llama 3.1 only memorized 0.13 percent of Sandman Slimby Richard Kadrey, one of the lead plaintiffs in the class action copyright suit against Meta.

So, while some of the paper's findings seem damning, don't call it a smoking gun for plaintiffs in AI copyright infringement cases.

Mashable Light Speed Want more out-of-this world tech, space and science stories? Sign up for Mashable's weekly Light Speed newsletter. By clicking Sign Me Up, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy. Thanks for signing up!

"These results give everyone in the AI copyright debate something to latch on to," wrote journalist Timothy B. Lee in his Understanding AI newsletter. "Divergent results like these could cast doubt on whether it makes sense to lump J.K. Rowling, Richard Kadrey, and thousands of other authors together in a single mass lawsuit. And that could work in Meta’s favor, since most authors lack the resources to file individual lawsuits."

Why is Llama able to reproduce some books more than others? "I suspect that the difference is because Harry Potter is a much more famous book. It's widely quoted and I'm sure that substantial excerpts from it on third-party websites found their way into the training data on the web," said James Grimmelmann, a professor of digital and information law at Cornell University, who was cited in the paper.

What this also shows, Grimmelmann said, is that "AI companies can make choices that increase or reduce memorization. It's not an inevitable feature of AI; they have control over it."

Meta and other AI companies have argued that using copyrighted works to train their models is protected under fair use, a complex legal doctrine. However, the extent of memorization could complicate those arguments.

“Yes, I do think that the likelihood that LLMs are memorizing more than previously thought changes the copyright analysis,” Robert Brauneis, a professor with the George Washington University Law School, said in an email to Mashable. He concluded that the study’s findings could ultimately weaken Meta’s fair use argument.

We asked Meta for comment on the study's findings, and we'll update this article if we receive a response.


Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.

0.3042s , 8053.2421875 kb

Copyright © 2025 Powered by 【trang web xxx】Meta’s Llama has memorized huge portions of Harry Potter,Info Circulation  

Sitemap

Top 主站蜘蛛池模板: 国产一区二区区别:特点与差异剖析 | 国产精品亚洲一区二区在线 | 成人精品三级 | 日韩精品在线观看中文字幕 | 精品亚洲成a人在线播放 | yeyecao亚洲夜夜综合久久 | 18国产精品 | 国产欧美视频一区二区 | 日韩不卡在线播放 | 91麻豆日韩精品 | 日韩在线视频线视频免费 | 无码的免费的毛片视频观看 | 无码熟妇人妻av在线影片 | 91精品免费视频在线观看 | 色妞精品av一区二区三区 | 无码人妻精品1国产婷婷 | 国产91欧美一区二区精品 | 国产毛片午夜无码专区喷水 | 国产精拍视 | 国精品无码一区二区三区在线蜜臀 | 蜜臀av无码精品人妻色欲 | 久久久久精品免视看秋霞 | 性一交一乱一交A片久久 | 亚偷熟乱区婷婷综合二区 | 亚洲精品综合久久中文字幕 | 麻豆ⅴ传媒在线播放免费观看 | 国内精品久久久久影院vr | 性xxxxxxx欧美胖老太肥肥 | 福利片免费视频在线观看 | 久久久人成影片一区二区三区 | 成人午夜精品无码区久久 | 综合丁香激情五月 | 巨大黑人极品vjdeo | 国产露脸无码A区久久 | 亚洲精品aⅴ中文字幕乱码 亚洲精品AV一二三区无码 | 久久国产热精品波多野结衣av | 热久久网站| 久久精品中文字幕首页 | 麻豆91精品91久久久的内涵 | 强奷乱码在线观看三级 | 精品久久久久久综合日本 |