OpenAI used over a million hours of YouTube videos to train its AI model: Report


New Delhi, Apr 7 (IANS): Sam Altman-run OpenAI transcribed more than a million hours of YouTube videos to train its AI model called GPT-4, a report has claimed.

The New York Times reported that OpenAI knew this was not legal but “believed it to be fair use”.

“OpenAI president Greg Brockman was personally involved in collecting videos that were used,” according to the report.

An OpenAI spokesperson told The Verge that the company uses “numerous sources including publicly available data and partnerships for non-public data,” to maintain its global research competitiveness.

Google, which owns YouTube, said it has “seen unconfirmed reports” of OpenAI’s activity.

“Both our robots.txt files and Terms of Service prohibit unauthorised scraping or downloading of YouTube content,” the tech giant maintained.

Last year, The Information reported for the first time that OpenAI, which is now backed by Microsoft, trained its AI models on Google-owned YouTube by scrapping its data.

OpenAI "secretly used data from the site (YouTube) to train some of its artificial intelligence models".

YouTube is the single biggest and richest source of imagery, audio and text transcripts on the web.

 

  

Top Stories


Leave a Comment

Title: OpenAI used over a million hours of YouTube videos to train its AI model: Report



You have 2000 characters left.

Disclaimer:

Please write your correct name and email address. Kindly do not post any personal, abusive, defamatory, infringing, obscene, indecent, discriminatory or unlawful or similar comments. Daijiworld.com will not be responsible for any defamatory message posted under this article.

Please note that sending false messages to insult, defame, intimidate, mislead or deceive people or to intentionally cause public disorder is punishable under law. It is obligatory on Daijiworld to provide the IP address and other details of senders of such comments, to the authority concerned upon request.

Hence, sending offensive comments using daijiworld will be purely at your own risk, and in no way will Daijiworld.com be held responsible.