Home > News > AI Whistleblower Claims DeepSeek Trained on OpenAI Data

AI Whistleblower Claims DeepSeek Trained on OpenAI Data

Feb 25,25(3 months ago)
AI Whistleblower Claims DeepSeek Trained on OpenAI Data

OpenAI suspects that China's DeepSeek AI models, significantly cheaper than Western counterparts, may have been built using OpenAI's data. This revelation, following the substantial drop in Nvidia's market value (nearly $600 billion), prompted Donald Trump to call DeepSeek a "wake-up call" for the U.S. tech industry.

The emergence of DeepSeek triggered a sharp decline in AI-related stocks. Nvidia, a major player in GPU technology crucial for AI model operation, experienced a 16.86% share drop – a record on Wall Street. Other tech giants like Microsoft, Meta, and Alphabet also saw significant losses.

DeepSeek's R1 model, based on the open-source DeepSeek-V3, boasts significantly lower training costs (estimated at $6 million) compared to Western models like ChatGPT. While this claim is debated, it has raised concerns about the billions invested by American tech companies in AI, unsettling investors. DeepSeek's popularity, evidenced by its top ranking on U.S. free app download charts, further fuels this discussion.

OpenAI and Microsoft are investigating whether DeepSeek used OpenAI's API to integrate OpenAI's AI models into its own, a violation of OpenAI's terms of service. OpenAI acknowledges that Chinese companies frequently attempt to "distill" models from leading U.S. AI companies. Distillation, a technique for training AI models by extracting data from larger models, is a key point of contention.

OpenAI emphasizes its efforts to protect its intellectual property and is collaborating with the U.S. government to safeguard its technology. David Sacks, President Trump's AI czar, supports the claim that DeepSeek used distillation, a practice he expects will be addressed by leading AI companies.

The situation highlights the irony of OpenAI's position, given its own past accusations of using copyrighted internet data to train ChatGPT. Critics point to OpenAI's previous statements that creating AI models like ChatGPT without copyrighted material is "impossible," as evidenced by their submission to the UK's House of Lords and their ongoing legal battles, including a lawsuit from the New York Times for "unlawful use" of its content. Other lawsuits, such as one filed by 17 authors, further complicate the issue. The legal landscape surrounding AI training data and copyright remains highly contested.

DeepSeek is accused of using OpenAI’s model to train its competitor using distillation. Image credit: Andrey Rudakov/Bloomberg via Getty Images.

Discover
  • Denuncia Ciudadana CDMX
    Denuncia Ciudadana CDMX
    Denuncia Ciudadana CDMX is a powerful platform in Mexico City designed to empower citizens to report a variety of issues, including crime, safety concerns, and public service problems. Through an accessible website or a convenient mobile app, users can easily submit complaints or share valuable info
  • Goodnotes
    Goodnotes
    GoodNotes is a versatile note-taking application designed for iOS and macOS users, perfect for both students and professionals seeking an efficient way to manage their digital notes. The app stands out with its handwriting recognition capabilities, a wide range of customizable templates, and an asso
  • Galaxy Wearable (Samsung Gear)
    Galaxy Wearable (Samsung Gear)
    The Galaxy Wearable app, previously known as Samsung Gear, is an essential tool for managing and enhancing your experience with Samsung's range of wearable devices, including smartwatches and fitness trackers. This app facilitates a seamless connection between your wearables and Samsung smartphones,
  • EasyViewer-epub,Comic,Text,PDF
    EasyViewer-epub,Comic,Text,PDF
    EasyViewer is a versatile app designed for reading a wide range of formats, including EPUB, comics, text files, and PDFs. It offers a user-friendly interface that allows users to seamlessly navigate through their documents and enjoy features such as zooming, bookmarks, and customizable reading setti
  • SayMe - anonymous questions
    SayMe - anonymous questions
    SayMe is an innovative app designed for users who value anonymity and open dialogue. It offers a unique platform where individuals can ask and answer questions without disclosing their identities, fostering an environment of honest communication. This feature makes SayMe a go-to choice for those see
  • ShareMe: File sharing
    ShareMe: File sharing
    ShareMe is a powerful file-sharing application that revolutionizes the way you transfer files, photos, videos, and apps between devices. By eliminating the need for mobile data or Wi-Fi, ShareMe enables fast and reliable transfers across Android and iOS platforms. The app boasts an intuitive interfa