The venture capitalist and new Trump administration member David Sacks, meanwhile, said that there is "substantial evidence" that DeepSeek "distilled the knowledge out of OpenAI's models."

"There's a technique in AI called distillation, which you're going to hear a lot about, and it's when one model learns from another model, effectively what happens is that the student model asks the parent model a lot of questions, just like a human would learn, but AIs can do this asking millions of questions, and they can essentially mimic the reasoning process they learn from the parent model and they can kind of suck the knowledge of the parent model," Sacks told Fox News. "There's substantial evidence that what DeepSeek did here is they distilled the knowledge out of OpenAI's models and I don't think OpenAI is very happy about this."

This sounds like horse shit to me but I don't know the technical details well enough to say with confidence.

Also, "suck the knowledge out of the parent model"? What the actual fuck?

  • Gustephan@lemmy.world · 2 days ago

    I don't know enough to say whether this is valid or just crybaby tech bros having a fit on Fox News, but like… God, I hope DeepSeek really is completely stolen like this, and I hope there's absolutely nothing ClosedAI can do about the fact that there's a better thief out there on the market. Fuck them so hard, and fuck their hypocrisy about stealing data. Maybe we can finally move away from trying to use a double-digit percentage of national electric grid capacity to power a fucking glorified magic 8-ball.

  • humanspiral@lemmy.ca · 2 days ago

    They don't have any evidence. They say someone "hammered their API," and that they terminated that license (last year), but they don't know who it was. China-bashing is not going to depend on actual evidence.

    All that matters, in the end, is "customer prices," not our devoted love for Sam Altman.

  • NextElephant9@awful.systems · 2 days ago

    Knowledge distillation is training a smaller model to mimic the outputs of a larger model. You don't need the same training set that was used to train the larger model (the whole internet, or whatever they used for ChatGPT); you can use a separate transfer set instead. There's a minimal code sketch after the reference below.

    Here's a reference: Hinton, Geoffrey, Oriol Vinyals, and Jeff Dean. "Distilling the Knowledge in a Neural Network." arXiv preprint arXiv:1503.02531 (2015). https://arxiv.org/pdf/1503.02531
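
    To make that concrete, here's a minimal PyTorch sketch of the Hinton-style recipe, not DeepSeek's actual pipeline; the toy model sizes, the temperature, and the random "transfer set" are all placeholders for illustration:

        # Knowledge distillation sketch: a small "student" is trained to match the
        # softened output distribution of a larger, already-trained "teacher" on a
        # transfer set. Everything below is illustrative (untrained toy networks,
        # random inputs); a real setup would load a trained teacher and real data.
        import torch
        import torch.nn as nn
        import torch.nn.functional as F

        teacher = nn.Sequential(nn.Linear(784, 1200), nn.ReLU(), nn.Linear(1200, 10))
        student = nn.Sequential(nn.Linear(784, 30), nn.ReLU(), nn.Linear(30, 10))

        T = 4.0  # temperature: softens the teacher's probabilities so the student sees more than the top-1 answer
        optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

        def distillation_loss(student_logits, teacher_logits, temperature):
            # KL divergence between the softened teacher and student distributions.
            soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
            log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
            # The T^2 factor keeps gradient magnitudes comparable across temperatures.
            return F.kl_div(log_soft_student, soft_teacher, reduction="batchmean") * temperature ** 2

        # Transfer set: unlabeled inputs only; the teacher's outputs are the targets.
        transfer_batch = torch.randn(64, 784)

        teacher.eval()
        with torch.no_grad():
            teacher_logits = teacher(transfer_batch)

        student_logits = student(transfer_batch)
        loss = distillation_loss(student_logits, teacher_logits, T)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        print(f"distillation loss: {loss.item():.4f}")

    The temperature is the key trick: it exposes the teacher's full output distribution (the relative probabilities it assigns to wrong answers) rather than just its hard predictions, which is where most of the transferred "knowledge" lives.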