𝗗𝗲𝗲𝗽𝘀𝗲𝗲𝗸 𝗶𝘀 𝗱𝗶𝘀𝗿𝘂𝗽𝘁𝗶𝗻𝗴 𝗳𝗼𝘂𝗻𝗱𝗮𝘁𝗶𝗼𝗻𝗮𝗹 𝗔𝗜 𝗺𝗼𝗱𝗲𝗹𝘀 𝗮𝗻𝗱 𝗰𝗼𝘀𝘁 𝗶𝗺𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀𝗅 🤯
𝗖𝗼𝗺𝗽𝗶𝗹𝗮𝘁𝗶𝗼𝗻 𝗼𝗳 𝗣𝗼𝘀𝘁𝘀 𝗼𝗻 𝗗𝗲𝗲𝗽𝗦𝗲𝗲𝗸𝗅
𝗥𝗲𝗮𝗰𝘁𝗶𝗼𝗻𝘀 𝗳𝗿𝗼𝗺 𝗴𝗹𝗼𝗯𝗮𝗹 𝗔𝗜 𝗘𝘅𝗽𝗲𝗿𝘁𝘀-
Yann LeCun - To people who see the performance of DeepSeek and think: "China is surpassing the US in AI."
You are reading this wrong.
The correct reading is: "Open source models are surpassing proprietary ones."
Andrew Ng "Today's "DeepSeek selloff" in the stock market -- attributed to DeepSeek V3/R1 disrupting the tech ecosystem -- is another sign that the application layer is a great place to be. The foundation model layer being hyper-competitive is great for people building applications."
𝗪𝗵𝗮𝘁 𝗶𝘀 𝗱𝗲𝗲𝗽𝘀𝗲𝗲𝗸? 𝗘𝘃𝗲𝗿𝘆𝘁𝗵𝗶𝗻𝗴 𝗬𝗼𝘂 𝗻𝗲𝗲𝗱 𝘁𝗼 𝗸𝗻𝗼𝘄-
DeepSeek is a Chinese AI startup that has rapidly emerged as a disruptive force in the global artificial intelligence landscape. Founded in July 2023 by Liang Wenfeng (also transliterated as Li Wenf), a Zhejiang University graduate and hedge fund manager, the company developed an open-source large language model (LLM) that rivals leading U.S. models like OpenAI's GPT-4 at a fraction of the cost135. - from Perplexity
𝗞𝗲𝘆 𝗜𝗻𝗻𝗼𝘃𝗮𝘁𝗶𝗼𝗻𝘀 𝗮𝗻𝗱 𝗙𝗲𝗮𝘁𝘂𝗿𝗲𝘀:
𝗖𝗼𝘀𝘁 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝗰𝘆: The Development and training cost of deepseek is under 6 Million USD, where as OpenAI and Gemini takes tens of millions of dollars.
𝗧𝗲𝗰𝗵𝗻𝗶𝗰𝗮𝗹 𝗣𝗲𝗿𝗳𝗼𝗿𝗺𝗮𝗻𝗰𝗲: Independent benchmark tests show DeepSeek models outperforming ChatGPT-4 in mathematics, programming, and reasoning tasks.
𝗪𝗵𝘆 𝘁𝗵𝗲 𝗨𝗦 𝗦𝘁𝗼𝗰𝗸 𝗠𝗮𝗿𝗸𝗲𝘁𝘀 𝗮𝗿𝗲 𝗰𝗿𝗮𝘀𝗵𝗶𝗻𝗴:
The DeepSeek was developed using open source models - Llama from Meta, by coming up with new ideas and built them on top of other people's work despite significant restrictions on Hardware from USA. They developed these models with lower end GPU's demonstrating that unlimited hardware is not the solution. By proving high-performance models can be built cheaply and openly, it pressures Western firms to justify their massive investments while offering developing nations an accessible AI alternative.
𝗗𝗲𝗲𝗽𝗦𝗲𝗲𝗸 𝗘𝘅𝗽𝗮𝗻𝗱𝘀 𝗔𝗜 𝗣𝗼𝗿𝘁𝗳𝗼𝗹𝗶𝗼 𝘄𝗶𝘁𝗵 𝗝𝗮𝗻𝘂𝘀 𝗣𝗿𝗼-7𝗕 - Chinese AI firm DeepSeek releases new open-source multimodal model Janus Pro-7B on Hugging Face, claiming performance matching specialized models like DALL-E 3. Link:
https://seekingalpha.com/news/4398945-deepseek-releases-open-source-ai-multimodal-model-janus-pro-7b𝗗𝗲𝗲𝗽𝗦𝗲𝗲𝗸 𝗗𝗶𝘀𝗿𝘂𝗽𝘁𝘀 𝗔𝗜 𝗠𝗮𝗿𝗸𝗲𝘁 𝘄𝗶𝘁𝗵 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝘁 𝗠𝗼𝗱𝗲𝗹 - DeepSeek's R1 model demonstrates efficient AI development, matching top performers while using fewer resources and lower costs. Link:
https://venturebeat.com/ai/deepseek-r1s-bold-bet-on-reinforcement-learning-how-it-outpaced-openai-at-3-of-the-cost/