๐๐ฒ๐ฒ๐ฝ๐๐ฒ๐ฒ๐ธ ๐ถ๐ ๐ฑ๐ถ๐๐ฟ๐๐ฝ๐๐ถ๐ป๐ด ๐ณ๐ผ๐๐ป๐ฑ๐ฎ๐๐ถ๐ผ๐ป๐ฎ๐น ๐๐ ๐บ๐ผ๐ฑ๐ฒ๐น๐ ๐ฎ๐ป๐ฑ ๐ฐ๐ผ๐๐ ๐ถ๐บ๐ฝ๐น๐ถ๐ฐ๐ฎ๐๐ถ๐ผ๐ป๐๐
๐คฏ
๐๐ผ๐บ๐ฝ๐ถ๐น๐ฎ๐๐ถ๐ผ๐ป ๐ผ๐ณ ๐ฃ๐ผ๐๐๐ ๐ผ๐ป ๐๐ฒ๐ฒ๐ฝ๐ฆ๐ฒ๐ฒ๐ธ๐
๐ฅ๐ฒ๐ฎ๐ฐ๐๐ถ๐ผ๐ป๐ ๐ณ๐ฟ๐ผ๐บ ๐ด๐น๐ผ๐ฏ๐ฎ๐น ๐๐ ๐๐
๐ฝ๐ฒ๐ฟ๐๐-
Yann LeCun - To people who see the performance of DeepSeek and think: "China is surpassing the US in AI."
You are reading this wrong.
The correct reading is: "Open source models are surpassing proprietary ones."
Andrew Ng "Today's "DeepSeek selloff" in the stock market -- attributed to DeepSeek V3/R1 disrupting the tech ecosystem -- is another sign that the application layer is a great place to be. The foundation model layer being hyper-competitive is great for people building applications."
๐ช๐ต๐ฎ๐ ๐ถ๐ ๐ฑ๐ฒ๐ฒ๐ฝ๐๐ฒ๐ฒ๐ธ? ๐๐๐ฒ๐ฟ๐๐๐ต๐ถ๐ป๐ด ๐ฌ๐ผ๐ ๐ป๐ฒ๐ฒ๐ฑ ๐๐ผ ๐ธ๐ป๐ผ๐-
DeepSeek is a Chinese AI startup that has rapidly emerged as a disruptive force in the global artificial intelligence landscape. Founded in July 2023 by Liang Wenfeng (also transliterated as Li Wenf), a Zhejiang University graduate and hedge fund manager, the company developed an open-source large language model (LLM) that rivals leading U.S. models like OpenAI's GPT-4 at a fraction of the cost135. - from Perplexity
๐๐ฒ๐ ๐๐ป๐ป๐ผ๐๐ฎ๐๐ถ๐ผ๐ป๐ ๐ฎ๐ป๐ฑ ๐๐ฒ๐ฎ๐๐๐ฟ๐ฒ๐:
๐๐ผ๐๐ ๐๐ณ๐ณ๐ถ๐ฐ๐ถ๐ฒ๐ป๐ฐ๐: The Development and training cost of deepseek is under 6 Million USD, where as OpenAI and Gemini takes tens of millions of dollars.
๐ง๐ฒ๐ฐ๐ต๐ป๐ถ๐ฐ๐ฎ๐น ๐ฃ๐ฒ๐ฟ๐ณ๐ผ๐ฟ๐บ๐ฎ๐ป๐ฐ๐ฒ: Independent benchmark tests show DeepSeek models outperforming ChatGPT-4 in mathematics, programming, and reasoning tasks.
๐ช๐ต๐ ๐๐ต๐ฒ ๐จ๐ฆ ๐ฆ๐๐ผ๐ฐ๐ธ ๐ ๐ฎ๐ฟ๐ธ๐ฒ๐๐ ๐ฎ๐ฟ๐ฒ ๐ฐ๐ฟ๐ฎ๐๐ต๐ถ๐ป๐ด:
The DeepSeek was developed using open source models - Llama from Meta, by coming up with new ideas and built them on top of other people's work despite significant restrictions on Hardware from USA. They developed these models with lower end GPU's demonstrating that unlimited hardware is not the solution. By proving high-performance models can be built cheaply and openly, it pressures Western firms to justify their massive investments while offering developing nations an accessible AI alternative.
๐๐ฒ๐ฒ๐ฝ๐ฆ๐ฒ๐ฒ๐ธ ๐๐
๐ฝ๐ฎ๐ป๐ฑ๐ ๐๐ ๐ฃ๐ผ๐ฟ๐๐ณ๐ผ๐น๐ถ๐ผ ๐๐ถ๐๐ต ๐๐ฎ๐ป๐๐ ๐ฃ๐ฟ๐ผ-7๐ - Chinese AI firm DeepSeek releases new open-source multimodal model Janus Pro-7B on Hugging Face, claiming performance matching specialized models like DALL-E 3. Link:
https://seekingalpha.com/news/4398945-deepseek-releases-open-source-ai-multimodal-model-janus-pro-7b๐๐ฒ๐ฒ๐ฝ๐ฆ๐ฒ๐ฒ๐ธ ๐๐ถ๐๐ฟ๐๐ฝ๐๐ ๐๐ ๐ ๐ฎ๐ฟ๐ธ๐ฒ๐ ๐๐ถ๐๐ต ๐๐ณ๐ณ๐ถ๐ฐ๐ถ๐ฒ๐ป๐ ๐ ๐ผ๐ฑ๐ฒ๐น - DeepSeek's R1 model demonstrates efficient AI development, matching top performers while using fewer resources and lower costs. Link:
https://venturebeat.com/ai/deepseek-r1s-bold-bet-on-reinforcement-learning-how-it-outpaced-openai-at-3-of-the-cost/