Close Menu
  • Home
  • UNSUBSCRIBE
  • News
  • Lifestyle
  • Tech
  • Entertainment
  • Sports
  • Travel
Facebook X (Twitter) WhatsApp
Trending
  • England vs South Africa: Proteas seal 14-run victory in rain-affected T20I series opener in Cardiff | Cricket News
  • Sam Edelman Shoes for $17 & Splendid Shorts for $12
  • Looking back at Charlie Kirk’s career as a political activist
  • ‘Our hearts stopped’: Scientists find baby pterosaurs died in violent Jurassic storm 150 million years ago
  • What we know about the Charlie Kirk shooting at Utah Valley University
  • Preorder new Apple products on Amazon: AirPods Pro 3, Apple Watch Series 11, and more
  • Hillsin: Trainer Chris Honour wanted to make ‘good impression’ says BHA prosecutor | Racing News
  • Charlie Kirk Shot at Utah Speaking Event
Facebook X (Twitter) WhatsApp
Baynard Media
  • Home
  • UNSUBSCRIBE
  • News
  • Lifestyle
  • Tech
  • Entertainment
  • Sports
  • Travel
Baynard Media
Home»Tech»DeepSeek R1: Why AI experts think it’s so special
Tech

DeepSeek R1: Why AI experts think it’s so special

EditorBy EditorJanuary 28, 2025No Comments4 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Share
Facebook Twitter LinkedIn Pinterest Email

All of a sudden, DeepSeek is everywhere.

Its R1 model is open source, allegedly trained for a fraction of the cost of other AI models, and is just as good, if not better than ChatGPT.

This lethal combination hit Wall Street hard, causing tech stocks to tumble, and making investors question how much money is needed to develop good AI models. DeepSeek engineers claim R1 was trained on 2,788 GPUs which cost around $6 million, compared to OpenAI’s GPT-4 which reportedly cost $100 million to train.

DeepSeek’s cost efficiency also challenges the idea that larger models and more data leads to better performance. Amidst the frenzied conversation about DeepSeek’s capabilities, its threat to AI companies like OpenAI, and spooked investors, it can be hard to make sense of what’s going on. But AI experts with veteran experience have weighed in with valuable perspectives.

DeepSeek proves what AI experts have been saying for years: bigger isn’t better

Hampered by trade restrictions and access to Nvidia GPUs, China-based DeepSeek had to get creative in developing and training R1. That they were able to accomplish this feat for only $6 million (which isn’t a lot of money in AI terms) was a revelation to investors.

But AI experts weren’t surprised. “At Google, I asked why they were fixated on building THE LARGEST model. Why are you going for size? What function are you trying to achieve? Why is the thing you were upset about that you didn’t have THE LARGEST model? They responded by firing me,” posted Timnit Gebru, who was famously terminated from Google for calling out AI bias, on X.

Mashable Light Speed


Tweet may have been deleted

Hugging Face‘s climate and AI lead Sasha Luccioni pointed out how AI investment is precariously built on marketing and hype. “It’s wild that hinting that a single (high-performing) LLM is able to achieve that performance without brute-forcing the shit out of thousands of GPUs is enough to cause this,” said Luccioni.


Tweet may have been deleted

Clarifying why DeepSeek R1 is such a big deal

DeepSeek R1 performed comparably to OpenAI o1 model on key benchmarks. It marginally surpassed, equaled, or fell just below o1 on math, coding, and general knowledge tests. That’s to say, there are other models out there, like Anthropic Claude, Google Gemini, and Meta’s open source model Llama that are just as capable to the average user.

But R1 causing such a frenzy because of how little it cost to make. “It’s not smarter than earlier models, just trained more cheaply,” said AI research scientist Gary Marcus.


Tweet may have been deleted

The fact that DeepSeek was able to build a model that competes with OpenAI’s models is pretty remarkable. Andrej Karpathy who co-founded OpenAI, posted on X, “Does this mean you don’t need large GPU clusters for frontier LLMs? No, but you have to ensure that you’re not wasteful with what you have, and this looks like a nice demonstration that there’s still a lot to get through with both data and algorithms.”


Tweet may have been deleted

Wharton AI professor Ethan Mollick said it’s not about it’s capabilities, but models that people currently have access to. “DeepSeek is a really good model, but it is not generally a better model than o1 or Claude” he said. “But since it is both free and getting a ton of attention, I think a lot of people who were using free ‘mini’ models are being exposed to what a early 2025 reasoner AI can do and are surprised.”


Tweet may have been deleted

Score one for open source AI models

DeepSeek R1 breakout is a huge win for open source proponents who argue that democratizing access to powerful AI models, ensures transparency, innovation, and healthy competition. “To people who think ‘China is surpassing the U.S. in AI,’ the correct thought is ‘open source models are surpassing closed ones,'” said Yann LeCun, chief AI scientist at Meta, which has supported open sourcing with its own Llama models.


Tweet may have been deleted

Computer scientist and AI expert Andrew Ng didn’t explicitly mention the significance of R1 being an open source model, but highlighted how the DeepSeek disruption is a boon for developers, since it allows access that is otherwise gatekept by Big Tech.

“Today’s ‘DeepSeek selloff’ in the stock market — attributed to DeepSeek V3/R1 disrupting the tech ecosystem — is another sign that the application layer is a great place to be,” said Ng. “The foundation model layer being hyper-competitive is great for people building applications.”


Tweet may have been deleted

Topics
Artificial Intelligence
DeepSeek



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleWhite House freeze on federal aid will not affect student loans, Education Department says
Next Article Why Timothée Chalamet Gained 20 Pounds for A Complete Unknown
Editor
  • Website

Related Posts

Tech

Preorder new Apple products on Amazon: AirPods Pro 3, Apple Watch Series 11, and more

September 10, 2025
Tech

Watch out for fake iPhone 17 Pro videos

September 10, 2025
Tech

How to preorder the new Apple Watch Series 11, Ultra 3, and SE 3

September 10, 2025
Add A Comment

Comments are closed.

Categories
  • Entertainment
  • Lifestyle
  • News
  • Sports
  • Tech
  • Travel
Recent Posts
  • England vs South Africa: Proteas seal 14-run victory in rain-affected T20I series opener in Cardiff | Cricket News
  • Sam Edelman Shoes for $17 & Splendid Shorts for $12
  • Looking back at Charlie Kirk’s career as a political activist
  • ‘Our hearts stopped’: Scientists find baby pterosaurs died in violent Jurassic storm 150 million years ago
  • What we know about the Charlie Kirk shooting at Utah Valley University
calendar
September 2025
M T W T F S S
1234567
891011121314
15161718192021
22232425262728
2930  
« Aug    
Recent Posts
  • England vs South Africa: Proteas seal 14-run victory in rain-affected T20I series opener in Cardiff | Cricket News
  • Sam Edelman Shoes for $17 & Splendid Shorts for $12
  • Looking back at Charlie Kirk’s career as a political activist
About

Welcome to Baynard Media, your trusted source for a diverse range of news and insights. We are committed to delivering timely, reliable, and thought-provoking content that keeps you informed
and inspired

Categories
  • Entertainment
  • Lifestyle
  • News
  • Sports
  • Tech
  • Travel
Facebook X (Twitter) Pinterest WhatsApp
  • Contact Us
  • About Us
  • Privacy Policy
  • Disclaimer
  • UNSUBSCRIBE
© 2025 copyrights reserved

Type above and press Enter to search. Press Esc to cancel.