{"id":563,"date":"2025-01-29T20:45:00","date_gmt":"2025-01-29T21:45:00","guid":{"rendered":"http:\/\/asian-idol.com\/?p=563"},"modified":"2025-02-20T14:55:47","modified_gmt":"2025-02-20T14:55:47","slug":"deepseek-is-bad-for-silicon-valley-but-it-might-be-great-for-you","status":"publish","type":"post","link":"http:\/\/asian-idol.com\/index.php\/2025\/01\/29\/deepseek-is-bad-for-silicon-valley-but-it-might-be-great-for-you\/","title":{"rendered":"DeepSeek is bad for Silicon Valley. But it might be great for you."},"content":{"rendered":"
\n

\"A

OpenAI CEO Sam Altman speaks to the media as he arrives at the Sun Valley Lodge for a media finance conference on July 11, 2023.<\/figcaption><\/figure>\n

When it comes to AI, I\u2019d consider myself a casual user and a curious one. It\u2019s been creeping into my daily life for a couple of years, and at the very least, AI chatbots can be good at making drudgery slightly less drudgerous<\/a>. <\/p>\n

But whenever I start to feel convinced that tools like ChatGPT and Claude can actually make my life better, I seem to hit a paywall, because the most advanced and arguably most useful tools require a subscription. Then came DeepSeek.<\/p>\n

The Chinese startup DeepSeek sunk the stock prices<\/a> of several major tech companies on Monday after it released a new open-source model that can reason on the cheap: DeepSeek-R1. The company says R1\u2019s performance matches OpenAI\u2019s initial \u201creasoning\u201d model, o1<\/a>, and it does so using a fraction of the resources. It also cost a lot less to use. That adds up to an advanced AI model that\u2019s free to the public and a bargain to developers who want to build apps on top of it.<\/p>\n

While OpenAI, Anthropic, Google, Meta, and Microsoft have collectively spent billions of dollars training their models<\/a>, DeepSeek claims it spent less than $6 million<\/a> on using the equipment to train R1\u2019s predecessor, DeepSeek-V3. (Disclosure: Vox Media is one of several publishers that has signed partnership agreements with OpenAI. Our reporting remains editorially independent.)<\/p>\n

To get unlimited access to OpenAI\u2019s o1, you\u2019ll need a pro account, which costs $200 a month<\/a>. DeepSeek does charge companies for access to its application programming interface (API), which allows apps to talk to each other<\/a> and helps developers bake AI models into their apps. <\/strong>But what DeepSeek charges for API access is a tiny fraction of the cost that OpenAI charges for access to o1<\/a>. So it might not come as a surprise that, as of Wednesday morning, DeepSeek wasn\u2019t just the most popular AI app in the Apple and Google app stores. It was the most popular app<\/a>, period.<\/p>\n

\u201cThe main reason people are very excited about DeepSeek is not because it’s way better than any of the other models,\u201d said Leandro von Werra<\/a>, head of research at the AI platform Hugging Face. \u201cIt’s more that it’s an open model, and coming from a place where people didn’t expect it to come from.\u201d<\/p>\n

So as Silicon Valley and Washington pondered the geopolitical implications of what\u2019s been called a \u201cSputnik moment\u201d for AI<\/a>, I\u2019ve been fixated on the promise that AI tools can be both powerful and cheap. And on top of that, I imagined how a future powered by artificially intelligent software could be built on the same open-source principles that brought us things like Linux and the World Web Web. <\/p>\n

This could be wishful thinking and a little bit naive. After all, OpenAI was originally founded as a nonprofit company with the mission to create AI that would serve the entire world, regardless of financial return. That\u2019s no longer the case<\/a>. <\/p>\n

But this is why DeepSeek\u2019s explosive entrance into the global AI arena could make my wishful thinking a bit more realistic. While my own experiments with the R1 model showed a chatbot that basically acts like other chatbots \u2014 while walking you through its reasoning, which is interesting \u2014 the real value is that it points toward a future of AI that is, at least partially, open source. It indicates that even the most advanced AI capabilities don\u2019t need to cost billions of dollars to build \u2014 or be built by trillion-dollar Silicon Valley companies. That means more companies could be competing to build more interesting applications for AI. <\/p>\n

And while American tech companies have spent billions trying to get ahead in the AI arms race<\/a>, <\/strong>DeepSeek\u2019s sudden popularity also shows that while it is heating up<\/a>, the digital cold war between the US and China doesn\u2019t have to be a zero-sum game.<\/p>\n

DeepSeek\u2019s unconventional, almost-open-source approach<\/strong><\/h2>\n

While you may not have heard of DeepSeek until this week, the company\u2019s work caught the attention of the AI research world a few years ago. The company actually grew out of High-Flyer, a China-based hedge fund founded in 2016 by engineer Liang Wenfeng. High-Flyer found great success using AI to anticipate movement in the stock market<\/a>. That, however, prompted a crackdown on what Beijing deemed to be speculative trading, so in 2023, Liang spun off his company’s research division into DeepSeek, a company focused on advanced AI research.<\/p>\n

From the outset, DeepSeek set itself apart by building powerful open-source models cheaply and offering developers access for cheap. In the software world, open source means that the code can be used, modified, and distributed by anyone. In the context of AI<\/a>, that applies to the entire system, including its training data, licenses, and other components. Thanks to DeepSeek’s open-source approach, anyone can download its models, tweak them, and even run them on local servers. <\/p>\n

The major US players in the AI race \u2014 OpenAI, Google, Anthropic, Microsoft \u2014 have closed models built on proprietary data and guarded as trade secrets. Meta has set itself apart by releasing open models. Conventional wisdom suggested that open models lagged behind closed models<\/a> by a year or so. DeepSeek apparently just shattered that notion.<\/p>\n

\"\"<\/p>\n

DeepSeek\u2019s models are not, however, truly open source. They\u2019re what\u2019s known as open-weight AI models<\/a>. That means the data that allows the model to generate content, also known as the model\u2019s weights, is public, but the company hasn\u2019t released its training data or code<\/a>. Von Werra, of Hugging Face, is working on a project to fully reproduce DeepSeek-R1<\/a>, including its data and training pipelines. One of the goals is to figure out how exactly DeepSeek managed to pull off such advanced reasoning with far fewer resources than competitors, like OpenAI, and then release those findings to the public to give open-source AI development another leg up. <\/p>\n

\u201cIf more people have access to open models, more people will build on top of it,\u201d von Werra said.<\/p>\n

Still, we already know a lot more about how DeepSeek\u2019s model works than we do about OpenAI\u2019s. DeepSeek published a detailed technical report on R1 under an MIT License, which gives permission to reuse, modify, or distribute the software. A similar technical report on the V3 model<\/a> released in December says that it was trained on 2,000 NVIDIA H800 chips versus the 16,000 or so integrated circuits competing models needed for training. Training took 55 days and cost $5.6 million, according to DeepSeek, while the cost of training Meta\u2019s latest open-source model, Llama 3.1, is estimated to be anywhere from about $100 million<\/a> to $640 million<\/a>. But because Meta does not share all components of its models, including training data, some do not consider Llama to be truly open source<\/a>.<\/p>\n

When it comes to performance, there\u2019s little doubt that DeepSeek-R1 delivers impressive results that rival its most expensive competitors. A comparison of models from Artificial Analysis shows that R1 is second only to OpenAI\u2019s o1 in reasoning and artificial analysis<\/a>. It actually slightly outperforms o1 in terms of quantitative reasoning and coding. The big tradeoff appears to be speed. DeepSeek is kind of slow, and you\u2019ll notice it if you use R1 in the app or on the web. It does show you what it\u2019s thinking as it\u2019s thinking, though, which is kind of neat.<\/p>\n

Now, the number of chips used or dollars spent on computing power are super important metrics in the AI industry, but they don\u2019t mean much to the average user. The most basic versions of ChatGPT, the model that put OpenAI on the map, and Claude, Anthropic\u2019s chatbot, are powerful enough for a lot of people, and they\u2019re free. They can summarize stuff<\/a>, help you plan a vacation<\/a>, and help you search the web<\/a> with varying results. But chatbots are far from the coolest thing AI can do. <\/p>\n

The challenge to America\u2019s global AI supremacy<\/strong><\/h2>\n

What\u2019s most exciting about DeepSeek and its more open approach is how it will make it cheaper and easier to build AI into stuff. This is a huge deal for developers trying to create killer apps as well as scientists trying to make breakthrough discoveries<\/a>. It\u2019s also a huge challenge to the Silicon Valley establishment, which has poured billions of dollars into companies like OpenAI with the understanding that the massive capital expenditures would be necessary to lead the burgeoning global AI industry.<\/p>\n

It\u2019s not an understatement to say that DeepSeek is shaking the AI industry to its very core. The stock market\u2019s reaction to the arrival of DeepSeek-R1\u2019s arrival wiped out nearly $1 trillion in value from tech stocks<\/a> and reversed two years of seemingly neverending gains for companies propping up the AI industry, including most prominently NVIDIA, whose chips were used to train DeepSeek\u2019s models. <\/p>\n

It also indicated that the Biden administration\u2019s moves to curb chip exports<\/a> in an effort to slow China\u2019s progress in AI innovation may not have had the desired effect. Joe Biden started blocking exports of advanced AI chips to China in 2022 and expanded those efforts just before Trump took office. However, China\u2019s AI industry has continued to advance apace its US rivals. DeepSeek is joined by Chinese tech giants like Alibaba, Baidu, ByteDance, and Tencent, who have also continued to roll out powerful AI tools<\/a>, despite the embargo. <\/p>\n

What this means for the future of America\u2019s quest for AI dominance is up for debate. President Donald Trump praised DeepSeek\u2019s ability<\/a> to come up \u201cwith a faster method of AI and much less expensive method.\u201d He added, \u201cThe release of DeepSeek, AI from a Chinese company should be a wakeup call for our industries that we need to be laser-focused on competing to win.\u201d<\/p>\n

But we\u2019re far too early in this race to have any idea who will ultimately take home the gold. \u201cThis is like being in the late 1990s or even right around the year 2000 and trying to predict who would be the leading tech companies, or the leading internet companies in 20 years,\u201d said Jennifer Huddleston<\/a>, a senior fellow at the Cato Institute.<\/p>\n

What is clear is that the competitors are aiming for the same finish line. Liang said in a July 2024 interview with Chinese tech outlet 36kr<\/a> that, like OpenAI, his company wants to achieve general artificial intelligence and would keep its models open going forward. He added, \u201cOpenAI is not a god.\u201d Liang\u2019s goals line up with those of Sam Altman and OpenAI, which has cast doubt on DeepSeek\u2019s recent success. Microsoft and OpenAI are reportedly investigating whether DeepSeek used ChatGPT output<\/a> to train its models, an allegation that David Sacks, the newly appointed White House AI and crypto czar, repeated<\/a> this week. <\/p>\n

\"\"<\/p>\n

There is, of course, the chance that this all goes the way of TikTok, another Chinese company that challenged US tech supremacy. It was originally Trump who cited national security concerns as a reason to ban the app, which is owned by ByteDance. Congress and the Biden administration took up the mantle, and now TikTok is banned, pending the app\u2019s sale<\/a> to an American company. <\/p>\n

DeepSeek uses ByteDance as a cloud provider and hosts American user data on Chinese servers<\/a>, which is what got TikTok in trouble years ago. The concern here is that the Chinese government could access that data and threaten US national security<\/a>. DeepSeek also says in its privacy policy that it can use this data to \u201creview, improve, and develop the service,\u201d which is not an unusual thing to find in any privacy policy.<\/p>\n

Unsurprisingly, DeepSeek does abide by China\u2019s censorship laws, which means its chatbot will not give you any information about the Tiananmen Square massacre<\/a>, among other censored subjects<\/a>. But it\u2019s not yet clear<\/a> that Beijing is using the popular new tool to ramp up surveillance on Americans. At least, it\u2019s not doing so any more than companies like Google and Apple already do, according to Sean O\u2019Brien<\/a>, founder of the Yale Privacy Lab, who recently did some network analysis of DeepSeek\u2019s app<\/a>.<\/p>\n

\u201cFrom a privacy standpoint, people need to understand that most mainstream apps are spying on them, and this is no different,\u201d O\u2019Brien told me. \u201cIt’s just a question of who’s doing the spying.\u201d <\/p>\n

Which brings us back to that paywall question. There\u2019s an old adage that if something online is free on the internet, you\u2019re the product<\/a>. So while it\u2019s exciting and even admirable that DeepSeek is building powerful AI models and offering them up to the public for free, it makes you wonder what the company has planned for the future.<\/p>\n

In the meantime, you can expect more surprises on the AI front. You might even be able to tinker with these surprises, too. OpenAI recently rolled out its Operator agent, which can effectively use a computer on your behalf<\/a> \u2014 if you pay $200 for the pro subscription. This week, people started sharing code<\/a> that can do the same thing with DeepSeek for free.<\/p>\n

A version of this story was also published in the Vox Technology newsletter.\u00a0<\/em>Sign up here<\/a><\/em><\/strong>\u00a0so you don\u2019t miss the next one!<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"

OpenAI CEO Sam Altman speaks to the media as he arrives at the Sun Valley Lodge for a media finance conference on July 11, 2023. When it comes to AI, I\u2019d consider myself a casual user and a curious one. It\u2019s been creeping into my daily life for a couple of years, and at the […]<\/p>\n","protected":false},"author":1,"featured_media":565,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[15],"tags":[],"class_list":["post-563","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-innovation"],"_links":{"self":[{"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/posts\/563","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/comments?post=563"}],"version-history":[{"count":3,"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/posts\/563\/revisions"}],"predecessor-version":[{"id":569,"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/posts\/563\/revisions\/569"}],"wp:featuredmedia":[{"embeddable":true,"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/media\/565"}],"wp:attachment":[{"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/media?parent=563"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/categories?post=563"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/asian-idol.com\/index.php\/wp-json\/wp\/v2\/tags?post=563"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}