
DeepSeek's significance goes beyond changing the AI game

Yan Man: When the combination of capital and computing power is no longer the sole path to technological advancement, what does this mean for entrepreneurs and developers? Everyone in the tech community should be able to foresee the implications.
This English translation is AI-generated and provided for reference only.

A year ago, during China's Spring Festival holiday, OpenAI released its video generation model Sora on February 15, 2024, local time. The seamless camera movements and near-lifelike imagery in several Sora-generated videos left China's large model industry, then still at the stage of imitating and following, shocked and pessimistic. Talk of "surrender" was rampant: investors and major companies advised entrepreneurs to abandon their illusions and focus on applications, declaring that building a large model startup was a "dead end."

Who would have thought that just a year later, during this Spring Festival, the talk would be of a Chinese large model called DeepSeek? Beyond tech circles and viral online discussion, its applications have begun to reach ordinary households, with more and more people using DeepSeek to customize diet plans, polish holiday greetings, write acrostic poems, and even tell fortunes.

So far, DeepSeek has launched three generations of models. In May last year, DeepSeek, which sits under the umbrella of the quantitative fund High-Flyer Quant (幻方量化), released DeepSeek-V2, claiming capabilities comparable to GPT-4 at only about 1% of GPT-4's price and setting off a year-long price war in China's large model market. In December, DeepSeek released its new large model DeepSeek-V3, cutting training costs to a few million dollars and earning the nickname "price butcher." The latest release, DeepSeek-R1, competes directly with OpenAI's o1. The launch of its "deep thinking" and "web search" features propelled DeepSeek to the top of the free app charts in both China and the US.

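Copyright notice: The copyright of this article belongs to FTChinese (FT中文網). No organization or individual may reproduce, copy, or otherwise use all or part of this article without permission; infringement will be pursued.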

科技曼談

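Yan Man is chief editor of the technology and industry section at FTChinese (FT中文網), responsible for planning, interviews, and editing for the site's technology coverage. Yan holds a master's degree in international journalism from Hong Kong Baptist University and has nearly ten years of frontline reporting and editing experience, much of it spent covering internet technology, and has been a contributing writer for several well-known media outlets. This column shares the author's observations from the front lines of technology, entrepreneurship, and investment, offering lively, first-hand interpretation of developments in the tech sector. Personal WeChat public account: 科技曼談 (ID: kejimantan)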
