Andreessen Horowitz general partner and Mistral board member Anjney “Anj” Midha first spied DeepSeek’s jaw-dropping performance six months ago, he tells TechCrunch.
That’s when DeepSeek introduced Coder V2, which rivaled OpenAI’s GPT4-Turbo for coding-specific tasks, according to a paper it released last year. This put DeepSeek on a path to release improved models every couple of months right through R1, he said. R1 is its new open source reasoning model that has upended the tech industry for offering industry standard performance at a fraction of the cost.
Despite the sell-off of Nvidia’s stock, Midha says R1 doesn’t mean that AI foundational models will stop spending billions to gobble GPU chips and build more data centers as fast as they can.
It means they will do more with the compute power they can obtain.
“When people are like, okay
Continue Reading on TechCrunch
This preview shows approximately 15% of the article. Read the full story on the publisher's website to support quality journalism.