I disagree. Google have their tensor chips – whether they work well enough for those outside of Google is somewhat irrelevant, they clearly work for Google, who are going to be one of the major players in serving AI for the foreseeable future (I'm biased, but it seems clear to me). Microsoft have their own chips on the way, rumoured to arrive this year. Amazon have their own chips on the way, rumoured to have been in development for a number of years.
All 3 of these companies sell Nvidia hardware on their cloud offerings because it's what buyers want, but with Nvidia's pricing and supply constraints there is huge pressure on the cloud providers to push customers towards their own offerings. I don't expect any of them to stop offering Nvidia chips directly, but I'd bet that in the next few years all of their hosted value-add services will run on alternatives (i.e. hosted AI, inference, training, etc., where customers don't need to know the hardware).
Nvidia have more of a moat than OpenAI for sure, but I think Nvidia's best days are 2023/4, and that things will look very different soon.
With the pace of innovation at Google this is absolutely not guaranteed. Speaking from a historical perspective, it is much more likely that Alphabet acquires another startup or small ASIC-based company which got the interconnect part right. And even then it's not guaranteed. This Gemini thing has tried to launch like three times already, but doesn't even register as a blip in comparison with OAI. Besides, OAI is more or less MS.
I'm talking specifically about chips here. Google have been developing their tensor stuff for a while now, and it's been publicly documented as running much of their training and inference stacks. The fact that they are getting value out of it in models competitive with others suggests that the chips are basically a success, does it not?
That's fair, we don't know. I guess my point is that with all 3 of the main cloud providers running their own chip programs, and with Google having a proven track record of training/serving competitive LLMs on theirs, I'm not that bullish on Nvidia at anywhere near the current prices.
I think this is most likely a temporary blip. GPUs were a bit of a commodity 5 years ago, with Nvidia, AMD, and Intel all producing reasonable stuff. Large AI accelerator chips weren't much of a market ~5 years ago, and Nvidia were first to take it, but in a few years' time they'll also be back to commodity status.
Nvidia have a small moat with CUDA, but their eye-watering prices are a huge incentive for users to try alternatives, and ultimately the current price is built on them being the only provider of GPUs with 40/80GB of memory. That's the fundamental enabling technology, and it's not particularly tricky for competitors to replicate.
Nvidia may be the "best" AI accelerator chips on the market for years to come, but being 20% better and 20% more expensive than AMD, and all the cloud providers using their own in-house chips where they can, is not a $1.8Tn company as far as I can tell, it's much more like what Nvidia were ~5 years ago.
well, you make a fair point. let me also raise the question - why were Intel, AMD and the like so slow to develop custom accelerators for ML tasks? this sounds incredibly short-sighted. i mean - the ML area has been developing steadily for at least 20 years, with vast amounts of new stuff coming since 2013 perhaps. that makes 10 years, and I believe the dev cycle for new chips is potentially around a decade.
so the question is: where are these guys' ML chips? sorry, but AVX512 is not something that provides enough juice, and apparently some smart-head at Intel decided to lock end-users out of it?
because, honestly, it was not NVidia who kicked GPUs forward, but those brave CUDA devs who actually created valuable software to run on top of them - first for crypto mining, then for LLMs and NNs in general.
honestly - i'm starting to really despise this company, even though there's a 3090 Ti in my home box. and after the most recent talk given by the CEO - fingers crossed someone comes and eats their lunch, they deserve it so much.