All those giga scale nvidia datacenters, time to start changing plans.
#4 | POSTED BY SITZKRIEG
This is my issue with Datacenters.
NVidia is on a 6 month timeframe releasing new chipset. Not to mention Groq, by the time your replenish your datacenter its already out dated.
Groq, or inference processing is where its at for us mortals. I forsee home "AI devices taking over. No way this datacenter thing continues unabated.
Hello that's 1 token
using one word doesn't make any sense, it implies the wrong thing; Also what happens is the RAG's and Chat upload the context (secondary, and primary context) which adds to the tokens at a nonlinear rate.
platform.openai.com
Build an android app from scratch $3.00, start making changes and it goes non-linear.
If you can do 15000 words on 10,000 tokens you've had a massive efficiency breakthrough in AI
This is a matter of training, not hardware. With it though you will get mistakes, 15000 words becomes ~20-30,000 tokens.