ai:generalinfo
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | |||
ai:generalinfo [2024/07/13 13:57] – Wulf Rajek | ai:generalinfo [2024/07/23 19:56] (current) – [Hardware] Wulf Rajek | ||
---|---|---|---|
Line 215: | Line 215: | ||
GPT-3 175B model: Microsoft built a supercomputer with 285,000 CPU codes and 10,000 Nvidia V100 GPUs [[https:// | GPT-3 175B model: Microsoft built a supercomputer with 285,000 CPU codes and 10,000 Nvidia V100 GPUs [[https:// | ||
+ | |||
+ | Llama 3.1 used 16,000 Nvidia H100 GPUs to train the [[https:// | ||
===== Evaluation ===== | ===== Evaluation ===== | ||
https:// | https:// |
ai/generalinfo.1720875442.txt.gz · Last modified: by Wulf Rajek