ai:generalinfo
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
ai:generalinfo [2024/07/12 21:47] – [Terminology] Wulf Rajek | ai:generalinfo [2024/07/23 19:56] (current) – [Hardware] Wulf Rajek | ||
---|---|---|---|
Line 216: | Line 216: | ||
GPT-3 175B model: Microsoft built a supercomputer with 285,000 CPU codes and 10,000 Nvidia V100 GPUs [[https:// | GPT-3 175B model: Microsoft built a supercomputer with 285,000 CPU codes and 10,000 Nvidia V100 GPUs [[https:// | ||
+ | Llama 3.1 used 16,000 Nvidia H100 GPUs to train the [[https:// | ||
+ | ===== Evaluation ===== | ||
+ | |||
+ | https:// |
ai/generalinfo.1720817229.txt.gz · Last modified: by Wulf Rajek