ai:generalinfo
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
ai:generalinfo [2024/04/25 16:01] – Wulf Rajek | ai:generalinfo [2024/07/23 19:56] (current) – [Hardware] Wulf Rajek | ||
---|---|---|---|
Line 16: | Line 16: | ||
Instruct/ | Instruct/ | ||
+ | |||
+ | Embedding models: | ||
+ | Embedding models are used to represent your documents using a sophisticated numerical representation. Embedding models take text as input, and return a long list of numbers used to capture the semantics of the text. These embedding models have been trained to represent text this way, and help enable many applications, | ||
+ | [[https:// | ||
Frameworks/ | Frameworks/ | ||
Line 32: | Line 36: | ||
- Tensors: A tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors are a generalisation of scalars and vectors; a scalar is a zero rank tensor, and a vector is a first rank tensor. The rank (or order) of a tensor is defined by the number of directions (and hence the dimensionality of the array) required to describe it. It can be thought of as a multidimensional numerical array. An example of a tensor would be 1000 video frames of 640×480 size. | - Tensors: A tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors are a generalisation of scalars and vectors; a scalar is a zero rank tensor, and a vector is a first rank tensor. The rank (or order) of a tensor is defined by the number of directions (and hence the dimensionality of the array) required to describe it. It can be thought of as a multidimensional numerical array. An example of a tensor would be 1000 video frames of 640×480 size. | ||
- Vector: Vectors are used to represent both the input data (features) and the output data (labels or | - Vector: Vectors are used to represent both the input data (features) and the output data (labels or | ||
- | - Temperature: | + | - Temperature: |
+ | - [[https:// | ||
===== Evolution of AI ===== | ===== Evolution of AI ===== | ||
Line 211: | Line 216: | ||
GPT-3 175B model: Microsoft built a supercomputer with 285,000 CPU codes and 10,000 Nvidia V100 GPUs [[https:// | GPT-3 175B model: Microsoft built a supercomputer with 285,000 CPU codes and 10,000 Nvidia V100 GPUs [[https:// | ||
+ | Llama 3.1 used 16,000 Nvidia H100 GPUs to train the [[https:// | ||
+ | |||
+ | ===== Evaluation ===== | ||
+ | https:// |
ai/generalinfo.1714057313.txt.gz · Last modified: by Wulf Rajek