Global Information Lookup Global Information

Minifloat information


In computing, minifloats are floating-point values represented with very few bits. Predictably, they are not well suited for general-purpose numerical calculations. They are used for special purposes, most often in computer graphics, where iterations are small and precision has aesthetic effects.[1] Machine learning also uses similar formats like bfloat16. Additionally, they are frequently encountered as a pedagogical tool in computer-science courses to demonstrate the properties and structures of floating-point arithmetic and IEEE 754 numbers.

Minifloats with 16 bits are half-precision numbers (opposed to single and double precision). There are also minifloats with 8 bits or even fewer.[citation needed]

Minifloats can be designed following the principles of the IEEE 754 standard. In this case they must obey the (not explicitly written) rules for the frontier between subnormal and normal numbers and must have special patterns for infinity and NaN. Normalized numbers are stored with a biased exponent. The new revision of the standard, IEEE 754-2008, has 16-bit binary minifloats.

  1. ^ Mocerino, Luca; Calimera, Andrea (24 November 2021). "AxP: A HW-SW Co-Design Pipeline for Energy-Efficient Approximated ConvNets via Associative Matching". Applied Sciences. 11 (23): 11164. doi:10.3390/app112311164.

and 9 Related for: Minifloat information

Request time (Page generated in 0.5464 seconds.)

Minifloat

Last Update:

In computing, minifloats are floating-point values represented with very few bits. Predictably, they are not well suited for general-purpose numerical...

Word Count : 1767

NaN

Last Update:

(binary128), decimal128 256-bit: Octuple (binary256) Extended precision Other Minifloat bfloat16 TensorFloat-32 Microsoft Binary Format IBM floating-point architecture...

Word Count : 3688

Subnormal number

Last Update:

(binary128), decimal128 256-bit: Octuple (binary256) Extended precision Other Minifloat bfloat16 TensorFloat-32 Microsoft Binary Format IBM floating-point architecture...

Word Count : 1915

Long double

Last Update:

(binary128), decimal128 256-bit: Octuple (binary256) Extended precision Other Minifloat bfloat16 TensorFloat-32 Microsoft Binary Format IBM floating-point architecture...

Word Count : 1133

Decimal floating point

Last Update:

(binary128), decimal128 256-bit: Octuple (binary256) Extended precision Other Minifloat bfloat16 TensorFloat-32 Microsoft Binary Format IBM floating-point architecture...

Word Count : 2373

IEEE 754

Last Update:

fully in hardware ISO/IEC 10967, language-independent arithmetic (LIA) Minifloat, low-precision binary floating-point formats following IEEE 754 principles...

Word Count : 7402

Microsoft Binary Format

Last Update:

(binary128), decimal128 256-bit: Octuple (binary256) Extended precision Other Minifloat bfloat16 TensorFloat-32 Microsoft Binary Format IBM floating-point architecture...

Word Count : 3402

Extended precision

Last Update:

(binary128), decimal128 256-bit: Octuple (binary256) Extended precision Other Minifloat bfloat16 TensorFloat-32 Microsoft Binary Format IBM floating-point architecture...

Word Count : 4025

Binary integer decimal

Last Update:

(binary128), decimal128 256-bit: Octuple (binary256) Extended precision Other Minifloat bfloat16 TensorFloat-32 Microsoft Binary Format IBM floating-point architecture...

Word Count : 672

PDF Search Engine © AllGlobal.net