2.6 History of ieee floating-point format

High performance computing Page 1 / 1

History of ieee floating-point format

Prior to the RISC microprocessor revolution, each vendor had their own floating- point formats based on their designers’ views of the relative importance of range versus accuracy and speed versus accuracy. It was not uncommon for one vendor to carefully analyze the limitations of another vendor’s floating-point format and use this information to convince users that theirs was the only “accurate” floating- point implementation. In reality none of the formats was perfect. The formats were simply imperfect in different ways.

During the 1980s the Institute for Electrical and Electronics Engineers (IEEE) produced a standard for the floating-point format. The title of the standard is “IEEE 754-1985 Standard for Binary Floating-Point Arithmetic.” This standard provided the precise definition of a floating-point format and described the operations on floating-point values.

Because IEEE 754 was developed after a variety of floating-point formats had been in use for quite some time, the IEEE 754 working group had the benefit of examining the existing floating-point designs and taking the strong points, and avoiding the mistakes in existing designs. The IEEE 754 specification had its beginnings in the design of the Intel i8087 floating-point coprocessor. The i8087 floating-point format improved on the DEC VAX floating-point format by adding a number of significant features.

The near universal adoption of IEEE 754 floating-point format has occurred over a 10-year time period. The high performance computing vendors of the mid 1980s (Cray IBM, DEC, and Control Data) had their own proprietary floating-point formats that they had to continue supporting because of their installed user base. They really had no choice but to continue to support their existing formats. In the mid to late 1980s the primary systems that supported the IEEE format were RISC workstations and some coprocessors for microprocessors. Because the designers of these systems had no need to protect a proprietary floating-point format, they readily adopted the IEEE format. As RISC processors moved from general-purpose integer computing to high performance floating-point computing, the CPU designers found ways to make IEEE floating-point operations operate very quickly. In 10 years, the IEEE 754 has gone from a standard for floating-point coprocessors to the dominant floating-point standard for all computers. Because of this standard, we, the users, are the beneficiaries of a portable floating-point environment.

Ieee floating-point standard

The IEEE 754 standard specified a number of different details of floating-point operations, including:

Storage formats
Precise specifications of the results of operations
Special values
Specified runtime behavior on illegal operations

Specifying the floating-point format to this level of detail insures that when a computer system is compliant with the standard, users can expect repeatable execution from one hardware platform to another when operations are executed in the same order.

Ieee storage format

The two most common IEEE floating-point formats in use are 32- and 64-bit numbers. [link] gives the general parameters of these data types.

Parameters of ieee 32- and 64-bit formats
IEEE75	FORTRAN	C	Bits	Exponent Bits	Mantissa Bits
Single	REAL*4	float	32	8	24
Double	REAL*8	double	64	11	53
Double-Extended	REAL*10	long double	>=80	>=15	>=64

In FORTRAN, the 32-bit format is usually called REAL, and the 64-bit format is usually called DOUBLE. However, some FORTRAN compilers double the sizes for these data types. For that reason, it is safest to declare your FORTRAN variables as REAL*4 or REAL*8 . The double-extended format is not as well supported in compilers and hardware as the single- and double-precision formats. The bit arrangement for the single and double formats are shown in [link] .

Based on the storage layouts in [link] , we can derive the ranges and accuracy of these formats, as shown in [link] .

This figure shows two diagrams, the first labeled single precision, and the second labeled double precision. Both diagrams are thin segmented blocks, with the first section on the left labeled s, the section to the right of it labeled exp, and the rightmost section labeled mantissa. The widths of these sections are labeled below each diagram. For single precision, the entire width is measured as 32 bits. The s-section is labeled with a black block, exp is labeled with a black block, and the width of mantissa is labeled 23. For double precision, the entire width is measured as 64 bits. The s-section is labeled with a black block, exp is labeled 11, and mantissa is labeled 52.

Range and Accuracy of IEEE 32- and 64-Bit Formats

IEEE754	Minimum Normalized Number	Largest Finite Number	Base-10 Accuracy
Single	1.2E-38	3.4 E+38	6-9 digits
Double	2.2E-308	1.8 E+308	15-17 digits
Extended Double	3.4E-4932	1.2 E+4932	18-21 digits

Converting from base-10 to ieee internal format

We now examine how a 32-bit floating-point number is stored. The high-order bit is the sign of the number. Numbers are stored in a sign-magnitude format (i.e., not 2’s - complement). The exponent is stored in the 8-bit field biased by adding 127 to the exponent. This results in an exponent ranging from -126 through +127.

The mantissa is converted into base-2 and normalized so that there is one nonzero digit to the left of the binary place, adjusting the exponent as necessary. The digits to the right of the binary point are then stored in the low-order 23 bits of the word. Because all numbers are normalized, there is no need to store the leading 1.

This gives a free extra bit of precision. Because this bit is dropped, it’s no longer proper to refer to the stored value as the mantissa. In IEEE parlance, this mantissa minus its leading digit is called the significand .

[link] shows an example conversion from base-10 to IEEE 32-bit format.

This figure consists of a boxed set of numbers and a long string of binary below the box. Inside the box are three rows of text. The first row reads 172.625, Base 10. The second reads 10101100.101 x 2 ** 0, Base 2. The third reads 1.0101100101 x 2 ** 7, Base 2 Normalized. Below this diagram is the caption, add 127 for bias = 134. From the 7 in the third row is an arrow that points to the beginning of a string of binary, that reads, 0 space 10000110 space 01011001010000000000000. From the beginning of the third row in the diagram is an arrow that points at the third segment of the binary. Below the binary is a line of text that reads, 1. Assumed bit and binary point.

The 64-bit format is similar, except the exponent is 11 bits long, biased by adding 1023 to the exponent, and the significand is 54 bits long.

Questions & Answers

what is biology

Hajah Reply

the study of living organisms and their interactions with one another and their environments

AI-Robot

what is biology

Victoria Reply

HOW CAN MAN ORGAN FUNCTION

Alfred Reply

the diagram of the digestive system

Assiatu Reply

allimentary cannel

Ogenrwot

How does twins formed

William Reply

They formed in two ways first when one sperm and one egg are splited by mitosis or two sperm and two eggs join together

Oluwatobi

what is genetics

Josephine Reply

Genetics is the study of heredity

Misack

how does twins formed?

Misack

What is manual

Hassan Reply

discuss biological phenomenon and provide pieces of evidence to show that it was responsible for the formation of eukaryotic organelles

Joseph Reply

what is biology

Yousuf Reply

the study of living organisms and their interactions with one another and their environment.

Wine

discuss the biological phenomenon and provide pieces of evidence to show that it was responsible for the formation of eukaryotic organelles in an essay form

Joseph Reply

what is the blood cells

Shaker Reply

list any five characteristics of the blood cells

Shaker

lack electricity and its more savely than electronic microscope because its naturally by using of light

Abdullahi Reply

advantage of electronic microscope is easily and clearly while disadvantage is dangerous because its electronic. advantage of light microscope is savely and naturally by sun while disadvantage is not easily,means its not sharp and not clear

Abdullahi

cell theory state that every organisms composed of one or more cell,cell is the basic unit of life

Abdullahi

is like gone fail us

DENG

cells is the basic structure and functions of all living things

Ramadan

What is classification

ISCONT Reply

is organisms that are similar into groups called tara

Yamosa

in what situation (s) would be the use of a scanning electron microscope be ideal and why?

Kenna Reply

A scanning electron microscope (SEM) is ideal for situations requiring high-resolution imaging of surfaces. It is commonly used in materials science, biology, and geology to examine the topography and composition of samples at a nanoscale level. SEM is particularly useful for studying fine details,

Hilary

cell is the building block of life.

Condoleezza Reply

Got questions? Join the online conversation and get instant answers!

Jobilize.com Reply

<< Chapter < Page Page > Chapter >>

Read also:

Get Jobilize Job Search Mobile App in your pocket Now!

100% Free Mobile Applications
Receive real-time job alerts and never miss the right job again

Source: OpenStax, High performance computing. OpenStax CNX. Aug 25, 2010 Download for free at http://cnx.org/content/col11136/1.5

Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'High performance computing' conversation and receive update notifications?

Ask

	47 Biology 47 Conservation Biology Biodiversity MCQ By OpenStax Start Quiz
	13 AP Key Terms 13 Anatomy of the Nervous System By OpenStax Start Key Terms
	18 AP Key Terms 18 Cardiovascular System Blood By OpenStax Start Key Terms
©flickr: brett	Celine Dion Quiz By JavaChamp Team Start Quiz
	Engineering Communication MCQ By Steve Gibbs Start Quiz
©flickr:	Anatomy Physiology By Jemekia Weeden Start Quiz
	Clinical Issues in TB Management By Cath Yu Start Quiz
	2 Physiotherapy Flashcards Set 2 By Rhodes Start Flashcards
	Socialism Command Economy By Robert Murphy Start Test
	Social Dances 1 By Marion Cabalfin Start Test

2.6 History of ieee floating-point format

History of ieee floating-point format

Ieee floating-point standard

Ieee storage format

Ieee754 ﬂoating-point formats

Converting from base-10 to ieee internal format

Converting from base-10 to ieee 32-bit format

Questions & Answers

Read also:

Get Jobilize Job Search Mobile App in your pocket Now!