Review R: Representing data

R1: [1] [2] [3] [4] [5] [6] [7] [8] // R2: [1] [2] [3] [4] // R3: [1] [2] [3] [4]

Problem R1.1

How many bits are in a kilobyte of memory?

There are 8,192 bits in a kilobyte:

8 bits × 1,024 bytes = 8,192 bits

byte KB KB

Problem R1.2

Approximate 2³⁴ in the form x × 10^y, with x and y both being base-10 integers. (Your answer need not be normalized.)

16 × 10⁹

Problem R1.3

Approximate 2⁴⁵ in the form x × 10^y, with x and y both being base-10 integers. (Your answer need not be normalized.)

32 × 10¹²

Problem R1.4

Approximately how many kilobytes are accessible with a 23-bit address space?

8,000. (2²³ / 2¹⁰ = 2¹³ = 2³ × 2¹⁰ ≈ 8 × 10³ = 8,000. We divide initially by 2¹⁰ because that is how many bytes are in a kilobyte.)

Problem R1.5

Perform each of the following conversions.

a.	101101₍₂₎	to decimal
b.	1010101₍₂₎	to decimal
c.	23₍₁₀₎	to binary
d.	95₍₁₀₎	to binary

a.	101101₍₂₎	= 45₍₁₀₎
b.	1010101₍₂₎	= 85₍₁₀₎
c.	23₍₁₀₎	= 10111₍₂₎
d.	95₍₁₀₎	= 1011111₍₂₎

Problem R1.6

Given that humans almost always use base-10 numbers, while modern computers always work with base-2 numbers, why do programmers often choose to write numbers in base 16?

Humans communicate much more easily with short strings, and hexadecimal numbers are 4 times shorter than binary numbers, so that is why they prefer hexadecimal over binary. However, we sometimes want to avoid decimal, since decimal numbers cannot be easily converted to binary representation, and often the binary representation is crucial to a program's meaning; by contrast, hexadecimal has a straightforward correspondence to binary.

Problem R1.7

Perform each of the following conversions. Show your work.

a.	110110₍₂₎	to decimal
b.	140₍₁₀₎	to binary
c.	`E1EC7ED`₍₁₆₎	to binary

a.	110110₍₂₎	=	54₍₁₀₎
b.	140₍₁₀₎	=	10001100
c.	`E1EC7ED`₍₁₆₎	=	1110 0001 1110 1100 0111 1110 1101₍₂₎

Problem R1.8

Perform each of the following conversions.

a.	1010101010101₍₂₎	to octal
b.	10101010101010₍₂₎	to hexadecimal
c.	101101₍₂₎	to hexadecimal
d.	560₍₈₎	to binary
e.	CAB₍₁₆₎	to binary
f.	D15EA5ED₍₁₆₎	to binary

a.	1 010 101 010 101₍₂₎	= 12525₍₈₎
b.	10 1010 1010 1010₍₂₎	= 2AAA₍₁₆₎
c.	10 1101₍₂₎	= 2D₍₁₆₎
d.	560₍₈₎	= 101 110 000₍₂₎
e.	CAB₍₁₆₎	= 1100 1010 1011₍₂₎
f.	D15EA5ED₍₁₆₎	= 1101 0001 0101 1110 1010 0101 1110 1101₍₂₎

Problem R2.1

Represent each of the following using the 8-bit two's-complement integer representation.

a.	10₍₁₀₎
b.	−60₍₁₀₎
c.	−104₍₁₀₎

a.	10₍₁₀₎	→	0000 1010
b.	−60₍₁₀₎	→	1100 0100
c.	−104₍₁₀₎	→	1001 1000

Problem R2.2

Represent each of the following in a two's-complement representation.

a.	−1₍₁₀₎	in a seven-bit two's-complement format
b.	−20₍₁₀₎	in a seven-bit two's-complement format
c.	20₍₁₀₎	in a seven-bit two's-complement format
d.	−300₍₁₀₎	in twelve-bit two's-complement format

a.	−1₍₁₀₎	→	111 1111
b.	−20₍₁₀₎	→	110 1100
c.	20₍₁₀₎	→	001 0100
d.	−300₍₁₀₎	→	1110 1101 0100

Problem R2.3

For the following, assume a six-bit two's-complement representation of integers.

a.	What numeric value does 110110 represent?
b.	What numeric value does 010101 represent?
c.	What bit pattern is used to represent −12₍₁₀₎?

a.	110110	→	−10₍₁₀₎
b.	010101	→	21₍₁₀₎
c.	−12₍₁₀₎	→	110100

Problem R2.4

a. What is the smallest (most negative) number you can represent in seven bits using sign-magnitude representation? Give both the bit pattern of the number and its base-10 translation.

b. Answer the same question for a seven-bit two's-complement representation.

a.	Sign-magnitude:	111 1111 represents −63₍₁₀₎
b.	Two's-complement:	100 0000 represents −64₍₁₀₎

Problem R3.1

Explain what ASCII is.

ASCII is a mapping to seven-bit values for the characters found on an English keyboard and special symbols.

Problem R3.2

In addition to characters, digits, punctuation symbols, and the space character, ASCII also includes some “unprintable” special symbols. Describe three of these special symbols.

The LF (linefeed) character is used to represent line breaks in a file. It is sometimes accompanied by a preceding CR (carriage return) character.
NUL (ASCII 0) is used by C to represent the end of a string.
BS represents a typed backspace
ESC represents a typed escape key
HT represents a tab character
BEL represents a sounded bell
FF represents a command to eject a page (or clear the screen)

Problem R3.3

Identify three categories of characters that are defined in Unicode but not in ASCII.

There are many answers here, but some possibilities: accented characters for other European languages (like ñ, é, and ö), Chinese characters (Han alphabet), the Cherokee alphabet, the Greek alphabet, the Arabic alphabet, the Hebrew alphabet, Egyptian hieroglyphics, playing cards, emoticons, mathematical symbols, musical notes and symbols.

Problem R3.4

Distinguish the UTF-8 and UTF-16 encodings for Unicode.

Both represent ways to encode Unicode code points into binary representation. UTF-16 works in two-byte blocks with all code points up to U+FFFF being encoded in precisely two bytes, But UTF-8 works in one-byte blocks with code points up to U+FFFF being encoded in one to three bytes; this allows UTF-8 to be backwards compatible with files represented simply with one ASCII character per byte.