Alphanumeric Codes
In order to communicate, you need not only numbers, but also letters and
other symbols. In the strictest sense, alphanumeric codes are codes that
represent numbers and alphabetic characters (letters). Most such codes,
however, also represent other characters such as symbols and various
instructions necessary for conveying information.
At a minimum, an alphanumeric code must represent 10 decimal digits and 26
letters of the alphabet, for a total of 36 items. This number requires six
bits in each code combination because five bits are insufficient (25
= 32). There are 64 total combinations of six bits, so there are 28 unused
code combinations. Obviously, in many applications, symbols other than just
numbers and letters are necessary to communicate completely. You need
spaces, periods, colons, semicolons, question marks, etc. You also need
instructions to tell the receiving system what to do with the information.
With codes that are six bits long, you can handle decimal numbers, the
alphabet, and 28 other symbols. This should give you an idea of the
requirements for a basic alphanumeric code. The ASCII is a common
alphanumeric code and is covered next.
ASCII
ASCII
is the abbreviation for American Standard Code for Information
Interchange. Pronounced “askee,” ASCII is a universally accepted
alphanumeric code used in most computers and other electronic equipment.
Most computer keyboards are standardized with the ASCII. When you enter a
letter, a number, or control command, the corresponding ASCII code goes
into the computer
ASCII has 128 characters and symbols represented by a 7-bit binary code.
Actually, ASCII can be considered an 8-bit code with the MSB always 0.
This 8-bit code is 00 through 7F in hexadecimal. The first thirty-two
ASCII characters are nongraphic commands that are never printed or
displayed and are used only for control purposes. Examples of the control
characters are “null,” “line feed,” “start of text,” and “escape.” The
other characters are graphic symbols that can be printed or displayed and
include the letters of the alphabet (lowercase and uppercase), the ten
decimal digits, punctuation signs, and other commonly used symbols
Table 2–7 is a listing of the ASCII code showing the decimal, hexadecimal,
and binary representations for each character and symbol. The left section
of the table lists the names of the 32 control characters (00 through 1F
hexadecimal). The graphic symbols are listed in the rest of the table (20
through 7F hexadecimal).
Extended ASCII Characters
In addition to the 128 standard ASCII characters, there are an additional 128 characters that
were adopted by IBM for use in their PCs (personal computers). Because of the popularity
of the PC, these particular extended ASCII characters are also used in applications other
than PCs and have become essentially an unofficial standard.
The extended ASCII characters are represented by an 8-bit code series from hexadecimal
80 to hexadecimal FF and can be grouped into the following general categories: foreign
(non-English) alphabetic characters, foreign currency symbols, Greek letters, mathematical
symbols, drawing characters, bar graphing characters, and shading characters.
Unicode:
Unicode provides the ability to encode all of the characters used for the written languages
of the world by assigning each character a unique numeric value and name utilizing the
universal character set (UCS). It is applicable in computer applications dealing with multilingual text, mathematical symbols, or other technical characters.
Unicode has a wide array of characters, and their various encoding forms are used in many
environments. While ASCII basically uses 7-bit codes, Unicode uses relatively abstract “code
points” non-negative integer numbers that map sequences of one or more bytes, using
different encoding forms and schemes. To permit compatibility, Unicode assigns the first 128
code points to the same characters as ASCII. One can, therefore, think of ASCII as a 7-bit
encoding scheme for a very small subset of Unicode and of the UCS.
Unicode consists of about 100,000 characters, a set of code charts for visual reference,
an encoding methodology and set of standard character encodings, and an enumeration
of character properties such as uppercase and lowercase. It also consists of a number of
related items, such as character properties, rules for text normalization, decomposition,
collation, rendering, and bidirectional display order (for the correct display of text containing both right-to-left scripts, such as Arabic or Hebrew, and left-to-right scripts).
0 Comments
Post a Comment