cluster
Blu-ray
DVD-ROM

LAST
UPDATE: 10/30/06
Under
REconstruction!
This is a rewrite of my first
prototype presentation of I/O.
I will improve this, with
better
graphics, to make it more suitable for independent learning.
Blinking text designates
things
that I need to work on; the material is not wrong, but can be improved.
(Don't worry, I don't like
blinking
text, either, so there will not be any in the finished product!)

LEARNING MODULE
IV
SECONDARY STORAGE
Data
processing requires the use of primary memory. However, primary memory,
being electronic, is volatile and thus can not be utilized for
permanent
storage. Instead, secondary (auxiliary) storage is needed to
permanently
save data and programs until they are needed, at which time the
computer
will load them from secondary storage into primary memory. The general
term for the secondary storage containers of data and programs is "files";
however, the term is so general that it has many, often confusing,
uses.
The
sequence of presentations in this learning module is as follows.
You can click on any link to jump directly to that section.
Two
outstanding Web sites for comparisons of secondary storage (and all
other
hardware) are PRICE WATCH, www.pricewatch.com/,
and C|NET's Hardware subsite, computers.cnet.com/hardware
(Type storage in the SEARCH field; C|NET has a
nice "compare" feature, which displays the specifications of
different items that you select.).
The
goals of this Learning Module are to help the student:.
- understand
the fundamental principle that everything saved on secondary
storage
is contained in a file and
- that
there are only two fundamental types of computer files, program
files and data files, and
- that the
file format of a data file is defined by the application that created it
and
- a data
file will appear correctly when opened by the application that created
it.
- understand
the basic differences in direct access and sequential access
technologies.
- understand
the similarities and differences between the most popular storage
media,
especially magnetic, optical, and magneto optical storage media.
- appreciate
the balance between secondary storage, RAM/ROM, and CPU memory (the
"memory
hierarchy") and how this can be optimized to provide a cost-effective
computer
system.
NOTE:
If you are an independent learner (not attending the on-campus
classes),
it is especially important to read the study
guide for this LM.
Even
though it is virtually impossible to simulate the interactive in-class
presentation on this Web site, I do try. However, I need your
help,
so read the study guide to try to understand what I am trying to
do.
(I'd appreciate your feedback on how to improve this simulations of the
class environment.)
TPQ
1: Rewrite the preceding objectives in terms of personal
accomplishments
to be attained after finishing the study of this learning module.
The
sequence of presentations in this learning module is as follows.
You can click on any link to jump directly to that section.
- FILES
ARE NAMED COLLECTION OF BITS ON SECONDARY STORAGE
- CLASSIFYING
COMPUTER STORAGE
- SEQUENTIAL ACCESS
STORAGE DEVICES
- DIRECT ACCESS
STORAGE DEVICES
- EMERGING
SECONDARY STORAGE TECHNOLOGIES
- SUMMARY
1. FILES
ARE
NAMED COLLECTIONS BITS ON SECONDARY STORAGE:
Review LM IIIB, Section 2
to emphasize the distinction between primary memory and secondary storage.
It should be remembered
that, although humans envision files containing
number, words, pictures, sounds, etc., everything on secondary
storage
(as well as in primary storage) are actually composed of bits which
are the only forms that can be processed by digital computers. These
bits
must be converted to human-recognizable forms before they are output.
Figure
SS-1 shows the different ways humans and computers view files
and
illustrates the different types of data files.
- There are
basically
two kinds
of files:
- Program files
which have
two forms
- source code
instructions
written in programming languages. These must be translated into
machine
language in order to become...
- executable files
(which
have ____(1) extensions in the Windows environment) that
contain machine
language programs. When these programs are "run" they process
associated.
- Data files
which contain binary
numbers or binary codes characteristic of the application that
created
the file, e.g. the ______(2) code for alphanumeric
characters.
Some examples (See Figure SS-1.)
of data files, with a human view of their contents, are:
- database
files
or flat
files (often confusingly called "data files", simply, "files")
contain files
which contain records which contain fields.
- text files
which can
be encoded using ASCII or the newer 16-bit Unicode (See the previous
discussion in Learning Module IIA.) or as "document" files
in
which formatting is encoded along with the text, e.g. the "_____(3)"
files of Microsoft Word. (See the Note at the end of this
section.)
- spreadsheet
files which
consist of tables of numbers, formulas, or text; the basic unit, called
a "cell", is the intersection of a row and a column.
- graphic
files
which contain
two basic forms (bitmap images and vector graphics;)
- A bitmap
graphic
is an
image that is represented by a binary code that specifies the state
of each individual pixel displayed on a computer screen. For
black/white
this requires only one bit per pixel; for gray scale or color several
bits/pixel are required,
e.g. one byte/pixel can provide ______(4)
colors (or shades of gray).
Bitmap
graphics are created using "paint"
programs which can modify individual
pixels and erase parts of the image; these can not be done in vector
graphics.
- A vector
graphic
(sometimes
called an object oriented graphic) is an image that results from the
execution
of a mathematical instruction which continuously "draw" the image
(e.g. a square or circle), continuously, on a computer screen.
Since this does not require storage of pixel data, vector graphic files are much smaller that
equivalent bitmap graphic files. A vector graphics
application (often called a "draw"
program) must be used to display the images. The instructions
are converted
into binary code to store or transmit the picture. Unlike
bitmap
graphics, vector graphic images can be moved, resized, etc., but
individual pixels
can not be modified, i.e. there is no erase capability.
- audio files
contain digitized
sounds, and
- video files
contain
multiple
bitmap images that are displayed sequentially (e.g. 30 frames per
second)
to give the illusion of motion video.
Note: The preceding
file
types are generic. Actual data files are produced by
applications and have characteristic file extensions,
e.g. ".xls" files of Microsoft _______(5). Thus a
better
term
would be "Application Data Files" because they are defined
by the application that created them. When such a file is opened
by a different application it looks like garbage. (Try
opening an .xls (Excel) file in
text editor, like Notepad; you will see garbage. (Actually modern
word processors have built-in facilities that allow them to "import"
some spreadsheet files, e.g. Word can import Excel files (but
formatting is usually messed up); however, this can not be done in
unrelated word processor or spreadsheet applications.)
- The form of file
names
depends on the O.S. used.
- Command line
O.S.
(e.g.
DOS/Windows3.1, UNIX, VMS) use old-fashioned file naming conventions
with
frustrating restrictions on the number of characters that can be used
in
file names. For example, DOS/Windows3.1 file names are limited to eight
characters separated (by a period) from a three-character extension.
Thus
a batch file (containing a series of O.S. commands) that a user
would like to automatically execute when he/she boots the
computer
is named "AUTOEXEC.BAT". Such DOS limitations do not occur in
the MacOS or Windows 95 and later.
- Modern O.S
with
GUI (e.g. ____________(6)) can
have virtually any name (including blanks) for a file, although there
is
a practical limit on the number of characters, depending on the
particular
O.S.
To avoid problems
(especially in network access) it is still advisable to avoid
blanks in file names.
SAQ
1: What are the similarities and differences between (a) program files
and data files, (b) a Word file and a WordPerfect file, and (c) a Word
file and an Excel file.
2.
CLASSIFYING COMPUTER STORAGE:
When transaction processing replaced batch processing
as
the dominant computer activity, direct access storage media
(magnetic
and optical disks) became the dominant storage devices. Sequential
access
media (magnetic tape), characteristic of batch processing, is now
primarily
used for inexpensive off-line storage (backup storage).
2.1 Volatile RAM Versus
Nonvolatile
Secondary Storage:
- Random Access
Memory (RAM),
the "working storage" of primary memory, is
- is volatile
( its contents
are lost if power to the computer is interrupted).
- has limited
capacity
when compared to secondary storage
- is very
expensive
when
compared to secondary storage
- Secondary Storage
is
nonvolatile,
cheap, and has unlimited capacity.
2.2 Fixed Versus Removable Secondary
Storage :
- Fixed media (hard
disks), usually sealed
in a contaminant-free environment, can not be touched or removed by the
user.
- Removable media
(floppies, cartridges, USB drives, etc.) have less storage/disk, but
provide virtually unlimited
storage
via swapping. They are less reliable than hard disks and are more
easily
damaged.
2.3 Sequential Access Vs.
Direct Access
Storage:
- Sequential access
media (magnetic
tape) requires all data to be accessed serially until the
desired
data is found.
- Direct access
(magnetic, optical
disks, or flash memory) media allows the desired data to be found:
- on disk media, by moving
the read/write
head straight to it.
- on flash media, by accessing memory locations directly
(like RAM access).
2.4 Real vs. Virtual Storage:
- Real primary
memory
consists
of RAM and ROM (See Learning
Module IIIB, section 2.B.)
-
Virtual
memory
systems augment
primary memory with secondary storage, i.e. parts of secondary
storage "appear to the OS" as if they are slow RAM. (Review
LM
II, section 4.E.) For
example, programs that are too large
to fit into primary memory are subdivided into fixed length pages or
variable length segments in secondary storage; these can be
switched,
under O.S. control, into primary memory whenever they are needed.
Thus, to the OS, it "appears" as if the system RAM is larger than it
actually is.
SAQ
2: What are the similarities and differences in each of the pairs
discussed in the preceding four subsections.
3.
SEQUENTIAL ACCESS STORAGE DEVICES:
Sequential access is an inherent characteristic of tapes.
This also applies to analog media like video tapes and audio
tapes.
The idea even predates magnetic computer storage, e.g. data used to be
stored by punching holes in paper tapes. Magnetic tape used to be
a popular backup medium, but is being suplanted by faster access media
like optical disks, e.g. DVDs.
3.1 Magnetic Tape:
- Data is stored as
a
magnetic
pattern of bits on tracks that run the length of the tape.
- A magnetic spot
represents a
1; no magnetic spot represents a 0.
- A tape contains
9
tracks, 8
for a byte of data and one for a parity bit used for checking for
errors.
- Because tapes can
read
or write
only when moving at full space, records (groups of data) are usually blocked
together; the number of records in a block are designed to optimize
data
transmission.
- Tapes are now used
mainly for backups
of disk storage.
3.2 Cartridge and Cassette Tapes:
- Small,
inexpensive,
and convenient
1/4" data cartridges are popular media for backup.
- Cassette tapes
were
used with
early, cheap micros, but are virtually obsolete.
SAQ
3: Sequential access storage technology is virtually
obsolete.
If so why is it still used?
4.
DIRECT ACCESS STORAGE DEVICES:
Direct
access (
often
misleadingly called "random
access")
is characteristic of disks. Thus the idea predates
computer
storage, e.g. phonograph records are direct access analog
storage.
In order to store files on any disk that disk must be formatted,
i.e. initialized so that data can be stored in an organized manner;
this
organization is characteristic of the O.S. being used.
Therefore
a Windows based computer can not read Mac files, Unix files, etc.,
unless
special translation software is available. Formatting creates sectors
and tracks (See below.) on which the data is stored and creates
a file directory (called a file allocation table or FAT in
Windows)
which is loaded into RAM along with the O.S. when the system
boots.
(Remember Figure S-3 of
Learning
Module II?) Direct access storage
is currently dominated by magnetic media (hard disks, removable
hard disks, and floppies), but magneto-optical and read/write
optical media (DVD, DVD-RAM, and DVD+RW) promise to
revolutionize
storage technologies!
4.1 Magnetic Disks:
- Data is stored on
a
series of
concentric circles called tracks; this data is encoded as
magnetic
bit patterns which can be created or read by the read/write head of the
disk drive. (
See Figure
6-6 in the text.)
- Storage
capacity
is a
function of the number of tracks per inch (TPI), the number of disk in
a disk pack, and the bit density (bits per inch, bpi) of the disks.
- Data is directly
accessed
by rotating the disk beneath an actuator (single disk) or access arms
(multi-disk)
which move the read/write head in and out across the tracks.
- Data on single
disks
is organized
in sectors, pie-like subdivisions; specific data is located by
its
sector and track numbers
- Data on "disk
packs"
is organized
in cylinders, vertically aligned tracks. Data sequences are
placed
on the same tracks of adjacent disks rather than on adjacent tracks on
one disk; this minimizes the movement of the access arms thus
maximizing
the speed of access. (
See Illustration.)
- A group of two
or
more integrated
hard disks is called a RAID (redundant array of independent
disks). (
See Illustration.)
- Unlike sectors, cylinders, and RAID
(which are physical components of secondary storage), a cluster is the logical unit of file storage on a hard disk, managed by the computer's operating system.
A file consists of one or more clusters of
storage, which can be scattered over different locations
on the hard disk. The clusters associated with a particular file are
recorded
in the hard disk's file allocation
table (FAT). When a file is accessed, the
complete file is retrieved without the user being aware of the
different clusters where it is stored.
- Since a cluster is a
logical rather than a physical unit
(it's not
built into the hard disk itself), the size of a cluster can be varied.
- Disk access
time
is a
function of the seek time (to position the r/w head over the
desired
track) and rotational delay (to rotate the r/w head to the data
position), and data movement time (time to transfer data
between
disk storage and primary memory).
- Types of
magnetic disks:
- Hard Disks
(
See Illustration.) are
rigid aluminum platters coated
with
magnetic oxide whose high precision provide the highest storage
capacities
and quickest access rates of all magnetic media. This requires a
contaminant-free
environment; if contaminants do reach the disk surface a "crash"
occurs where a scratch on the disk surface results in data loss. (
See Illustration.)
Hard disks, currently reaching double
digit MB ranges, have the advantage of being the fastest mass
storage
but are permanent (i.e. the storage disks can not be switched).
- Diskettes
("floppies")
are made of flexible Mylar plastic coated with magnetic oxide. They
come
in 8, 5.25, and 3.5 inch sizes, can have single (SD), double (DD), or
high
densities (HD), and can be either single- or double-sided. Older
diskettes,
5.25" DD that hold 360 KB, 3.5" DD that hold 720 KB,
and
3.5" HD diskettes that hold 1.4 MB are all virtually obsolete
now because better
alternatives, such as USB flash drives (See below.) have much greater
storage capacity, are more reliable, and much faster. (About
their only worth is as cheap ways to give small digital files to
another person).
- Hard Cards
are hard disks that are mounted directly on their interface card and
fit
directly into the expansion slots on a PC motherboard.
- Cartridge
disk
drives are
devices that accept small removable (but hermetically sealed) disk
cartridges
(typically 3.5" with 10 MB to single digit GB storage
capacity.)
- Magneto-optical
disk
(read/write/erasable) are often confusingly categorized as optical
disks
because lasers are used to read data as well as facilitate writing
data.
However, the data is stored magnetically in microscopic "magnetic
domains".
When the high-power laser heats the magnetic storage film the domains
can
be aligned in higher densities than on regular magnetic disks, thus
giving
higher storage capacity than hard disks. Data is retrieved by
reflecting
a low-power polarized laser beam off of the magnetic film. The
polarization
of reflected beam can be interpreted as binary data.
- Magneto-optical
disks are removable and have removable gigabyte storage capacities,
supposedly
up to 20 GB!!
- The read/write
capability of
these disks is currently compromised by its relatively slow access
time
compared to hard disks.
- DMA (____________(7)
and
Ultra DMA are technologies for transferring data between secondary
storage
and RAM without ___________________(8). (Review LM IIIC, section
3.A.a.)
SAQ
4: What are the similarities and differences between hard disks
and
diskettes, (b) hard disks and cartridge disks, and (c) cartridge disks
and magneto-optical disks?
4.2 Optical Disks:
- Data is encoded
on the
disk surface by a laser beam either burning holes (ablative method) or
heating the surface until a bubble forms (bubble method). Data, text,
audio
signals and video images are stored as digitized patterns in frames.
- Data is read
when the
surface reflects light through a series of mirrors to a photodiode
(which
converts light to electrical signals). (
See Illustration.)
- Types of Optical
Disks:
- Analog data can
be stored on read-only Video disks which has been typically used to store
text, graphics, video images, and audio signals.
- Digital data
is stored on three fundamentally different types of optical storage
media:
- Read-only
storage disks includes CD-ROM (650-700
MB)
and DVD-ROM (4.7-17 GB).
- WORM
(Write
Once Read Many) technology is used in CD-R (compact disk
recordable) which is popular for
permanent storage, especially as a backup medium. (WORM
drives can not be used as secondary
storage
because data can be stored on them only once.)
- Read/write
Optical disks;
see the next section.
- Read/write
Optical media for digital data is revolutionizing
secondary
storage capabilities. Technologies include:
- CD-RW drives,
which have read/write capability at CD-ROM capacities (650 MB),
appeared
in 1997. They can read CD-ROMs and can write to CD-R disks, but a CD-RW
disk can only be read by a CD-RW drive.
- DVD
(which originally stood for digital video disks but now means digital
versatile
disks) is the dominant read/write optical storage technology. It
utilizes red lasers, but has two
competing
technologies.
- DVD-RAM,
(which is currently shipping) backed by Hitachi,
Panasonic,
Toshiba, and others, can store 2.6 GB per side and
- DVD+RW,
backed by HP, Phillips, Sony, and others, (but many can store 3.6 GB
per
side; both types of drives can read DVD-ROM and all CD formats.
Currently
their read/write times are less than hard disks.
Blu-ray
is a new optical disk technology tha utilizes
a blue-violet
laser (which has a higher
resolution than the red laser of today's DVD
players); consequently a blue laser can store data in a smaller space.
- Blu-ray's storage capacity is enough to store a
continuous backup copy
of small hard drives on a
single disc. The first products
will
have a 27 gigabyte (GB) single-sided capacity, 50 GB on dual-layer
discs. Data streams at 36 megabytes per second (Mbps),
fast enough for high quality video recording. Single-sided Blu-ray
discs can store up to 13 hours of standard video data, compared to
single-sided DVD's 133 minutes. People are referring to Blu-ray as the
next generation DVD, although according to Chris Buma, a spokesman from
Philips (quoted in New Scientist) "Except for the size of the
disc, everything is different."
- Blu-ray discs will not play on current CD and DVD
players, because they
lack the blue-violet laser required to read them. If the appropriate
lasers are included, Blu-ray players will be able to play the other two
formats. However, because it would be considerably more expensive, most
manufacturers may not make their players backward
compatible.
- See Wikipedia's article on
Blu-ray.
- See
TechEncyclopedia's
excellent comparison of the different types of DVDs (as well as
DVDs vs. CDs and Blu-ray).
EXPAND...
A
really neat Web site for comparison shopping for hardware is
PRICE
WATCH, whose URL is www.pricewatch.com/.
Another
site that gives independent analysis and recommendations of hardware is
C|NET's Hardware
site.
SAQ
5: What are the similarities and differences between recordable
CDs,
CD-RW, and DVD?
SAQ
6: What are the fundamental similarities and differences between
sequential and direct access storage technologies?
5.
SOLID STATE SECONDARY STORAGE TECHNOLOGIES:
Replacements for the obsolete floppy are long overdue. The super
disk and Zip disks should have replaced floppies long ago,
but now they face
competition
from portable digital
devices, especially flash memory. The
current problem is that these
technologies are currently
incompatible
and one
can only speculate as to
whether
the market can support more than one standard and, if not, which
technology
will predominate. (See the
Computer
Desktop Dictionary for illustrations
of these memory modules.)
- USB Flash drives
(Review Section
2.A.e, LMIIIB.) are
flash memory modules that plug into a USB port, serving as small, long
lifetime, rapid access secondary
storage. Small and light
enough to hook onto a key chain or be worn as a necklace, they serve
efficient portable
secondary storage devices that will probably become the most popular
type of secondary storage. (Flash
evolved from the EPROM and EEPROM chip technologies and is comonly used
in BIOS chips; review Section
2.B.c, LMIIIB.)
- CompactFlash
is
a new
generation of high capacity secondary storage cards for digital cameras
that is available in capacities of 4, 8, 12, 16, 24, and 32 MB. The
technology
can provide sustained write speeds up to 750 KBps. It also features
an intelligent power
management
scheme to reduce power consumption up to 100 per cent (says the vendor)
under read/write conditions, as well as reduced stand-by current
requirements.
(Definition
from Internet Product Watch.)
- Smart
media: An
ultra-compact flash memory format developed by Toshiba. About the size
of CompactFlash, but as thin as a credit card, SmartMedia cards are
popular
storage for digital cameras with capacities up to 32 MB. Available in
3.3
and 5 volt variations, SmartMedia cards require no assembly in
manufacture
as they are actually flash memory chips in a unique chip package. The
cards
can be plugged into a SmartMedia socket or into a standard Type II PC
Card
slot with an adapter. (Definition
from The Computer Desktop Dictionary.)
- Memory
stick: A
flash memory card from Sony
designed for handheld digital appliances
such
as cameras and camcorders. Introduced in 1998 with 4 and OMB
capacities,
the tiny modules are less than 1x2" and about a tenth of an inch thick
(.85 x 1.97 x .11"). Transfer to a PC is made via a PC Card
adapter.
(Definition
from The Computer Desktop Dictionary.)
- Other
types of cards:
- Multimedia
Card: A
small (32x24x1.4mm, weighing less than two grams) flash memory card
designed
for handheld devices such as cell phones and pagers. It was introduced
in
1997 with a OMB capacity which has be improved to KGB. It is
backward
compatible
with
older multimedia cards.
- SD Memory Cards and xD Cards are used with digital
cameras and other handheld devices.
- RS-MMC
and miniSD cards are
used in cellphones and other handheld devices.
6.
THE MEMORY HIERARCHY:
- The traditional
memory
hierarchy,
which ranks storage methods on their storage capacity, access time, and
storage cost, is based on the speed differences between the CPU
and other parts of the computer.
- In essence, a
computer system
is a set of staging areas. The only processing goes on in the
registers within
the CPU; everything else is for support.
- To minimize
costs
and maximize
efficiency primary memory is used as a temporary storage.
- The memory
hierarchy
from the
slowest/cheapest to the fastest/costliest:
- Backup
(off-line) media
(long-term, non-volitile secondary storage, traditionally tapes but being replace by high density obtical media) is used
to save all data and
programs
so that they can be retrieved after a malfunction of other storage
devices.
- On-line
Direct access
media (short-term, non-volitile secondary storage, usually hard disks and removable media) store data and
programs until they are loaded into primary memory.
- Primary memory
(volatile
"working" storage used during processing includes RAM and cache) hold
data and/or programs
immediately needed by the CPU. (Programs/data must be in primary
memory in order to be processed by the CPU)
- Registers
(active storage
used during processing of individual instructions) hold the data
actually being processed within
the
CPU.
SAQ
7: Why is it important to understand the memory hierarchy?
7.
SUMMARY:
- Everything
stored in a computer is a file,
a named collection of bits.
- There are two
types of files:
- Program files
can be
source code (in a computer programming language) or executables (in
machine code).
- Data files
are digital
files characteristic of the application that created them.
- File name
characteristics depend on the
operating system.
- Computer
storage can be viewed several ways.
- RAM is volatile
(requiring power to retain data), but secondary storage is (relatively)
static.
- Fixed storage is
inaccessible, but removable
storage is, by design, portable.
- Sequential access
is characteristic of tape media, whereas direct access is characteristic of
disk media.
- Real memory
actually exists and is limited, but virtual
memory (perceived by the operating system) is the augmentation
of RAM with secondary storage, making it appear that a computer has
more RAM than its real memory.
- Sequential storage devices
(virtually obsolete, used only for long term backup on large systems)
consist of magnetic tape and tape cartridges.
- Direct access storage
devices consist of magnetic or optical disk media.
- Flash memory devices,
being faster (no moving parts), with longer lifetimes, and being more
portable are rapidly becoming the most popular secondary storage
technology for computers as well as digital cameras.
- The "memory hierarchy"
is a way to categorize memory and secondary storage according access
speed (and cost). An optimum memory/storage system is typically a
cost effective combination of:
- offline
backup
- online,
direct access secondary storage
- primary
memory (including cache)
- CPU
registers, characteristic of the CPU architecture.
|
FIGURE
SS-1: DIFFERENT TYPES OF DATA FILES
|
 |