13.1 The Evolution of Tape Hardware
The first practical demonstration of magnetic recording was performed by Valdemar Poulsen in 1898 who recorded analog audio on a steel piano wire [
46]. Recording on something resembling modern tape was first demonstrated by Fritz Pfleumer in 1927 who recorded analog audio on thin strips of paper coated with iron oxide powder [
47]. Another key step was the 1934 development by Eduard Schüller of the ring recording head which provided a more focused magnetic field and enabled more precise control of the magnetization of small regions of the magnetic media [
47]. Over the next two decades, a foundation for the development of tape data storage were laid in the continued improvement of analog audio recording on magnetic tape, led primarily by AEG in Germany and Ampex in the U.S.A. [
79]. The first magnetic tape system for data storage, the UNISERVO I, was commercialized by the Remington Rand corporation in 1951 and was the primary I/O device for the UNIVAC I computer. The UNISERVO I used a metal tape made from a very abrasive nickel-plated phosphor bronze alloy. It recorded data using a staggered head in an 8-track format using six tracks for data, one for parity and one for timing. A full reel of tape weighted about 25 pounds requires the use of large motors and a complex pulley and lever mechanism to buffer the tape and enable the rapid start and stop motion required to input small blocks of data to the computer for processing. The block size was limited by the small amount of memory available in early compute systems. Data was recorded at 128 bits/inch at a tape speed of 100 inches/sec. The large inertia of the reels necessitated the use of a relatively large 2.5 inch inter block gap between data blocks of 60 words of 12 characters, resulting in a transfer rate of about 7,200 characters per second. The 1,200 foot long tape of the UNISERVO I had a capacity of about 1.5 MB [
201]. Note that early tape systems recorded data as characters rather than bytes, however, to simplify comparisons to later products the capacities of these systems have been converted to an effective capacity in bytes. In 1952, IBM announced the IBM 726 magnetic tape unit for use with the IBM 701 computer. The 726, shown in Figure
33, implemented an innovative vacuum column technology to buffer the tape during start and stop, which enabled the use of a much lighter but more fragile polymer-based tape and a smaller 0.75 inch inter block gap. The 726 had a 7-track format with an in-line head, using 6 tracks for data and one for parity. The 726 recorded at a linear density of 100 bits/inch in an NRZI encoding format, at tape speeds of 75–100 inches per second. The tape had a length of 1,200 feet and a capacity of about 2 MB. The use of vacuum columns, polymer tape and the 7-track format were widely adopted and became a de facto industry standard. Signal amplification and logic were implemented using vacuum tubes. An important focus in early tape development was on increasing data rates to match the increasing speed of compute systems. Initially this was achieved through a combination of linear density and tape speed scaling. For example, the IBM 727, released in 1955, operated at 15 kB/s and the 729-3, released in 1958 at 63 kB/s. Starting with the 729-2, the vacuum tubes used in earlier drives were replaced with transistors. Another important innovation of the 729 was the introduction of a dual gap head that enabled on the fly read while write verification of the data [
83].
The first automated tape library, the IBM 7955 (Tractor), was developed in 1962 and delivered to the NSA as part of the Harvest computer system. The design was very innovative for its time, using 1.75-inch wide tape, housed in dual reel cartridges and the library was equipped with an automated mechanism to fetch, load and unload cartridges from the six tape drives that made up the system [
165]. However, only one system was ever made, and it would be another 30 years before automated tape libraries began to see widespread adoption. The next decade saw a continued focus on tape I/O performance as well as innovation in tape handling and system integration. In 1964, IBM launched the IBM 2104 (Models 1-3) for use with the IBM System/360. The 2104 was the first 9 track tape format (8 for data and 1 for parity) and introduced CRC. It had a capacity of about 5 MB per reel and a data rate of up to 90 kB/s. The 2104 Models 4-6 followed in 1965 that had a capacity of about 10 MB and a data rate up to 180 kB/s. Two key innovations that enabled this increased performance were the implementation of electronic skew buffers and self-clocking tracks [
83]. The IBM 2420, announced in 1968, operated at a data rate of 320 kB/s and introduced a self-threading mechanism for the tape that provided significant operator time savings. The use of DC motors enabled a tape speed of 200 inch/sec and a remarkable start/stop time of less than 2 ms [
83]. Three models of the IBM 3420 were introduced in 1971 for use with the IBM System/370 computer and operated at a linear density of 1,600 bpi and data rates up to 300 kB/s. In 1973, three additional models were introduced operating at a linear density of 6,250 bpi and data rates up to 1,250 kB/s. The 3420 embedded a set of around one thousand hardware-assist instructions that eliminated the need for a separate switching unit to provide more than one CPU access to the tape drive and can be viewed as the beginning of tapemicrocode [
110].
At the end of 1964
Digital Equipment Corporation (
DEC) introduced the relatively low cost DECtape system (also called Microtape) with its PDP-7 minicomputer. DECtape was based on the LINCtape system designed at MIT-Lincoln Labs. Compared to the other tape systems discussed above (and those that follow), DECtape had many unique aspects. For example, DECtape was a random access block addressable storage device that essentially behaved like a hard disk with very high latency and could be used as the main storage for the operating system. It used 250 feet of 3/4
tape wound on an
reel with 10 tracks, (6 data, 2 mark tracks and 2 clock tracks). Mark and clock tracks were pre-formatted during tape manufacturing. To improve reliability, each track was paired with a non-adjacent track that contained the same data. To make the tape durable enough to support random IO, the surface of the magnetic recording layer was coated with mylar to protect the recording layer from wear. In 1978, DEC introduced DECtape II which was a similar block addressable random-access device but used 0.15
tape housed in a miniature cartridge [
16,
40,
41,
42].
In 1974, IBM released the first commercially available automated tape library, the IBM 3850
Mass Storage System (
MSS). The 3850, shown in Figure
34, was very innovative for its time, utilizing a variant of helical scan [
46,
157] recording technology and cylindrical plastic cartridges with a 1.86-inch diameter and 3.49-inch length. The cartridges held 770 inches (20 m) of tape with a 50 MB capacity and were housed in honeycomb cells along 2 walls of the library that were accessed by two robots (accessors). Data was staged in and out of the system via DASD (HDD) and the host and application treated all data as if it were stored on DASD making the 3850 the first instance of a virtualized storage system. Despite the innovation, the concept was ahead of its time and the 3850 had limited commercial success with no follow-on products [
46,
84,
121].
In 1984, IBM released the 3480 magnetic tape subsystem that became a de-facto industry standard; companies including Fujitsu, M4 Data, Overland Data, StorageTek, and Victor Data made tape drives compatible with the 3480 standard. The 3480, shown in Figure
35, was a significant departure from previous open reel tape drive designs and introduced multiple new innovations including a chromium dioxide particle based media housed in a 4” x 5” x 1” cartridge that replaced the previously used 10.5” reels and set the stage for automated tape libraries. In addition, the 3480 introduced an 18-track MR head manufactured using thin film technology. This was the first use of thin film technology in magnetic recording and was a key technology to enable future areal density scaling via the continuous miniaturization of the write/read transducers through improvements in lithography technology. In addition, the 3480 eliminated the vacuum columns used in previous open reel drives enabling a more compact drive that required half the floor space of the 3420. The 3480 also achieved significant improvements in error detection and correction using
adaptive cross parity (
AXP) coding. The 3480 had a capacity of up to 400 MB and operated at a data rate of 3 MB/s. In 1986, IBM released the IBM3480 IDRC (improved data recording capability) which added hardware-based data compression and enabled a 2x increase in capacity and data rate [
46,
95]. In the same year autoloaders were added to the 3480. The autoloader held up to seven cartridges and automatically exchanged the current tape for a queued cartridge from the bottom of the loader. The 3490E drive released in 1989 used a 36-track format and provided an increase in native capacity to 800 GB [
23,
84].
In the same year the 3480 was released, DEC released the TK50 Compac Tape drive which was intended for use with minicomputers rather than mainframes. The TK50 used a 22-track tape format, recording with a single channel head in a linear serpentine fashion. It had a data rate of 45 KB/s and a cartridge capacity of 94.5 MB [
43]. The second generation, the TK70, was released in 1987 and provided 294 MB capacity at the same data rate as TK50 [
44]. The third generation, the THZ01, which was later rebranded as the DLT260, was released in 1991 with a capacity of 2.6 GB and a data rate of 800 kB/s. The THZ01/DLT260 introduced the use of cylindrical guide rollers to guide the tape through the tape path. Over the next 8 years there were multiple follow-on versions of DLT (digital linear tape) each providing increases in capacity and data rate. The number of channels was first increased to two in the DLT2000 drive and then 4 channels in the DLT 7000. The final version, the DLT8000, was released in 1999 and had a capacity of 40 GB and a data rate of 6 MB/s. In 2000, Quantum introduced the SuperDLT (SDLT format) that used an optically read servo pattern on the backside of the tape for track following [
176]. The first generation SDLT 220 had a capacity of 110 GB and a data rate of 16 MB/s. The final generation was released in 2007 with a capacity of 800 GB and a data rate of 60 MB/s.
In 1987, Storage Tech (STK) launched the Cimarron 4400 ACS (Automated Cartridge System) which became the first tape library to achieve major commercial success and introduced the concept of “Nearline” storage. The 4400 used IBM 3480 compliant tape drives and was based on a modular cylindrical library called a silo. A silo supported up to 16 tape drives, held up to 5,500 200 MB cartridges for a total capacity of over 1TB. The robot accessor had integrated cameras to read barcoded labels on the cartridges. In 1992 STK released the follow-on PowderHorn library that held up to 6,000 cartridges and provided up to 350 cartridges/hour accessor performance as well as smaller libraries called TimberWolf and WolfCreek that held up to 500 and 1,000 cartridges, respectively [
2]. In 2005, Storage Tech launched the SL8500 library with a capacity of up to 10,088 cartridges and up to 64 drives but has since discontinued development.
In the late 1980’s in Japan, a variety of automated tape libraries were also developed. Examples include the NEC N7645 library announced in 1988 with a capacity of 6,250 cartridges, the Fujitsu F6455 with up to 5152 cartridges and the Hitachi H-6951-1 library with up to 6,560 cartridges [
155].
Exabyte Corp. was founded in the mid 80’s with the goal of using consumer videotape technology [
46,
157] for data storage. In 1987 Exabyte released the EXB-8200 drive, the first commercial data tape drive to use video helical scan technology. The EXB-8200 operated at a data rate of 246 kB/s, had a native capacity of up to 3.5 GB and used 8 mm particulate based consumer video tape media. In 1990, Exabyte released that EXB-8500 that provided an increased data rate of 500 kB/s and in 1992 they released the EXB-8505 that provided an increased native capacity of up to 5 GB. The EXB-8900 Mammoth drive was released in 1996 and used
advanced metal evaporated (
AME) media to achieve a 3 MB/s data rate and 20 GB capacity. The final version, the Mammoth-2 was released in 1999 with a data rate of 12 MB/s and capacity up to 60 GB.
In 1989, Sony released the first generation of
Digital Data Storage (
DDS) which was based on
digital audio tape (
DAT). It used helical scan recording [
46,
157] on a 3.81 mm tape and achieved a capacity of 2 GB and a data rate of 183 kB/s. Over the next decade, 5 more generations of the technology were released, scaling capacity and data rate with each generation. The last two generations used 8 mm wide tape and operated at capacities 80 GB and 160 GB and data rates of 6.9 MB/s and 12 MB/s, respectively.
In 1993, IBM launched the 3495 robotic tape library that used a large (c.a. 400 kg), bright yellow, 6-axis industrial robot to serve cartridges to tape drives that were housed in a linear string of frames, as illustrated in Figure
36. The base model was 13.4 m long and held 5,660 cartridges. Three larger configurations were also available, the largest of which was 28 m long and held up to 64 tape drives equipped with autoloaders and up to 18,920 cartridges. The robot could mount 120 cartridges per hour but was not fast enough to keep 64 tape drives continuously busy [
76,
84]. The internal IBM code name for the product was Caballero, but it was often referred to as Conan, as in Conan the Librarian. In the same year, IBM also released the 3494 library for mid-range/open systems. Cartridges were stored horizontally rather than vertically as in the 3495, which simplified the accessor design considerably. The 3494 was a more compact design with a much lighter custom designed robot that could perform up to 250 mounts per hour. The first version consisted of 2 frames, with two 3480 tape drives without autoloaders. This was extended to 8 frames in 1994 and then 16 frames in 1996 providing a capacity of up to 6420 cartridges. In 1997, dual accessors were added, improving the mount rate to 610 mounts per hour. A further enhancement to the 3494 was the introduction of the
Virtual Tape Server (
VTS). The VTS was a large disk cache and server responsible for managing the data and tape media and was placed between the host and tape library. The VTS enabled a much more efficient use of tape capacity and simplified migration to newer tape technology [
84,
131]. In 2000, IBM released the 3584 library that could handle both LTO (see below) and DLT technology. The 3584 used a similar linear architecture as the 3494 and was configurable with 1 to 16 frames and up to 192 drives. In 2006 it was renamed the TS3500. In 2008 the capacity was enhanced through the introduction of HD (high density) slots in which multiple cartridges are stored one behind the other, providing a capacity of up to 1,320 LTO cartridges in a single frame [
76,
84]. Starting around 2004 IBM also began offering a range of mini and mid-range libraries. Examples of other companies producing tape libraries in this period include Qualstar, ADIC (Advanced Digital Information Corporation), Hewlett Packard, and later additional companies including Overland and Spectralogic. IBM’s latest large scale tape library offerings include the TS4500 introduced in 2014 which has a similar architecture to the TS3500 and the single frame TS6000 Diamondback introduced in 2022.
In 1995, IBM released the 3590 Magstar MP (multi purpose) tape drive which was the first tape drive to implement track follow servo control. See Figure
37. Track follow servo was a key innovation that enabled a new era of much faster track density scaling that continues today. The 3590 used an amplitude based servo pattern that is described in reference [
159]. The first version of the 3590 (Model B) had 128 tracks and provided a capacity of 20 GB and data rate of 9 MB/s. It was followed by the Models E and H in 1999 and 2002 which had capacities of 40 GB and 60 GB respectively; both operated at 14 MB/s.
In 1996, IBM introduced the 3570 tape drive targeting mid-range computer systems. See Figure
38. It used a dual reel cartridge and implemented a novel midpoint load design to minimize data access time and boasted an impressive load ready time of 6.7 s and average seek time of only 8 s. The 3570 was the first tape drive to use the TBS technique that has become standard in all recent linear tape drives. The 3570 operated at a data rate of 7 MB/s with an initial capacity of 5 GB that was extended to 7.5 GB in 1999 and 10 GB in 2002 [
76].
In 1996, Sony introduced
Advanced Intelligent Tape (
AIT) which was based on helical scan recording [
46,
157] and used AME media. The first generation had a capacity of up to 35 GB and a data rate of up to 4 MB/s. A total of five generations were released, scaling capacity and data rate to 400 GB and 24 MB/s, respectively, in the 5th generation that was released in 2006. Gen5 of AIT was the first tape drive to use
giant magneto resistive (
GMR) reader technology.
In 1999, Ecrix, which later merged with Exabyte corporation, released VXA tape based on an 8mm helical scan technology [
46,
157]. The first generation, VXA-1 had a capacity of 33 GB and a data rate of 3 MB/s and was followed by two more generations in 2002 and 2005 that had capacities of 80 GB and 160 GB and data rates of 6 MB/s and 12 MB/s, respectively.
In the 1970’s, 80’s, and 90’s there was a proliferation of tape products and formats. In addition to those discussed above, other examples include the
quarter inch cartridge (
QIC) format launched in 1972, QIC-Wide launched in 1994, Travan launched in 1995, QIC-EXtra (QIC-EX) launched in 1996, and SLR (scalable linear recording) launched in 1997. Most of the formats of this era were proprietary. Incompatibility between the many formats made it difficult for customers to change technology and vendor and hence tended to lock customers in. In response to this situation, in the late 90’s IBM, HP and Seagate formed the LTO consortium with the goal of developing a new more open format that provided interchangeability between drives and media from different manufactures. In 1998, the consortium announced the LTO Roadmap. Initially two formats were planned, a single reel format called Ultrium and a dual reel, mid-point load format based on the IBM 3570 format that was called Accellis. The Accellis Gen 1 format was never commercialized and no follow on formats were developed. As a result the term LTO is currently used to refer exclusively to the Ultrium format. In 2000, IBM, HP, and Seagate each released LTO Ultrium Gen 1 drives (see Figure
39). LTO-1 had a capacity of 100 GB and a data rate of 20 MB/s. It used a linear serpentine recording format with an 8-track head that spanned only a quarter of the width of tape reducing the sensitivity to TDS effects by about a factor of 4. To read and write across the full width of tape, the IBM LTO-1 drive adopted a novel architecture that combined a coarse actuator with a fine actuator for track follow servo and replaced the air bearing tape guides used in early IBM drives with tape guide rollers. Another major innovation was the introduction of flat lapped heads built using HDD head fabrication technology on AlTiC wafers to replace the much larger contoured and slotted nickel-zinc ferrite heads used in earlier tape drives [
20]. In addition to other advantages, the light weight flat lapped heads enabled higher bandwidth servo control and hence better track following performance. However, the most revolutionary innovation of LTO was interchangeability, i.e., any LTO1 cartridge provided by the multiple LTO licensed media vendors could be written/read in drives manufactured by any of the LTO drive vendors and then later written/read in a drive from any of the other vendors. LTO-1 drives and follow on generations were made in a higher performance FH form factor and a lower performance/lower cost HH form factor intended for lighter workloads. LTO tape is often referred to as a “mid range” tape solution in terms of performance and cost relative to “Enterprise” solutions often used with mainframes and low-cost solutions used with PCs. Competition between the multiple LTO drive and media providers resulted in more competitive pricing compared to proprietary formats and contributed to LTO becoming the dominant tape format. To date, nine generations of LTO have been brought to market. LTO-2 was introduced in 2002 with a 200 GB capacity and 40 MB/s data rate and introduced a PRML (partial response maximum likelihood) data channel and a more efficient 16/17 modulation code. LTO-2 drives were also backwards compatible in that they were able to read and write LTO-1 cartridges. Seagate’s tape business was renamed Certance in 2003 and then acquired by Quantum in 2004. LTO-3 was also released in 2004 with a new 16 track head format and again doubled capacity and data rate to 400 GB and 80 MB/s, respectively. The potential in linear tape recording to scale the number of parallel channels in a straightforward manner enabled faster data rate scaling compared to helical scan based technologies and was also a contributing factor to LTO becoming the dominant format. LTO-4 was released in 2007 with an 800 GB capacity and 120 MB/s data rate. Tandberg Storage also participated in the LTO consortium for three generations, releasing HH versions of LTO-2 (TS400, 2005), LTO-3 (TS800, 2007), and LTO-4 (TS1600, 2008) drives. LTO-5 was released in 2010 with a 1.5 TB capacity and 140 MB/s data rate followed by LTO6 in 2012 with a 2.5 TB capacity and 160 MB/s data rate. LTO-5 introduced the capability to partition the tape into 2 tape partitions and LTO-6 extended this capability to 4 partitions. LTO-7 was released in 2015 with a 6 TB capacity and used a 32-channel head to enable a data rate of 300 MB/s. The two most recent generations, LTO-8 (2017) and LTO-9 (2021) use the same 32 channel head format and operate at capacities of 12 TB and 18 TB and data rates of 360 MB/s and 400 MB/s, respectively. The latest LTO roadmap describes 5 future generations of the technology, each of which is expected to double the capacity of the previous generation.
SUN Microsystems acquired STK in 2005 and subsequently launched a new family of Enterprise class drives in 2006 that used a linear serpentine format and a single reel cartridge. The first generation, the T10000 had a capacity of 500 GB and a data rate of 120 MB/s. The second generation, the T10000B was released in 2008 with a capacity of 1 TB and a data rate of 120 MB/s. In 2009, Oracle acquired SUN and subsequently released the T10000C in 2011 with a 5 TB capacity and 240 MB/s data rate. In 2013, Oracle launched the T10000D drive with a capacity of 8.5 TB and a data rate of 252 MB/s. A T10000E drive was planned but eventually cancelled when Oracle stopped tape development in 2016.
In 2003, IBM also launched a new family of Enterprise class drives. The first generation was branded 3592 and operated at a capacity of 300 GB and a data rate of 40 MB/s. See Figure
39. The second generation, the TS1120, operated with a capacity of 700 GB and a data rate of 100 MB/s. The TS1120 was the first storage device to provide built-in hardware-based encryption which later also became standard in LTO drives. The TS1130 was released in 2008 and used GMR reader technology to provide a capacity of 1 TB and data rate of 160 MB/s. Compared to LTO drives of the same time frame, Enterprise class drives typically provided higher performance in terms of capacity, data rate, access time and error rate and in addition, enabled “up formatting” of the previous generation of media to a higher capacity using the latest generation of tape drive. The TS1140 was released in 2011 with a 4 TB capacity and a data rate of 250 MB/s. The TS1140 introduced flangeless tape guides that reduced media edge wear as well as active skew control and technology to improve rewrite performance. In addition, the TS1140 introduced a new
data dependent noise predictive maximum likelihood (
DD-NPML) data channel, a more powerful ECC scheme that doubled the length of the C2 code, a novel 3 module head design with 32 active channels and a new servo pattern. In 2014, the TS1150 was released with a 10 TB capacity and 360 MB/s data rate. The TS1150 introduced a new read head based on TMR (tunneling magneto resistive) readers, the first use of TMR technology in tape. The TS1155 was released in 2017 with a 360 MB/s data rate and a 15 TB capacity that was partially enabled by the introduction of a new writer with notched poles and a new high moment liner technology. In 2018 the TS1160 introduced a new tension based active TDS compensation scheme that helped to enable a 20 TB capacity and 400 MB/s data rate. The TS1160 introduced new ECC technology that included a new C2 code that again doubled the code length as well as a novel iterative decoding scheme. The latest generation, the TS1170 was released in 2023 with a 50 TB capacity and 400 MB/s data rate. Many of the technologies introduced in the TS11xx family were also implemented in subsequent generations of LTO drives and were critical to enabling the scaling of IBM’s LTO drives. Currently, LTO and IBM TS11xx Enterprise are the only tape formats still under active development. However, despite consolidation in the industry and convergence to two formats, tape has a healthy ecosystem with companies including IBM, HPE, Dell, Quantum, Spectralogic, Overland and Oracle selling tape drives and libraries, two media manufactures and multiple brands of media. Moreover, tape has significant potential for continued scaling as discussed in Section
12.
13.2 The Evolution of Tape Media
Apart from the UNISERVO 1 tape system that used a metal tape made from nickel plated phosphor bronze alloy, all other commercial data tape media have used a polymer substrate coated with a thin layer of magnetic material. Companies who have manufactured magnetic data tape media include: BASF, Datatape/Graham Magnetics, Fujifilm, Imation, Maxell, Memorex, SONY, TDK, and 3M, with both Fujifilm and Sony still actively developing new tape media products. Two types of magnetic coating have been used: particulate and
metal evaporated (
ME) coatings. In particulate based tape, the recording layer (mag-layer) is made up of small magnetic particles fixed to the polymer substrate with a binder (i.e., “glued” to the substrate). The tape is manufactured by coating the particles onto the substrate in a liquid slurry that also contains solvent and binder, followed by evaporation of the solvent to “bind” the particles to the substrate. In ME tape the mag-layer consists of a continuous, granular magnetic metal film deposited on the polymer substrate under vacuum by evaporation. Exabyte’s Mammoth 1 and 2 as well as Sony’s AIT and SAIT tape, used a metal evaporated mag-layer technology that is described by Kawana et al. [
130]. All other polymer tape media to date, including state-of-the-art products, are particulate based and, therefore, we will focus on the evolution of particulate tape technology here.
Early generations of particulate tape, such as that used for the IBM 726, had a simple bilayer structure consisting of an approximately 10
thick layer of magnetic particles and binder coated directly onto an acetate substrate. The magnetic particles were an acicular form of gamma ferric oxide with particle lengths on the order of 0.2 to 0.8 microns. To achieve the required surface quality, i.e., low defect density, “IBM designed and built the world’s most advanced tape coater in Poughkeepsie, and the first clean room used in manufacturing.” [
23]. In later generations of tape, a thin, carbon loaded back-coat layer was added to control tribo-charging effects and improve the winding/unwinding properties of the tape. Over time, the thickness of the magnetic recording layer and the size of the magnetic particles have been continuously reduced to enable scaling of the areal recording density, as discussed in Section
3. In the 1980’s, FeCo particles with a length on the order of 250 nm were used in a mag-layer with a thickness on the order of 3–5
. In the 1990’s, CrO2 particles were used in the IBM 3480 and then 3490 tape media. In the late 1990’s Fujifilm developed a dual coating technique in which a non-magnetic undercoat and a much thinner mag-layer are deposited simultaneously. This technology enabled a significant reduction in the mag-layer thickness to around 300 nm initially with further decreases in subsequent generations to around 100 nm over the next decade. The introduction of a more advanced dual layer coating technology in 2011 enabled a further reduction to around 60 nm [
154].
Early generations of LTO and Enterprise tape media (IBM TS11xx and STK T10000) used MP (metal particle) technology which was based on acellular particles with a CoFe core and a (non-magnetic) Yttrium based shell a few nm thick which prevented oxidation of the magnetic core. The magnetic axis of the particles was aligned in the longitudinal direction of the tape, i.e., parallel to the tape transport direction through the application of a magnetic field during the coating process. The first generation of LTO media used particles with a volume on the order of 10,000
. LTO-1 media had a mag-layer thickness of 220 nm which was reduced to 100 nm for Gen2 [
20]. With each generation of media, the volume of the particles was continually reduced. Reductions were also made in the width of the distribution of particle sizes to improve SNR. LTO Gen4 media, released in 2007 used a particle volume of about 4500
[
82] and LTO Gen 5 (2010) used particles with a volume of about 2800
[
28]. Below a volume of around 2800
it is difficult to maintain a sufficient particle coercivity to ensure the thermal stability of recorded data [
154], as discussed in Section
2.
In 2011, Barium Ferrite (BaFe) particle technology was introduced in Enterprise tape media and then later also adopted in LTO media. LTO Gen6 supported both MP and BaFe media, whereas generations 7 to 9 are based exclusively on BaFe. BaFe particles have a hexagonal platelet shape and a magnetization that results from crystalline order rather than shape anisotropy. The coercivity of the particles can be tuned by doping the particles with elements such as Co, Zi, or Ti. Moreover, BaFe (BaFe
12O
19) is an oxide and, therefore, does not require a non-magnetic shell to protect against oxidation. As a result, BaFe particles can be scaled to much smaller sizes than MP technology. Initial generations of BaFe media (e.g., IBM TS1140 JC tape) utilized a particle volume of 2,100
and a mag layer thickness of about 70 nm [
71]. In the TS1150 JD media, the particle volume was reduced to around 1950
[
137]. In both JC and JD media, the particles had an essentially random orientation. The particle volume of the TS1160 JE media was further reduced to 1,700
and the particles were partially oriented in the perpendicular direction through the application of a magnetic field during the coating process [
69].
The effective distance between the read and write transducers and the magnetic particles, referred to as magnetic spacing, is determined predominantly by the roughness of the surface of the mag-layer. To enable areal density scaling, tape roughness has been continuously reduced as new generations of media were introduced. For example, in the early 90’s, the roughness of the mag-layer as measured by optical interferometry, was on the order of Ra 7 nm. The introduction of dual coat technology in the late 90’s enabled a reduction to around Ra 5 nm and a continual decrease in subsequent generations to around 1.5 nm by 2012 [
154].
Early open reel tape products used an Acetate substrate which was later replaced by PET to improve tape robustness against breakage during the continuous start/stop usage pattern of early tape drives. Later, PEN was also used as a substrate (for example in the IBM 3590 Extended cartridge). Recent generations of LTO tape have used PET, PEN, and Spaltan, which is a blend of PET and Aramid, whereas the most recent generations of enterprise tape media have used Aramid as a substrate. Of these four materials, Aramid is the most robust and has the lowest dimensional change versus environmental conditions, but is the most expensive.
Over the history of tape, the thickness of the tape media has been gradually reduced to enable longer tape lengths and hence higher reel/cartridge capacities. For example, the 8-inch diameter (203 mm) reel used in the IBM 726 held 1,200 ft (365 m) of tape that was about 58 thick. The tape cartridge introduced with the IBM 3480 had a diameter of about 96 mm and held 541 ft (165 m) of tape that was about 30 thick. LTO cartridges have about the same reel diameter but in Gen 1 they held 609 m of 8.6 thick tape and by Gen 9 the length was scaled to 1,035 m of 5.2 thick.
The most common tape width format is ½ inch (12.65 mm), introduced with the Uniservo 1 and IBM 726 and still used in LTO and recent Enterprise tape media. However, over tape’s 70+ year history a variety of other formats have been used, including 4 inch, 1 inch, ¾ inch, 8 mm, ¼ inch, 4 mm and 1/16 inch.
The Uniservo 1 and IBM 726 used open reels with an 8-inch diameter. In 1953, IBM introduced the 727 magnetic tape unit with 10.5-inch reels that became a de facto industry standard for around 25 years [
46]. In 1984, IBM introduced the 3480 that replaced the 10.5
reels with a
(
mm
mm
mm) cartridge. More recent generations of IBM and STK/Oracle Enterprise tape have used a similar size cartridge. The LTO tape cartridge format is slightly smaller (102 mm x 105.4 mm x 21.5 mm) and is similar to the dimensions of the DLT cartridge. A variety of dual reel cartridge formats have also been used, such as in the DDS, IBM 3570, and AIT. Dual reel cartridges can provide an advantage in terms of access time, but have a lower volumetric storage efficiency.
In the first generation of AIT, released in 1996, Sony introduced the concept of a CM with their MIC (memory in cassette) technology that was used to store meta data and improve data access performance. In the second generation of AIT, Sony introduced a contactless CM technology called R-MIC and used it to enable the first WORM tape cartridge. LTO and recent Enterprise tape products also use a contactless CM and have also offered WORM capability in all recent generations, starting from STK T9940A, IBM 3592, and LTO-3.
State-of-the-art media technology is described in Section
3 and potential future media technologies are discussed in Section
12.
13.3 Tape Usage and Software History
Over the more than 70 year history of magnetic tape storage there have been profound changes in the way tape is utilized in the computing environment, and corresponding changes in the types of software available for making use of tape storage.
Magnetic tape storage and the wide availability of general-purpose computers both appeared in the early years of the 1950s. However, neither of these technologies entered a vacuum; in fact, both replaced or augmented equipment and processes that had been developed over many decades.
In the 1890s, Herman Hollerith’s use of punched-card equipment to process the 1890 U.S. Census ushered in an era of card-based record-keeping and accounting procedures that would form the basis for most early automated business processing. Hollerith’s Tabulating Machine Company eventually became a part of the
Computing-Tabulating-Recording (
CTR) Company, which was renamed in 1924 to
International Business Machines (
IBM) [
12].
By the 1930s, IBM and other manufacturers had a wide array of card-based equipment available, including card readers, sorters, printers, collators, and even plugboard-programmable “tabulators” capable of counting, addition and subtraction, and providing multiple levels of accumulators. This equipment was often referred to collectively as
Electric Accounting Machines (
EAM) [
12].
13.3.1 Early Computing and Tape.
By the time the first generation of business-oriented electronic computers became available in the 1950s, EAM equipment was in wide use in academia, business, and government. Early computer systems did not entirely replace this equipment, but instead were often integrated into existing workflows and used to augment the EAM equipment. The card equipment and printers that were developed for EAM were adapted as I/O devices for the early computers, and much of the processing initially remained card-based.
Into this environment came magnetic tape storage. Tape had several benefits over card data storage, including faster I/O time, removal of the 80-character record limit, and much improved volumetric efficiency. However, it also had some drawbacks. Data on tape is intrinsically difficult to re-sort into a different order (a common process in EAM processing) and cannot be accessed or modified manually (like a card in a bin). Additionally, many potential customers were skeptical of the safety of data stored invisibly on a tape reel [
12].
The first software specifically written for tape storage would almost certainly have been the
Input/Output Control Systems (
IOCS) for accessing tape devices from users’ programs. The IOCS provided subroutines that relieved the programmer of having to rewrite the code to perform tape I/O and manage potential errors for each program [
89].
Because early computer systems had different architectures (instruction sets, word lengths, etc.) not only between manufacturers but also between different models from the same company, IOCS routines had to be rewritten for each machine type that could attach to tape devices. These routines were sometimes in the form of an executable card deck appended to an object (executable) program deck, as was the case for tape-only IBM 1401 systems [
91]. At other times they were incorporated into a simple disk-based operating system, as was the case for the IBM 1410/7010 [
92].
Another early form of tape software provided simple utility functions such as card-to-tape, tape-to-printer, tape-to-punch, and tape-to-tape copies, and record sorting [
90]. A classic example of a utility from the System/360 era was called DEBE (“Does Everything But Eat”); such utility functions were important because, as mentioned earlier, much early computer processing remained card-based.
13.3.2 Tape Adoption and Early Usage.
Over time the advantages of tape storage over cards became apparent, and tapes began to replace cards as the medium for keeping the master version of an organization’s data [
12]. Records on the tape “master file” were usually written in order by a key such as employee number or account number. A typical daily or weekly processing run might consist of sorting a set of transactions to be applied to the master file into key order, then processing the transactions against the master tape and writing updated or unchanged master records to a new output tape. The output tape then became the new “generation” of the master file. A system like this had the added benefit of creating point-in-time backups of the master file as it existed before each transaction processing run.
Although smaller computers like the IBM 1401 were often used as the only computing system for a small-to-medium sized business, for larger installations running computers like the IBM 7000 series, smaller systems were frequently used as front-end and back-end processors. Data to be processed by the larger system arrived in the form of cards and was copied to tape by the smaller machine. The tape inputs were then processed on the faster, more powerful system, and the results were written to output tapes. Finally, the smaller system wrote the output reports from tape to a printer. Such a workflow allowed an organization to optimize the use of its larger, more expensive systems [
12]. This approach also helps demonstrate the need for simple tape-based utilities.
While disk drives were invented in the mid 1950s, it would be many years before the capacity and cost of disk drives made them an attractive replacement for the large-scale data storage role that tapes filled.
13.3.3 Advanced Operating Systems.
The 1960s saw many innovations in the computing landscape, including two very ambitious operating system development projects: Multics [
152], the time sharing system project that was a joint venture between MIT, AT&T Bells Labs, and General Electric, and OS/360 [
167], the operating system for the new, converged line of System/360 processors from IBM.
Multics was to be MIT’s follow-on to its
Compatible Time-Sharing System (
CTSS) [
197], which ran on an IBM 7090 system using 19 IBM tape units. While CTSS allocated two tape units to each online user, according to one Multics developer tape “was considered a throwback to an earlier age” [
151], and was used primarily for backup.
OS/360 became the ancestor of the IBM operating system that was known for many years as MVS and is today the mainframe operating system z/OS [
107]. Because OS/360 was initially designed as a batch-oriented operating system, tape support was integral to it. (Time sharing, in the form of the Time Sharing Option, or TSO, was later added to OS/360.)
Both of these projects were very optimistic in their estimates of the amount of time and effort required to build a large-scale, third-generation operating system. At IBM, the OS/360 project was so far behind that it was necessary to build a set of scaled-down, “interim” operating systems. Two of these were eventually named DOS/360 (Disk Operating System), and TOS/360 (Tape Operating System). The two systems shared most of their code, and either could be generated from the same source files. As their names imply, the difference between them is whether the Operating System is resident on disk or tape. TOS/360 would run on systems having only tape storage and as little as 16K bytes of main memory [
167]. (As an interesting aside, DOS/360 became DOS/VSE and eventually z/VSE; IBM support for z/VSE was finally dropped in 2023.)
Due to resource constraints, some managers on the OS/360 project favored requiring disk drives on all System/360 systems. However, a survey of early orders showed that more than 20% of the orders for low-end System/360s were for tape-only systems [
167].
13.3.4 Device Independence.
When OS/360 eventually became available, one of its notable characteristics was device independence for application programs. Access to datasets (files) was managed by a set of “access methods” (somewhat analogous to modern file system drivers). If an underlying device was capable of supporting a particular access method (for example, sequential access), then the access method took care of all of the device details, and application programmers were isolated from the type of device the dataset resided on [
93].
Taking this a step further, the system had a “catalog” that kept track of the volume on which a particular dataset was stored. When a dataset was created, the JCL that initiated the program creating the dataset could specify a device type for the output (e.g.,
UNIT=TAPE or
UNIT=DISK). When the dataset was later read, the JCL had only to specify the dataset name, and the system catalog would fill in the volume information. Both disk and tape datasets had standard dataset header records that recorded the dataset name and other information. All this allowed disk and tape storage to be used interchangeably for datasets using sequential access [
93]. The system also supported stacking multiple datasets on a tape (the catalog would keep track of the dataset number), but in practice this slowed down access and made tape reclamation more difficult, and so was not routinely used.
The device independence of OS/360 meant that many application programs could use either tape or disk storage, with the decision deferred to execution time. Of course, there were applications that used non-sequential (e.g., direct offset) access that could not be mapped to tape.
OS/360 also included a large collection of utility programs [
94], most of them supporting tape as well as disk operations. Included were programs for dumping disk volumes to tape as an early form of disk backup. (Data could also be copied to tape at the dataset level for backup, but no backup management software as we know it today was included.) Some utility programs, such as sort/merge, were specifically optimized for tape usage. (One of the most successful early non-IBM software products was an optimized sort/merge program called SyncSort.)
13.3.5 Minicomputers.
Another major development of the 1960s was the appearance of minicomputers, small systems with fewer capabilities but a much lower cost than the reigning “mainframe” systems of the day.
Like their larger counterparts, minicomputers had a proliferation of different operating systems. Each of these operating systems had some form of software support for backup to tape, written either by the manufacturer or sometimes by a third-party software company.
The earliest commercially successful minicomputer was arguably the DEC PDP-8, released in 1965. It was just one of a long line of DEC PDP systems.
Many of the PDP-series machines included one or two DECtape devices [
16,
40,
41,
42]. As mentioned earlier, DECtape was a random access, block addressable device. Machines such as the PDP-8 and PDP-11 could run a small operating system directly from DECtape, sometimes using a second DECtape as a swap device. Of course, DECtape could also be used as data storage, and users often kept their private files on the pocket-sized 4-inch reels.
13.3.6 Unix/Linux.
Unix [
175] was originally developed on DEC minicomputers (the first being a PDP-7, but quickly reimplemented on the PDP-11). It was developed by AT&T employees who had earlier worked on Multics, and in fact the name was originally a pun on the name Multics. (Linux is a later reimplementation of all of the major concepts and features of Unix, but as an open-source project.)
One of the key concepts in the Unix operating system is that of a “device file”, that is the idea that every device can be identified by a name in the file system tree, and that any device can be read or written (with appropriate authority) by using the name of the device as a file name (for example, /dev/st0 for a tape in a modern Linux system).
Unix I/O is essentially byte-stream oriented, and any device can be read or written as a byte stream. (Obviously there are devices, such as disk drives, where this would be disastrous, but it is possible. The system is designed so that normal users do not typically have the authority to write directly to system devices.)
For magnetic tape devices, this means that data can be read from or written to tape just by specifying the name of the tape device file as the input or output target for essentially any command. (Once again, with the proper authority.) This gives modern Unix and Linux systems a similar level of device independence for applications as was discussed earlier for OS/360.
There are a set of utility programs in Unix/Linux for managing tape devices. The first is the mt (magnetic tape) command, which allows for device control such as rewinding, writing tape marks, skipping to tape marks (files), erasing, and so on. More well-known is the tar (tape archive) command, which bundles files into a single archive (similar to a zip file) and (optionally) writes the file to tape.
There are many other utilities which can be used to write and manage backups of data to tape (or other devices) on Unix/Linux systems. One of the best-known is rsync.
13.3.7 Microcomputers.
In the later part of the 1970s, a new generation of computer user was introduced to a very different type of tape storage. Early microcomputers such as the TRS-80 and the Commodore PET used cassette tape storage for both programs and data. These systems used commercial audio cassette mechanisms and media to read and write data, and a standard for using audio tapes for computer data called the “Kansas City” standard [
53,
144] was developed. Games, assemblers, and utility programs for the systems were released on cassettes during the late 1970s.
Cassette tape storage on these systems worked but was not particularly reliable. Many hours were spent by early microcomputer users waiting for cassette tape programs to load, or trying to debug problems reading back previously written data.
There were some early microcomputer tape systems that attempted to replace the use of cassette drives with more flexible and reliable tape-based devices. One example was the Exatron “Stringy Floppy” [
145], released in 1978, which used a tape cartridge about the size of a business card and 3/16 of an inch thick (often referred to as a “wafer”). The tape was in a continuous loop, and moved in only one direction. It was a “direct access” device that could be used in place of a diskette drive. Wafer sizes varied from 5 to 75 feet, which corresponded to about 4 KB–64 KB.
13.3.8 PC Era.
The introduction of the IBM Personal Computer (PC) [
109] in 1981 signaled the start of a new era in small computers. While microcomputers were already being used in some businesses, the category as a whole was still considered primarily a hobbyist one. The introduction of a small computer with the name IBM changed that, and fairly quickly IBM PCs were visible in businesses of all types and sizes. The term “PC” soon replaced the older term microcomputer.
While the original target price-point for the IBM PC was $1,500, in practice, after adding a monitor, one or two 5¼-inch diskette drives, and perhaps a printer, the price was considerably higher. This made the new IBM system less attractive to hobbyists and home users, and more of a system for business.
The IBM PC did have a cassette tape port in the same form-factor as that on the TRS-80, and cassette-based commands were part of BASIC in ROM. However, given the market for the new system, it is unlikely that many systems ever attached a cassette drive. The cassette port was dropped in the follow-on version, the IBM PC-XT.
By the later half of the 1980s many companies were producing tape backup systems for PCs based on QIC mini-cartridge tape technology. In 1989 PC Magazine tested QIC mini-cartridge drives from 14 different manufacturers, and identified six others [
199]. A Computerworld article in 1987 predicted sales of more than 250,000 QIC mini-cartridge drive units for that year alone [
148].
One early manufacturer, Colorado Memory Systems, was eventually acquired by HP. HP continued to market tape backup systems using the Colorado brand into the early 2000s. A 1998 version of their product attached to the IDE disk controller of a PC, could use a cartridge with a capacity of up to 8 GB, and included sophisticated backup software [
85].
Other tape solutions for PCs based on tape formats such as DLT and DAT were also introduced. However, by the late 1990s most PCs were being shipped with Compact Disc (CD) drives, and writable CDs with capacities in the 600–700 MB range were becoming common. The ubiquity and ease of use of CD-based optical media soon made it the preferred backup medium for individual PC users.
In the enterprise environment network-based backup software was becoming more common. For example, the Adstar Distributed Storage Manager (ADSM, later re-branded to TSM) [
27] was introduced in 1993 and uses a client-server backup architecture. Servers can be run on mainframes, Unix workstations, and even well-equipped PCs, while client software exists for almost every imaginable system from PC-DOS to Cray supercomputers. ADSM/TSM servers typically use a combination of disk and tape devices for storage, and the server supports a wide array of different tape formats and devices.
13.3.9 Workstations.
At about the same time that PCs were becoming common, another type of system was entering the marketplace and becoming common in certain environments. This was the workstation, as exemplified by products from companies such as Apollo, Sun, Silicon Graphics, HP, and others. Workstations were usually similar in form factor to PCs: a processor (with memory, etc.) in a case, along with a keyboard and monitor, and perhaps some specialized input/output hardware attached. The difference was primarily one of scale and focus. These systems were often used by a single user but also typically ran operating systems that gave them multi-user capability.
Workstations were usually significantly more powerful in one or more respects than a PC of the same era. Some workstations focused on CPU power, with support for floating-point performance, for example. Others focused on graphics, with high-resolution displays and perhaps 3D graphics acceleration. Still others focused on workgroup support with powerful networking features. Most ran operating systems that were much more powerful than the PC operating systems of the day; often the OS was a derivative of Unix or otherwise Unix-like. As on PCs, tape was commonly used for backup on workstation systems, attached to either the workstation directly or to a central server. DLT tape appears to have been a popular alternative to QIC cartridges for workstation backup.
As PC hardware and software evolved through the 1980s and beyond (often driven by the high-resolution 3D graphics and powerful CPUs required for PC gaming), the differentiation between workstations and PCs blurred, and today there is little to separate the two.
13.3.10 Changing Uses of Tape.
As can be seen from the preceding discussions, in the PC, workstation, and even parts of the minicomputer environments the main use case for tape had always been data backup.
However, in the older large systems environment tape had initially been the primary high capacity data storage device. Early disk drives were too small and too expensive to be used to replace tape. But as disk technology advanced, the cost of disk storage became more competitive with tape, driven partly by competition from the many disk drive vendors that sprang up beginning in the 1970s.
Thus during the 1970s and 80s much of the traditional role of tape storage began to shift to disk. There were many factors that drove this change, including:
—
the growing capacity and shrinking cost of disk storage.
—
the increasing use of real-time access to data through data terminals.
—
the use of database systems.
—
the growing use of online/time-sharing systems.
—
the cost and labor involved in storing, managing, and manually fetching and loading tapes.
During this period, however, new use cases for tape in the enterprise environment evolved and began to grow in importance. The advantages of tape in capacity and long-term cost for nearline and offline data drove its use in new technologies such as HSM, while its value in data protection (backup), and data repository (archive) led to new applications in those areas.
In addition, the transition from large reels of tape to small cartridges allowed the creation of automated tape libraries, which in turn enabled the storage and access of huge amounts of data in a relatively small footprint, and without human intervention.
These technologies and use cases, along with others that have developed more recently, have already been covered in detail in earlier sections.