Zoë Blade's notebook

44.1 kHz

44.1 kHz is a popular sample rate, for legacy reasons.

Digital audio processors

Digital audio processors generate a lot of digital data. Back in the 1980s, it was simply too much to fit on any household medium, with the exception of videotape. So that's what they recorded to.

Standard Framerate (contains two interlaced fields) Visible lines Lines used by digital audio processor Total lines
PAL 25 Hz[1] 576 588 625[1]
B&W NTSC 30 Hz[1] 480 490 525[1]
Colour NTSC 29.97 Hz[1] 480 490 525[1]

By multiplying the framerate (frames per second) by the number of lines used per frame, we get the number of lines used per second:

Standard Frames per second Lines used per frame Lines used per second
PAL 25 Hz 588 14,700
B&W NTSC 30 Hz 490 14,700
Colour NTSC ≈29.97 Hz 490 14,685.3

Each line can store 3 stereo pairs of 16-bit samples. By multiplying the lines used per second by the pairs of samples per line, we get the sample rate per second:

Standard Lines used per second Pairs of samples Sample rate
PAL 14,700 3 44,100 Hz
B&W NTSC 14,700 3 44,100 Hz
Colour NTSC ≈14,685.3 3 44,056 Hz

Each line stores 3 pairs of 16-bit samples, so 3 × 2 × 16 = 96 bits per line.

For example, Sony's PCM-F1 could record onto NTSC videotapes at 44.056 kHz or PAL/SECAM videotapes at 44.1 kHz, depending on whether you bought the Japanese/American version or the European version. (Theoretically, B&W NTSC could support 44.1 kHz, but there was no such thing as B&W NTSC consumer videotapes.)

CDs

The PAL sample rate was adopted by the global CD format. CDs were originally made using the output of a professional digital audio processor, such as Sony's PCM-1600.

Curiously, while CDs are split into sectors and frames (analogous to fields and lines respectively), they're different to any of the above videotape field and line lengths, at 75 fields per second, 98 lines per field, and 6 stereo pairs of 16-bit samples per line.[2]

Incidentally, a full 75 Hz sector is the shortest addressable point on a CD, just as a single field is the shortest addressable point on a videotape. This explains some tracks — such as Nine Inch Nails's "A Warm Place" — starting with a very brief snippet of the previous track. Perhaps they should have addressed the next sector instead...

While I'm not clear on how CDs were able to be so much smaller than LaserDiscs, it presumably helped that they have no colour signal, and that the luminance signal can be entirely Boolean, as that's all a PCM decoder needs.

CDs became wildly popular, as the first pre-recorded digital format available to consumers. As a result, the sample rate of 44.1 kHz became even more popular, spreading to home computers and their beige successors, and subsequent formats such as DAT and MiniDisc.

References

  1. "ITU-R BT.470-6" International Telecommunication Union, p. 2
  2. "CD-Recordable FAQ: What's a Frame? CIRC Encoding? How Does ECC Work?" Andy McFadden

Sample rates: 8.363 kHz | 31.25 kHz | 32 kHz | 44.056 kHz | 44.1 kHz | 48 kHz