Canon · Audio · Lineage Mode

Edition I — Audio. The Phonograph as Object Lesson

2026-05-15 · Stax Edition I · 40 min · CC BY-NC 4.0

The audio component of Stax Edition I — the Phonograph capsule. Six chapters tracing the lineage of single-purpose tactile audio control surfaces, from the telegraph key of 1844 through the studio fader, tape transport, hi-fi knob, and remote control, to the Sonos volume knob the Phonograph is built against.

See also: the companion essay, the object specification, and the workshop log for this audio production.

Direct downloads: Opus (30.8 MB, 96 kbps) · MP3 (36.7 MB, 128 kbps) · Metadata (JSON) · Chapter markers (TXT)

Duration
40 min 7 sec
Chapters
6 (plus a brief program introduction)
Narration
Piper TTS (en_US-libritts_r-medium)
Ambient bed
Synthetic pink-noise reverb (sox-generated, CC0-equivalent)
License
CC BY-NC 4.0

Chapters

  1. Intro. Program Introduction
  2. I. The Telegraph Key
  3. II. The Studio Volume Fader
  4. III. The Tape Deck Transport Buttons
  5. IV. The Hi-Fi Volume Knob — Marantz, McIntosh, Krell
  6. V. The Remote Control's Rise And Disappearance
  7. VI. The Sonos Volume Knob, And What The Phonograph Inherits

The Lineage Mode pipeline

This is the first Lineage Mode audio program. The pipeline that produced it is a shell harness around Piper TTS, sox, and ffmpeg — proven tools wired together without ceremony. The production-grade Membrane Framework rewrite is gated on the AETHER orchestration substrate per the design system specification §4; that integration ships in a later session.

The four-component capsule rule is preserved: the object (the Phonograph remote itself), the editorial (the Sonos-monoculture essay), the software (the AGPL firmware + companion client), and this audio program together constitute Edition I. The capsule is the unit. No component ships in isolation.

Transcript

Full narration text. The audio above is generated from this script via Piper TTS.

Program Introduction

A sixty-minute audio program. Stax Edition One. The audio component of the four-component capsule that ships with the Phonograph.

This is the history of the single-purpose audio remote, traced from the telegraph key of eighteen forty-four to the studio fader handle of the nineteen thirties, to the hi-fi volume knob of the nineteen fifties and sixties, to the remote control's rise and quiet disappearance, and finally to the Sonos volume knob that the Phonograph is built against. Six chapters. Ten minutes each. The argument across all six is the same argument the companion essay makes in print: the touchscreen-on-everything monoculture has lost the plot, and the lineage of single-purpose tactile control surfaces is the substrate the next decade of consumer hardware will be re-built on. The Phonograph is one instance of that lineage. It is not the only one. It is the one we are personally responsible for.

Chapter 1 — The Telegraph Key

In the spring of eighteen forty-four, in the basement of the Capitol building in Washington, Samuel Morse sat at a small wooden table and tapped the words "What hath God wrought" along a copper wire that ran forty miles to Baltimore. The message was received. It was sent again. It was repeated to demonstrate that the line was real and that the line was reliable. That demonstration is the founding moment of electric communication, and the device Morse was tapping is the founding object of the lineage this audio program traces. The telegraph key was the first single-purpose tactile control surface in the history of electric machines. Every Phonograph, every studio fader, every Hi-Fi knob, and every remote control we will discuss across the next five chapters is in some sense descended from the telegraph key, because the telegraph key established the basic architectural fact that an operator's hand, performing a small repetitive motion against a calibrated mechanical resistance, can transmit information across an arbitrary distance through an electric substrate.

The key itself is a beautifully minimal object. A flat brass lever, hinged at one end, with a knob at the other. A pair of contacts at the bottom of the lever's travel, separated by a spring. A binding post for the wire. That is the entire device. The operator presses the knob, the contacts close, current flows, an electromagnet at the receiving end pulls a stylus down onto a moving paper tape. The contact is held for a short interval or a long interval. The intervals are read off the tape as dots and dashes. The dots and dashes are decoded into letters. The letters are read into words. The words are read into messages. The whole communication architecture rests on one operator's thumb pressing one knob against one spring with one calibrated rhythm. Everything else in the system, from the wire to the relays to the receiving register, is in service to that single tactile gesture.

The telegraph key matters to the Phonograph's lineage for three specific reasons.

The first reason is mechanical. The telegraph key is the original example of what the industrial-design literature would later call a single-purpose tactile control: one job, one motion, one calibrated mechanical response. The operator's hand learns the spring tension. The operator's fingertip learns the contact distance. The operator's wrist learns the rhythm at which the knob can be pressed without fatigue, and the operator's ear learns to distinguish a clean dot from a smeared dot at the receiving end. None of those learnings transfers to a different device. They are calibrated to the telegraph key specifically. By eighteen sixty, after the network had grown to thousands of miles and tens of thousands of trained operators, telegraph operators were a distinct labor category with a distinct apprenticeship and a distinct professional identity. They were the first electric-substrate workforce, and their identity was bound up in the specific tactile properties of the specific instrument they had spent years learning.

The second reason is economic. The telegraph key was deliberately cheap to manufacture and deliberately easy to repair. Western Union, by the eighteen seventies, had standardized on a key form factor that any field operator could replace with a screwdriver. The bearings could be tightened. The contacts could be filed clean. The spring could be swapped out from a parts box that every Western Union office kept stocked. The key was an appliance, in the original sense of the word: an object that performed a specific function reliably, that could be maintained in the field, and that did not require returning to the manufacturer when it wore. The economic durability of the telegraph key is one of the structural reasons the telegraph network expanded as fast as it did. Operators could keep their equipment running.

The third reason is doctrinal. The telegraph key established that the right interface for a high-frequency repetitive control task is a single dedicated object with one job and one feedback mode. Anyone who has read the human-factors literature knows this. Alphonse Chapanis, working at Johns Hopkins in the nineteen forties and nineteen fifties on aircraft cockpit design, codified the principle in research that is still cited today. Operators performing safety-critical or high-frequency tasks while attending to a primary workload — flying an airplane, driving a car, controlling a Morse circuit during a thunderstorm that's degrading the line — benefit from controls that are tactilely distinguishable, mechanically calibrated, and dedicated to a single function. The telegraph key was the prototype.

The early Morse keys, the so-called "camelback" keys of the eighteen forties and fifties, gave way over time to flatter, lower-profile designs. The Vibroplex semi-automatic "bug" key, patented in nineteen-oh-four, mechanized the dot sequence with a vibrating reed so an operator could send faster without fatigue. The straight key was retained for slower or formal traffic and for emergency use, where the mechanical simplicity made it more reliable than the bug. Over a hundred-year window, the telegraph key remained recognizably the same object: a single tactile control, a calibrated mechanical resistance, a binary electrical contact. It was finally displaced in commercial service by teleprinters and by voice radio in the mid-twentieth century, but the object did not disappear. Amateur radio operators still use them. Maritime services kept them on inventory through the nineteen nineties. The U.S. Navy was teaching Morse code at its Naval Telecommunications Command training school until at least the late nineteen nineties, and a subset of submarine and special-operations communicators still know it today as a low-bandwidth, low-power, jamming-resistant fallback.

The point is not that the telegraph key is old. The point is that the architectural posture the telegraph key embodies — single function, calibrated tactile response, repairable in the field, learnable by an operator over years — has been the right posture for high-frequency control tasks for the entire history of electric communication. Whenever an industry has departed from that posture, in pursuit of generality, in pursuit of cost savings, in pursuit of multi-function consolidation, it has eventually come back to it. We will see the pattern play out four more times in the chapters ahead.

Before we leave the telegraph era, one historical note about who controlled the bottleneck. Western Union, by the eighteen seventies, owned virtually all of the long-distance telegraph capacity in the United States. The telegraph keys themselves were commodity manufacturing. The wire was commodity manufacturing. The operators were trained but not unionized in the modern sense. The defensible commercial position was the network — the routing infrastructure between offices, the relationships with the railroad rights-of-way, the rate schedules with the major newspapers and brokerages. The keys were a tool. The substrate was the asset. We will come back to this distinction in chapter four, when we discuss the Marantz and McIntosh audio companies. The substrate-versus-appliance question is the structural question that runs through the whole audio program, and Western Union is the first lesson in it.

Chapter 2 — The Studio Volume Fader

In the autumn of nineteen-twenty-six, the British Broadcasting Corporation moved into a new studio at Savoy Hill in London. The building had been converted from offices, the acoustics had been treated with carpet and curtain, and the broadcast-engineering team had built the first generation of mixing consoles to handle the multiple microphones, multiple gramophone turntables, and multiple announcer positions that a live BBC broadcast required. Those consoles were the first generation of what the audio industry would come to call studio mixing desks, and the control surface on them was a row of rotary potentiometers — knobs, basically — that the broadcast engineer would turn to balance the level of one microphone against another, to fade in a gramophone record under an announcer's introduction, and to bring up a remote feed from a concert hall on the other side of London.

The rotary potentiometer worked. It had been the standard volume control in radio sets since the early nineteen-twenties. But the rotary control had an ergonomic problem for live broadcast engineering: the engineer's hand had to grip and turn the knob, and the turn was a continuous arc that required visual attention to read the position. If the engineer wanted to fade two channels simultaneously, the engineer had to use both hands. If the engineer wanted to fade a channel down while reading the script for the next announcement, the engineer had to look away from the script to confirm the knob's position. The rotary worked for static mixing. It worked poorly for the dynamic mixing that live broadcast required.

The solution that emerged over the nineteen-thirties was the linear fader. A linear fader is a sliding potentiometer with a long travel, mounted vertically on the console face. The engineer puts a finger on the fader knob, and a downward motion brings the channel level down, an upward motion brings the channel level up. The position of the fader can be read at a glance, by the position of the fader knob along the travel, without the engineer having to lift their eyes from the script or the cue sheet. Two faders can be moved simultaneously with two fingers of the same hand. Four faders can be moved with four fingers. The whole console becomes a tactile instrument that the engineer can play with the same kind of muscle memory a pianist develops, and the play is fast enough to keep up with the pace of a live broadcast.

The earliest linear faders were carbon-track potentiometers with metal slide carriages, prone to crackling as the track wore. By the nineteen-fifties, conductive plastic tracks had largely replaced carbon, and the smoothness and longevity improved enormously. By the nineteen-seventies, when the British console manufacturer Solid State Logic launched the SSL Four Thousand series console, the fader was a Penny and Giles unit with a hundred-millimeter travel, a smooth conductive-plastic element, a calibrated detent at the zero-decibel mark to give the engineer's finger a tactile reference for unity gain, and a side handle on the fader knob that an engineer could grip rather than pinch.

The fader handle is the part of this story we want to focus on. The fader knob is not just an electrical contact point. It is a sculpted handle, designed to be gripped from above by an index finger, or pinched from the side by an index finger and thumb, or pushed from below by a thumb in a fast bring-up move. The geometry is calibrated to the human hand. Older Penny and Giles faders had a knob that was essentially a rounded rectangular block, fluted on the sides for grip. Newer ones from companies like Alps and ALPS Alpine have refined the geometry. Some studio engineers prefer a particular knob style and have it swapped on every console they work on. The fader knob is, for a working broadcast or recording engineer, a personal instrument in the same sense a violinist's bow is a personal instrument.

The reference points are explicit. The Solid State Logic Four Thousand series, in widespread service from the late nineteen-seventies through the nineteen-nineties, was the dominant large-format mixing console in pop and rock recording. Albums on the SSL include the entire eighty-percent of major-label production in that window, and engineers trained on SSL consoles have a distinct working idiom built around the SSL fader and the SSL channel strip. The Neve Eighty-Eighty consoles from the nineteen-seventies, particularly the Eighty-Eighty-Eight and Eighty-Eighty-Four models that became cornerstones of recording rooms like Sound City and Trident, used a different fader feel — more friction, longer travel — and engineers who trained on Neve developed a different mixing rhythm. The Solid State Logic versus Neve distinction is felt by working engineers as a difference in handheld feel as much as a difference in sonic character.

We dwell on these distinctions because they are the same distinctions that the Phonograph specification dwells on. The companion document to this audio program — the industrial-design specification for the Phonograph — names two reference points for the rotary encoder feel: the Behringer X-Thirty-Two channel strip encoder and the Sennheiser HD Six-Fifty headphone cup detent. Both of those references come out of the studio fader tradition. The Behringer X-Thirty-Two is a modern digital mixing console that uses rotary encoders for its channel strip in a way that consciously imitates the SSL feel. The Sennheiser HD Six-Fifty is a flagship audiophile headphone whose adjustment detents were tuned by people who came out of the broadcast and recording side of the industry. The Phonograph's knob lineage is, through these two named references, a direct descendant of Savoy Hill in nineteen-twenty-six.

The economic note about the fader era is worth making explicit. The studio fader was never a mass-market consumer object. It was always a professional tool, sold in batches of dozens at a time to console manufacturers who would integrate them into multi-thousand-dollar mixing desks. The whole supply chain — Penny and Giles, P and G Audio, Bourns, Alps, ALPS Alpine, the bespoke high-end faders from Penny and Giles' Cwmcarn factory in Wales — was a professional supply chain. The unit volumes were small. The margins per unit were high. The customer base was specialized and demanding and informed. That is the economic shape of every premium tactile-control category we will discuss across this audio program. It is not a mass-market shape. It is a professional-tools or premium-niche shape, with high margins per unit, low unit volume, and a customer base that knows what it wants.

The Phonograph follows the same shape. It is not a mass-market remote control. It is a premium tactile-control object sold in runs of one hundred to a customer base that has organized itself around no longer tolerating the touchscreen-on-everything monoculture. The economic ancestor of the Phonograph is not the universal remote you bought at Best Buy in nineteen-ninety-eight for fifteen dollars. The economic ancestor of the Phonograph is the Penny and Giles fader you bought in nineteen-seventy-eight for nine hundred dollars, retail, single unit, mounted in a Solid State Logic console that cost the studio half a million dollars over the next decade.

Chapter 3 — The Tape Deck Transport Buttons

In nineteen-fifty-six, Ampex shipped the VR-One-Thousand videotape recorder to CBS. The machine was the size of a large refrigerator and recorded broadcast-quality video to two-inch magnetic tape running on twelve-inch reels. It was the first practical broadcast videotape system, and it revolutionized television production by decoupling the broadcast schedule from live performance. Before the VR-One-Thousand, every CBS broadcast either had to be live, or had to be filmed on kinescope by pointing a film camera at a television monitor, which produced a noticeably degraded image. After the VR-One-Thousand, CBS could record a broadcast in California at four PM Pacific, ship the tape east, and air it at seven PM Eastern with full broadcast-quality video. The economic implications for broadcast television were enormous, and the engineering implications for the audio industry were enormous too, because the tape-deck transport — the mechanical assembly that pulled the tape across the heads at a calibrated speed and in a controlled direction — became the central engineering puzzle of professional audio for the next twenty-five years.

The transport buttons on a professional tape deck are the third object in the lineage we are tracing. They are the row of large, dedicated, mechanically-positive buttons that control the tape motion: play, stop, fast-forward, rewind, record, and on some machines pause and tape-edit cue. Each button is a physically dedicated control, with its own internal mechanical engagement against the tape transport's solenoids and pinch rollers. Each button has a tactile click that signals to the engineer's finger that the command has been accepted by the deck. Each button has a position on the deck face that has been the same position on every professional tape deck since approximately nineteen-sixty-five. The play button is always on the right side of the transport cluster. The rewind button is always on the left. The stop button is always center or near-center. The record button, on machines with a record function, is always interlocked with the play button so that recording requires pressing record-and-play together — a deliberate safety interlock against accidentally erasing a master tape.

The professional tape decks of the nineteen-sixties, seventies, and eighties — the Ampex ATR-One-Hundred, the Studer A-Eight-Hundred and A-Eighty, the MCI JH series — were premium-niche products sold to commercial recording studios at unit prices in the tens of thousands of dollars. The transport feel was a load-bearing part of the product. An engineer who has spent twenty years working on Studer A-Eighty machines develops a wrist memory for the exact position of the Studer transport buttons and the exact tactile response of the Studer fast-forward engagement. Switching that engineer to a different deck — say, an MCI — is not a re-training task on the order of an afternoon. It is a re-training task on the order of weeks. The buttons are not interchangeable in any operationally meaningful sense.

The transport buttons are also the lineage from which the consumer-audio tape deck inherits. When TEAC and Sony and Pioneer and a half-dozen other Japanese consumer audio manufacturers entered the cassette-deck market in the nineteen-seventies, they imported the professional transport-button conventions wholesale. A nineteen-seventy-eight Pioneer CT-F-Eight-Hundred cassette deck has play, fast-forward, rewind, stop, pause, and record buttons in essentially the same physical configuration as a contemporary Studer reel-to-reel. The Japanese consumer-audio firms understood that the tactile conventions had been worked out in the professional world, and that consumers who had been exposed to professional studios — through TV, through music journalism, through the audiophile press — would recognize and reward the same conventions in their home equipment. The consumer-audio tape deck was, in this sense, professional gear scaled down for home use, with the transport conventions intact.

The economic lesson here is important. The transport-button convention was successfully democratized from the professional studio to the consumer audio market because the convention was already correct. There was no need to redesign for the consumer market. The professional conventions had been refined over twenty years against the most demanding users — recording engineers, broadcast operators, mastering engineers — and they were already optimized for the underlying physical task of operating a magnetic-tape transport. The consumer was a less-demanding version of the professional user, but the underlying task was the same task. Carrying the professional conventions over meant the consumer got an interface that had been optimized by twenty years of professional use, for free. The Japanese audio firms understood this, and the result was that the nineteen-seventies and nineteen-eighties cassette deck became one of the most universally usable pieces of consumer electronics in history. Anyone who learned to operate one cassette deck could operate any cassette deck, anywhere in the world, in any language, because the transport buttons were positioned in the same place and labeled with the same icons.

The collapse of this convention came with the rise of the integrated audio system in the late nineteen-eighties and through the nineteen-nineties. Boomboxes, rack systems, mini-systems, home-theater receivers — all of which combined a cassette deck with a tuner, a CD player, an amplifier, and eventually an integrated remote — began consolidating the transport controls into smaller, less differentiated, less tactile buttons clustered alongside other controls on a single panel. The dedicated, large, mechanically-positive transport buttons gave way to small membrane switches positioned among similar-looking switches. The eye had to find the right button before the finger could press it. The interface began to require visual attention for operations that, on a dedicated tape deck, had been completable in the dark.

This is the first place in our six-chapter survey where we see the regression that the Phonograph is built against. The transport buttons of the nineteen-seventies cassette deck were larger, more tactile, more differentiated, and more dedicated than the transport buttons of the nineteen-nineties mini-system. The newer interface was cheaper to manufacture — fewer parts, smaller footprint, common membrane-switch supplier — but it was a worse interface for the operator. The regression was not driven by usability research. It was driven by industrial-design tooling economics, by the consolidation of multiple audio components onto a single front panel, and by the marketing department's preference for sleeker, simpler-looking equipment. Sound familiar? The same logic will repeat itself in the smartphone era, on a larger scale, as we discuss in chapter five and chapter six.

The professional tape deck did not, of course, disappear. The Studer A-Eighty was in continuous production until the early nineteen-nineties, and the Studer A-Eight-Hundred had a similarly long run. Mastering studios continued to use analog tape into the two-thousands and into the present day for certain genres and certain engineers. The professional transport-button convention is preserved in those rooms. The convention is also preserved, deliberately, in the design of high-end audiophile equipment from companies like McIntosh and Krell, which retained large, mechanically-positive controls on the front panel long after the mass market had moved on. We turn to those companies in the next chapter.

Chapter 4 — The Hi-Fi Volume Knob — Marantz, McIntosh, Krell

The hi-fi volume knob is the central object in this entire audio program. Every other object we have discussed — the telegraph key, the studio fader handle, the tape transport buttons — and every object we will discuss — the universal remote, the Sonos volume knob, the Phonograph itself — is in some sense a variation on the central problem the hi-fi volume knob solves. The problem is: a human operator wants to adjust the loudness of an audio signal continuously, by a calibrated amount, with a single hand, in a low-light environment, without visual attention to the device. The hi-fi volume knob, refined over six decades, is one of the best single-purpose tactile control objects in the history of consumer electronics. We want to spend ten minutes understanding why.

The first thing to say is that the hi-fi volume knob is a specific class of object, not a generic knob. The defining features are: a substantial diameter — typically thirty to fifty millimeters — a substantial weight, often a brass or aluminum slug rather than a plastic shell, a calibrated rotational resistance that the human hand can sense as a steady drag rather than as friction or freewheel, a specific shape of grip — often a knurled outer edge, sometimes a smooth surface with a single indicator pip, occasionally a flat-sided or scalloped form — and a precise relationship between rotational angle and audio level. A good hi-fi volume knob feels like an object the manufacturer has thought about. A bad one feels like a stamped plastic part.

Saul Marantz, founding Marantz in New York in nineteen-fifty-three, built the first generation of high-end tube preamps and integrated amplifiers that established the post-war American hi-fi market. The Marantz Model One preamplifier of nineteen-fifty-three is the founding object of the high-end American consumer audio category. It used a multi-deck rotary switch for source selection and a custom potentiometer for volume — both controls mounted to substantial machined-aluminum knobs with a fluted outer edge and a precision indicator. The Marantz Model Seven, released in nineteen-fifty-eight, refined the design further and became the canonical American tube preamplifier of the late nineteen-fifties and early nineteen-sixties. The Model Seven volume knob is, by reputation among collectors and restorers, one of the most precisely-feeling rotary controls in any consumer-audio product ever made.

Frank McIntosh, founding McIntosh Laboratories in Binghamton, New York in nineteen-forty-nine, took the same posture in a slightly more conservative direction. The McIntosh C-Twenty-Two preamplifier of nineteen-sixty-three, and the C-Twenty-Eight that followed in nineteen-seventy, established the McIntosh house style: black anodized aluminum faceplates, large green-illuminated meters, large rotary knobs with a chrome ring around the perimeter and a distinct indicator marker. The MR-Sixty-Five tuner of nineteen-fifty-nine — released the same year as the introduction of FM stereo broadcasting in the United States, and built specifically to take advantage of it — used a continuous-tuning dial with a large flywheel-weighted knob that, when spun, would continue rotating under its own inertia for several seconds, allowing the operator to scan across the FM band by giving the dial a single flick of the wrist. That flywheel-weighted tuning knob is one of the most distinctive tactile-control gestures in the history of consumer audio, and it is preserved in the McIntosh product line essentially unchanged into the two-thousand-twenties.

The McIntosh house style is worth dwelling on because it is the most explicit case in consumer-audio history of a company that has built a multi-decade financial position on the premium-tactile-control posture. McIntosh has been continuously producing the MC-Two-Seventy-Five tube power amplifier in some form since nineteen-sixty-one. The faceplate design, the meter style, the knob shapes, and the chrome accents have all been preserved deliberately across the entire run. A two-thousand-twenty-four McIntosh MC-Two-Seventy-Five looks remarkably like a nineteen-sixty-one MC-Two-Seventy-Five. The internal electronics have been updated several times. The transformers have been re-wound. The tubes have been swapped to whatever is currently in production. The chassis has been re-engineered for modern safety standards. The external appearance and the tactile feel of the knobs have not changed in sixty years. McIntosh has explicitly priced its products against the premium-niche segment, has explicitly refused to chase the mass-market consumer audio business, and has explicitly priced its premium pricing as a permanent commitment rather than as a marketing-window scarcity tactic. The McIntosh financial model is the closest existing template, in the consumer-audio category specifically, for what the Stax Editions financial model is trying to instantiate at a different price point and on a different product surface.

Krell Industries, founded in nineteen-eighty by Dan D'Agostino in Orange, Connecticut, sits in the third corner of the high-end-audio triangle alongside Marantz and McIntosh. Where Marantz was tube-based and aesthetically refined, and McIntosh was tube-based and conservatively styled, Krell was solid-state and aggressively powerful. The Krell KSA-Two-Hundred power amplifier of the early nineteen-eighties weighed over a hundred pounds, drew enough current at idle to noticeably warm a small room, and could drive loudspeaker loads that other solid-state amplifiers physically could not. The Krell faceplate was a machined billet of aluminum that was thicker than most cars' brake calipers. The Krell volume knob was a substantial machined aluminum disc with a knurled outer edge, mounted on a precision potentiometer through a steel shaft sleeve to eliminate any wobble. The whole product was aggressively over-built, aggressively over-engineered, and aggressively priced — which was its market posture.

The three companies together — Marantz, McIntosh, and Krell — established the premium tactile-control posture in American consumer audio across the nineteen-fifties through the nineteen-eighties. Other companies in other countries did the same in parallel: Quad in England with the Quad Twenty-Two preamp and the original electrostatic speakers, Audio Note in Britain with the all-tube no-feedback designs, Mark Levinson in the U.S. with the LNP-Two preamp, Goldmund in Switzerland with the brutalist-modernist Reference turntables, and many others. The premium-niche segment of consumer audio has, throughout the entire post-war period, been a place where small companies have sold opinionated, single-purpose, tactile-control-first objects to a customer base that has paid premium prices for an exquisitely-tuned appliance layer. The category has not disappeared. The category has compounded. The McIntosh of two-thousand-twenty-four is more profitable, in real terms, than the McIntosh of nineteen-sixty-three.

The lesson the Stax Editions financial model takes from this is direct. The premium-tactile-control segment is a durable commercial position. The mass-market mid-tier of consumer audio has been under continuous margin pressure since the nineteen-eighties as cheap Asian manufacturing has commoditized the substrate. The premium-niche has not been under that pressure. McIntosh, Krell, and the surviving high-end audio brands are not competing with Sonos. They are competing with the absence of Sonos at the price point and quality posture they occupy. The Phonograph at two-hundred-eighty dollars retail occupies a different price point — premium but accessible — and a different product surface — control rather than amplification — but the financial logic is the same. Premium-niche, opinionated, tactile-control-first, sold to an audience that has organized itself around no longer tolerating the mass-market category's compromises.

Chapter 5 — The Remote Control's Rise And Disappearance

The remote control is the bridge from the hi-fi volume knob to the Sonos volume knob, and it deserves its own chapter because the history of the remote is also the history of how the touchscreen-on-everything monoculture won. To understand the Phonograph, we need to understand specifically what the remote control was, why it succeeded, and how the success was undone.

The first commercial television remote was the Zenith Lazy Bones, released in nineteen-fifty. It was wired. A cable ran from the television to a handheld unit with mechanical buttons that controlled tuning and on-off. The Zenith Flashmatic of nineteen-fifty-five was the first wireless remote — it used a directional photocell receiver on the television and a flashlight aimed at one of four corner sensors to advance the channel or turn the set on or off. The Flashmatic was elegant in its mechanical simplicity, but it had an obvious flaw: any source of bright light, including direct sunlight, could trigger the receiver. The Zenith Space Command of nineteen-fifty-six replaced the optical link with an ultrasonic link, using four small aluminum rods inside the remote that, when struck by little hammers, emitted ultrasonic tones at four different frequencies that the television's microphone receiver could distinguish. The Space Command was the first wireless remote in widespread use, and it established the convention that a television remote would be a small handheld object with a few large physical buttons dedicated to specific functions.

Across the nineteen-sixties and seventies, the ultrasonic remote was gradually replaced by the infrared remote — pioneered by Telefunken and Toshiba — which offered better range, no audible side-effects, and more channel capacity for additional command codes. The infrared remote became the dominant television-control technology by the early nineteen-eighties. The Sony RM-Series remotes from the late nineteen-seventies and early nineteen-eighties established what we now think of as the canonical television remote: a rectangular handheld unit, twelve to twenty large rubber buttons, dedicated power, volume up and down, channel up and down, mute, and a numeric keypad. The buttons were tactile, mechanical, and dedicated. The remote could be operated in the dark by feel alone. The remote was almost universally a good piece of industrial design.

The TiVo Peanut remote of nineteen-ninety-nine is the cleanest single example we have of what a remote control could be when designed by people who understood the underlying control task. The TiVo company shipped its first digital video recorder in nineteen-ninety-nine, and the remote that came with it was designed by a small team that included Paul Newby and Steve Perlman. The Peanut had a distinctive elongated peanut shape, which fit the human hand more naturally than a rectangular block. The transport buttons — play, fast-forward, rewind, pause — were oversized and shaped distinctly from the other buttons so the user could find them by touch. The thumbs-up and thumbs-down rating buttons, which were specific to the TiVo product feature set, had their own distinctive shapes and locations. The directional pad in the center was large, with a clear central confirm button. The whole remote was deliberately limited in feature surface — TiVo refused to make the Peanut a universal controller for the whole living room, even though it would have been technically straightforward — and the limit was a feature. The user got a remote that did one job well and did not try to do other jobs poorly.

The IDEO industrial-design retrospective on the Peanut, and the subsequent writings by Don Norman in revisions of his book "The Design of Everyday Things," document the design choices in detail. Button differentiation by shape and size. Prominent dedicated controls for the most-used operations. A deliberate refusal to be a universal controller. Limited feature surface as a positive design property. These are the same principles the Phonograph specification names explicitly, and they are not novel principles. They were worked out in nineteen-ninety-nine and were generally available to anyone building a remote-control product after that date.

What happened next is the part we are interested in. The Peanut was not widely imitated. TiVo as a company shifted away from standalone hardware in the early two-thousands as its business pivoted toward licensing its software to cable and satellite operators. The Peanut faded from production. The remote-control category, instead of converging on the principles the Peanut had demonstrated, moved in the opposite direction.

Two trends collapsed the remote-control category over the two-thousand-tens. The first was the universal-remote-fueled feature creep of the nineteen-nineties — universal remotes from Logitech Harmony and other companies that tried to control every device in the living room from a single handheld unit, accumulating dozens of buttons and complex programming sequences and never quite working reliably with all of the devices the customer owned. The second was the rise of streaming media players — Apple TV, Roku, Amazon Fire — which collapsed the device-control problem from "control my receiver, my Blu-ray, my TV, my cable box" to "control my streaming box, which speaks to all of those devices over HDMI-CEC." That collapse removed the rationale for the universal remote — it was no longer necessary to control five devices, because you were really only controlling one — but the streaming-box companies, instead of building a Peanut-quality remote for the simplified task, took the opposite move. They built minimal remotes with as few buttons as possible, and routed everything else through the on-screen menu and through the streaming app on the user's phone.

The Apple TV Siri Remote of two-thousand-fifteen is the canonical example. The original Siri Remote replaced the dedicated transport buttons of earlier Apple TV remotes with a touch surface. The touch surface required the user to look at the remote to verify what they were touching, because the touch surface had no tactile differentiation. Customer complaints across the next six years prompted Apple to release a redesigned Siri Remote in two-thousand-twenty-one, which partially restored a directional pad — but did not restore the dedicated transport-button cluster. The Roku and Fire remotes followed similar arcs. Voice-assistant buttons were added; dedicated number pads were removed; the trend was toward fewer, less tactile, more software-dependent remotes.

By two-thousand-twenty, the standard industry expectation was that anything beyond the most basic control should be handled in the phone app, not on the remote. The Sonos S2 app is the most visible instance of this expectation, but it is not the only one. The Apple Music app on iPhone has effectively replaced the Apple TV remote as the primary control interface for Apple TV. The Spotify app on iPhone has replaced the dedicated controls on Sonos speakers. The thermostat app on iPhone has replaced the thermostat itself for control of home heating. The whole control-surface category for the modern home has migrated from dedicated tactile objects to a single touchscreen running in a phone app, and the cost has been substantial. The five-tap excavation through a phone app to turn down the music is a real cost, paid in attention and patience, every time the operation is performed.

This is the regression the Phonograph is built against. The Phonograph is, in a literal sense, a reconstruction of the Peanut remote's design principles — single purpose, tactile, dedicated controls, limited feature surface — implemented on two-thousand-twenty-six substrate. The substrate is different: Bluetooth Low Energy Five-Point-Three rather than infrared, ESP32-C6 microcontroller rather than a custom-encoded ASIC, NATS-over-Bluetooth event publishing rather than a hardware-baked code table. But the design posture is the same posture the Peanut took in nineteen-ninety-nine. We have been here before. The industry walked away from this posture twice — once after the Peanut, once after the original Nest thermostat in two-thousand-eleven — and the Phonograph is one attempt to walk back to it.

Chapter 6 — The Sonos Volume Knob, And What The Phonograph Inherits

The Sonos volume knob is the central comparison point this audio program orbits around, and we have saved it for last because it is the most recent and the most directly relevant to the Phonograph. Sonos was founded in two-thousand-two by John MacFarlane, Craig Shelburne, Tom Cullen, and Trung Mai in Santa Barbara, California. The original Sonos product, the ZP-One-Hundred Zone Player released in two-thousand-five, was the first commercially successful multi-room audio system that synchronized music playback across multiple rooms over a household Wi-Fi mesh. The product was a substantial engineering achievement. The mesh-grade firmware that handled inter-zone time synchronization, the cross-platform audio-format support, the seamless integration with cloud streaming services as they emerged through the late two-thousands and early two-thousand-tens — all of that was non-trivial engineering, executed at high quality, by a team that genuinely understood what they were building.

The Sonos volume knob, in the original product line, was a physical knob on the ZP-One-Hundred and the original Sonos Connect. It was a small rotary control on the speaker face, with a tactile click for volume changes. It was not the primary interface — the primary interface was the original Sonos Controller CR-One-Hundred, a dedicated handheld remote with a screen and a click-wheel that was the closest analog to the original iPod's click-wheel from the same period. The CR-One-Hundred was a substantial piece of dedicated hardware that Sonos shipped as a first-class product alongside the speakers themselves. The CR-One-Hundred is the Sonos product we want to focus on, because it is the moment Sonos got the architecture right and the moment Sonos walked away from it.

The CR-One-Hundred, sold from two-thousand-five through approximately two-thousand-twelve, was a Wi-Fi-enabled handheld controller dedicated specifically to controlling a Sonos zone. It had a click-wheel, dedicated transport buttons, and a screen for browsing the music library. It was, in the Phonograph's lineage, the closest direct ancestor we have. Sonos owners who used the CR-One-Hundred remember it as the best Sonos control experience the company ever shipped. It was eyes-free for volume changes. It was fast for transport operations. It was dedicated to the single task of controlling Sonos. It was, in short, the appliance-layer object that the Sonos S2 phone app should have been.

Sonos discontinued the CR-One-Hundred circa two-thousand-twelve and software-deprecated it in two-thousand-eighteen. The discontinuation was driven by the company's strategic decision to move all control to the phone app. The reasoning was the same reasoning we have heard throughout this audio program — cheaper to manufacture nothing, more flexible to update via software, better platform positioning for the equity market — and the result was the architectural decision that produced the Sonos S2 app catastrophe of two-thousand-twenty-four. Patrick Spence, then-CEO, eventually resigned over that catastrophe. The CR-One-Hundred is, in retrospect, the road Sonos didn't take.

The Phonograph is, in a precise sense, an attempt to re-take the road Sonos didn't take, by a different company. It is not a Sonos product. It is not licensed by Sonos. It is built against the Sonos ecosystem from outside, talking to Sonos speakers through the existing UPnP control interface — the same interface the original Sonos Controller used, which was never deprecated, which is still present on every Sonos speaker, and which can be controlled by any device that speaks UPnP on the local network. The Phonograph's host bridge — running on the owner's Mac or Linux machine — speaks UPnP to Sonos and translates the Phonograph's Bluetooth events into Sonos control commands. The architecture is deliberate: the Phonograph does not depend on Sonos, but the Phonograph composes with Sonos. If Sonos disappears, the Phonograph still works against other audio systems. If the Phonograph disappears, Sonos still works through whatever app remains.

The capsule the Phonograph ships with — the four-component capsule of object, essay, software, and audio — is itself an instantiation of the architectural posture this audio program has been arguing for over six chapters. Each component is single-purpose. The object is a tactile control surface, and only a tactile control surface. The essay is a written argument, and only a written argument. The software is the substrate that makes the object work, and only the substrate. The audio — this audio program — is the historical context that explains why the other three components have the shape they have, and only the historical context. None of the four components tries to be all of them. The four components compose, but they do not collapse into each other. The capsule is a coordinated set of single-purpose objects, in the same way that a hi-fi system of the nineteen-seventies was a coordinated set of single-purpose components — preamp, amplifier, tuner, turntable, speakers — each doing one job well, connected by standardized interfaces.

The financial model the capsule extends is the premium-niche model we have seen recur throughout this audio program. Penny and Giles fader handles to working broadcast engineers. McIntosh amplifiers to audiophiles. Krell power amplifiers to ultra-high-end audio buyers. Leica M-series cameras to manual-focus photographers. Anordain watches to collectors who don't want a smartwatch. The customer base for each of these products is small in unit volume, large in margin per unit, and informed enough to recognize the value proposition without needing it explained. The Phonograph at two-hundred-eighty dollars retail and a one-hundred-unit Edition One run sits comfortably in this lineage. It is not trying to compete with Sonos at Sonos's volume. It is trying to occupy the premium-tactile-control segment that the mass-market consumer audio industry has, over the last fifteen years, vacated.

We close with a return to the telegraph key from chapter one. The architectural posture of the telegraph key — single function, calibrated tactile response, repairable in the field, learnable by an operator over years — has been the right posture for high-frequency control tasks for the entire history of electric communication. Every time an industry has departed from that posture in pursuit of generality or cost savings or platform consolidation, it has eventually come back to it. The current departure — the touchscreen-on-everything monoculture — is the most aggressive departure the consumer-hardware industry has ever made from the telegraph-key posture, and we are now in the early phase of the return. Euro NCAP's two-thousand-twenty-six physical-controls rule for new cars is one signal of the return. Volkswagen's mid-two-thousand-twenty-four announcement that the ID line would restore physical buttons for primary controls is another. The Phonograph is a third. None of these signals is, by itself, large enough to reverse the industry direction. Together, they are the early indicators that the direction is reversing.

That is the argument the audio program has made. The Phonograph is the object the argument is built around. The companion essay published at stax dot dev makes the same argument in writing, with footnotes to primary sources, for readers who want the prose version. The Phonograph's specification document, also at stax dot dev, makes the same argument in industrial-design language, for readers who want the engineering version. This audio program is the historical version, for readers who want the lineage. All three documents say the same thing, in three different substrates, because the four-component capsule format is itself an instantiation of the appliance-layer doctrine the Mercantile Thesis names. The substrates are different. The thesis is the same.

Thank you for listening. The Phonograph is Stax Edition One. The next Edition — the Almanac — ships in the following quarter. The Lineage Mode audio program is a continuing series, of which this is the first. We will return.


The audio component of Stax Edition I — The Phonograph. The other three components of the capsule ship to the Stax direct surface on the 2026 Q2 drop window.