Why The Sonos S2 App Is Bad — And What That Tells Us About The Touchscreen-On-Everything Monoculture
Field statement. The Sonos S2 app is the most visible failure of an architectural pattern that has spread through consumer hardware over the last fifteen years: outsource the appliance layer to a phone app, render every control as a touchscreen list, and call the result modern. The pattern wasn't chosen because users wanted it. It was chosen because the underlying financial incentives — tooling cost, OTA-update flexibility, platform framing — made it the path of least resistance for industrial designers and the people they answer to. The disease is structural. The cure isn't nostalgia. The cure is the return of opinionated, single-purpose, tactile controls as a premium product category, sold to customers who have organized themselves around no longer tolerating the monoculture.
In May 2024, Sonos shipped a rewrite of its mobile control app and broke its own product line. Local-library playback regressed. Queue management — the core workflow for any audio system above the elevator-music tier — became multi-tap to perform what the previous app had executed in one. The alarms feature, which the company's older speakers shipped with for over a decade, stopped working. The accessibility features that had earned Sonos a small but loyal disabled-customer base were removed and not replaced. Public outcry built across the summer. By January 2025, CEO Patrick Spence had resigned1.
The instructive part isn't that Sonos shipped a bad app. Companies ship bad apps. The instructive part is what kind of bad app it was. The S2 rewrite did not fail because the engineers were incompetent. The speakers Sonos sells — Era 100, Era 300, Arc Ultra, Five — are genuinely well-engineered: tuned drivers, proprietary Trueplay room calibration that runs an iPhone microphone through a directional sweep, mesh-grade firmware, a release-engineering record of supporting hardware for roughly nine years per generation2. The gap between the engineering quality of the substrate (the speakers, the mesh, the tuning) and the engineering quality of the appliance the user actually touches (the phone app) is one of the widest gaps in any consumer electronics category in 2025–2026.
That gap is the subject of this essay. I want to argue two things in order. First, that the S2 disaster is a symptom, not the disease, and that the disease has a name: the touchscreen-on-everything monoculture in consumer hardware. Second, that the cure has a name too, and a financial model attached: opinionated, single-purpose, tactile-control objects, sold at premium prices to customers who have run out of patience with the substrate this essay names. The Phonograph — the object I am shipping as Stax Edition I3 — is one instance of that financial model. It is not the only one. It is the one I am personally responsible for, and the one this essay accompanies.
The argument leans on a doctrine I have published separately as the Mercantile Thesis4: that durable wealth in any commercial system flows to whoever owns the bottleneck the rest of the economy must route through, and that the appliance layer — the surface the user actually touches — is the bottleneck the rest of the consumer-electronics stack must route through. Sonos's failure is precisely a Mercantile-Thesis case study. They built an excellent substrate and rented the appliance layer to a touchscreen-app paradigm that strategically belongs to Apple. The diagnosis generalizes beyond Sonos. The cure generalizes too.
II. The receipts — what Sonos S2 actually does
This section is the evidence base. Every claim here is footnoted to a primary or near-primary source. I am not interested in piling on Sonos for sport; I am interested in establishing, with citations, the exact shape of the failure, so the structural argument in later sections does not get to wave at "users hate it."
The May 2024 rewrite
In May 2024, Sonos released what the company internally called the "new app" — a full rewrite of the S2 mobile controller, released to coincide with the launch of the Ace headphones5. The rewrite shipped with the following regressions, each of which was acknowledged in writing by Sonos within the first sixty days:
- Local library playback broke. Users who had spent a decade indexing personal music collections via their NAS or local disk could no longer reliably browse or queue from those libraries6.
- Queue management lost features. The "now playing" view stopped showing the upcoming queue without an additional navigation step; reordering the queue required a tap-and-hold workflow that had previously been a single-tap drag5.
- The alarms feature — bedside-clock alarm functionality that had shipped on Sonos speakers since the original Play:5 — was missing entirely from the new app. The feature was eventually restored, but the restoration took roughly three months and required users to roll back to the legacy "S2 controller" app during the interim7.
- Accessibility features for blind and low-vision users — VoiceOver compatibility, larger-touch-target modes — regressed without warning. The disability community's response prompted formal apologies from the company8.
By August 2024, Patrick Spence, then-CEO, had published a public apology and a recovery roadmap9. By January 2025, Spence had departed the company. The board appointed Tom Conrad, a Sonos board member, as interim CEO10. The Verge, Wired, and the Washington Post each ran multi-thousand-word retrospectives across the summer and fall of 2024 documenting the customer revolt11.
The mandatory cloud-account login
There is a second failure mode that predates the May 2024 disaster and that is more structurally interesting than the rewrite itself. Sonos requires a cloud account login to control speakers on your own local network. The speakers are physically on your LAN; the control app is on your phone, also on your LAN; the iPhone and the speaker can route packets to each other without leaving the building. Sonos's design nevertheless requires authentication against a Sonos-operated cloud service before the iPhone can issue a play command to a speaker in the same room12.
This is a load-bearing architectural decision. It means: a Sonos system is unusable if the Sonos cloud is down. A Sonos system is unusable if your home internet is down, even though every piece of hardware involved is sitting on the same Wi-Fi network. A Sonos system is unusable if you decline to create an account. The speakers you bought, that you own, that are physically present in your house, are not addressable without permission from a server in someone else's data center.
The defensible reading of this design is that it enables features like multi-room synchronization across cloud-streamed services (Spotify Connect, Apple Music, etc.) and that the cloud dependency is necessary to broker authentication between Sonos hardware and third-party music vendors. That defense is real. But it does not require the local control path to depend on the cloud. The architectural decision to route everything through the same authentication boundary is a separate decision from the integration with Spotify, and Sonos's competitors (notably the Bluesound / NAD ecosystem) demonstrate that local control of a multi-room mesh is implementable without mandatory cloud auth13.
The latency and tap-count problems
The May 2024 rewrite also regressed on raw interaction latency. On a 2023-vintage iPhone 15 Pro running the post-rewrite S2 app, the time from app launch to a tappable pause button was, in my own informal timing across roughly two weeks of use in May–June 2024, in the range of 4–7 seconds on a cold launch — meaning the app had been backgrounded long enough that iOS had purged it from memory14. The earlier Sonos Controller app, on the same hardware, was tappable in roughly 2 seconds.
The tap counts for common operations also regressed. Grouping two speakers for synchronized playback in the pre-rewrite app was a 2–3 tap operation (open speakers list, tap "+" on one, tap target). In the post-rewrite app, the same operation was 4–6 taps15. Pausing playback from the home screen of the app — which used to be a single tap on the persistent "now playing" bar — required navigating into the now-playing detail view in the rewritten version.
I want to be specific about evidence here. The latency observations above are my own timing across a small sample of cold launches; they are not a controlled benchmark. I am calling them out as a qualitative observation. The tap-count observations are corroborated by aggregated user reports on r/Sonos and the Sonos community forum, but they are not a randomized usability study. I have not run one. The systematic evidence I can offer is the published coverage in The Verge, Wired, and the Washington Post — which is journalism, not user research, but which is at least audited by editorial standards. The structural argument below does not depend on the exact tap counts; it depends on the direction of the regression, which is unambiguous.
The architectural decision: every UI is a touchscreen list
Underneath the specific bugs is an architectural pattern. Every Sonos UI primitive is a vertically scrolling list of items rendered on a touchscreen. There is no physical knob anywhere in the Sonos line. There is no hardware "skip" or "scene" button on the speakers themselves beyond the limited capacitive controls on the speaker top. There is no peripheral remote control sold by Sonos as a first-class product. The previous-generation IR-style remote (the original Sonos Controller CR100, discontinued circa 2012) had a screen and a click-wheel; the architectural lineage of "device with physical input that controls the system" stopped with that product16.
This is not an accident. The platform decision is: render everything in the phone app, route control through the cloud, eliminate any physical surface that would compete with the touchscreen for the user's attention. The decision rationalizes a long list of cost reductions (no hardware controller to manufacture, no remote-control supply chain, no second UI to maintain). It is also the decision that creates the gap this essay describes.
III. The disease — touchscreen-on-everything monoculture
Sonos is not unique. The pattern of routing all consumer-hardware control through a touchscreen — either the device's own panel or the user's phone — is the dominant architectural pattern of the 2015–2025 consumer-hardware decade. The instructive examples beyond audio:
Modern automotive HVAC
Tesla's Model 3 and Model Y, launched 2017 and 2020 respectively, removed nearly every physical control from the dashboard. Climate control, mirror adjustment, wiper speed, headlight mode, and glove-box release all live behind a touchscreen menu. The decision was widely imitated: Volvo, Volkswagen ID.4, Ford Mustang Mach-E, and several Hyundai/Kia models followed with touchscreen-first HVAC over the 2020–2024 window17.
The empirical record on what happened next is unusually clean. In 2024, Euro NCAP — the European New Car Assessment Programme, the dominant European safety-rating body — announced that, starting in 2026, vehicles that fail to provide physical controls for five specific functions (turn signals, hazard lights, wipers, horn, and emergency-call) would not be eligible for a five-star safety rating18. The announcement is the first major regulatory pushback against the touchscreen-on-everything pattern, and it is based on accumulated evidence that touchscreen-only controls produce measurable driver-distraction effects relative to physical controls.
The underlying research that prompted the Euro NCAP rule includes a 2022 study by the Swedish car magazine Vi Bilägare, which timed driver-response on a closed track across 12 vehicles, comparing touchscreen-based control adjustments to the equivalent operations on a 2005 Volvo V70 with physical controls. The 2005 Volvo's drivers performed the four-task workflow (adjust radio, change climate, reset trip computer, turn on heated seats) in 10 seconds at 110 km/h; the touchscreen-equipped MG Marvel R took 44.6 seconds19. That is not a marginal effect. The distance traveled at 110 km/h during the additional 34 seconds is roughly 1 kilometer — a kilometer of driving with the driver's eyes off the road.
The NHTSA in the United States has not yet adopted a comparable rule, but its 2012 driver-distraction visual-manual guidelines (still nominally in force) suggest a 2-second per-glance and 12-second total task ceiling for in-vehicle interfaces20, a threshold the Vi Bilägare touchscreen times routinely exceed.
The Volvo-touchscreen example is especially instructive because Volvo's own brand identity, prior to the 2020s, was safety. The company that pioneered the three-point seatbelt now ships vehicles whose climate controls require taking eyes off the road. The financial-incentive explanation is simpler than the marketing one: a touchscreen costs less per dashboard than the equivalent panel of physical switches, and a single touchscreen part can be re-skinned across an entire product line.
Modern white goods
Capacitive-touch control panels on washing machines, stoves, and dishwashers became dominant in the 2015–2020 window. The repair industry has documented the consequences:
- Failure rate. Capacitive panels on appliances exposed to steam, hot water, soap residue, and wet hands fail at materially higher rates than the mechanical knobs they replaced. The repair-focused magazine Appliance Service News and the iFixit community have published service-call data across multiple model years21.
- Wet-hand operability. Capacitive sensors require skin-contact conductivity, which is reduced by water, gloves, and soap. The cooking workflow where the user has just handled raw meat or wet pasta and needs to adjust the burner is broken by the touch panel.
- Repairability. Mechanical knobs are field-replaceable for a $5 part. Capacitive touch panels typically require replacing the entire control board, often at $200–$400 service-call cost21.
The honest framing is that there is no rigorous large-N study of appliance-touch-panel failure rates that I have located; the evidence base is the repair-industry trade press and aggregated user reports, which is weaker evidence than the automotive case. But the direction is consistent with the automotive evidence, and the structural cause is identical.
The death of the physical remote
Streaming-box remotes from Apple TV, Roku, and Amazon Fire have followed a trajectory of fewer-buttons-more-on-screen-menus from roughly 2015 onward. The Apple TV Siri Remote in particular went through a 2015 redesign that removed the dedicated transport buttons (skip-forward, skip-back, rewind) in favor of a touch surface; user complaints prompted a 2021 redesign that partially restored the directional pad but did not restore the dedicated transport-button cluster22. The newer Roku and Fire remotes added voice-assistant buttons and removed dedicated number pads.
The architectural decision is consistent across the category: physical buttons are expensive, on-screen menus are cheap, and the user can be trained to tolerate the change. The customer experience is consistently worse on every axis the user values — speed, eyes-free operation, tactile feedback — and is preferred only by the manufacturer's cost-and-tooling team.
The honest scope of the indictment
I want to be careful here. The touchscreen is not a uniformly bad input modality. For tasks that involve reading and arbitrary-text-entry — composing email, browsing a long catalog, searching a music library by artist name — the touchscreen-and-keyboard combination is the right tool. The argument is not "abolish touchscreens." The argument is return touchscreens to their right scope: reading-and-typing, not control.
Control tasks — adjust the volume, pause the audio, switch the lights to dim, change the cabin temperature, advance to the next track — share three properties:
- They are performed frequently (many times per session).
- They are performed while the user's primary attention is elsewhere (driving, cooking, talking to a guest, falling asleep).
- They map naturally to a small, fixed set of states (volume is a scalar; play/pause is binary; track-skip is increment/decrement).
For tasks with all three properties, a physical control is better than a touchscreen on every axis that matters: speed (a knob turns instantly; a touchscreen requires waking the device, finding the right menu, locating the control), eyes-free operation (a knob is felt; a touchscreen must be looked at), tactile feedback (a knob clicks at the detents; a touchscreen has no feedback), longevity (mechanical knobs from 1990s stoves still work; capacitive panels from 2018 stoves are failing), accessibility (blind users can operate a knob; touchscreens are categorically harder for non-sighted operation, even with screen-reader support).
This is not a contested claim in human-factors research. The literature on tactile-feedback advantages in safety-critical control tasks predates touchscreens by decades and was established in cockpit-design research as far back as the 1950s23. The industry's drift away from physical controls was not driven by usability studies. It was driven by industrial-design economics.
IV. Why the disease persisted — the financial structure
The touchscreen-on-everything pattern is not a designer error. It is a financial-incentive pattern. Naming the incentives is what makes the cure tractable.
Industrial-design tooling cost. A single touchscreen part — typically a capacitive-touch IPS display with a stock controller IC — can be re-used across an entire product line. A 7-inch panel on the dashboard of a Volvo XC40 is, at the BOM level, very similar to the panel on a Volvo XC60 or XC90. Changing the displayed UI is a software job. Changing the displayed UI on a physical-control dashboard is a tooling job: new switches, new knob shafts, new dashboard injection molds. The capital cost of physical-control variety is large; the capital cost of touchscreen variety is approximately zero. Industrial designers who are evaluated on per-vehicle BOM cost will rationally choose the touchscreen.
OTA-update flexibility. Marketing departments love that a touchscreen UI can be updated after the device ships. The argument is: if we get the UX wrong at ship, we can fix it later. This is partially true and partially the source of the failure mode this essay is about. The Sonos S2 May 2024 rewrite is exactly what "OTA-update flexibility" looks like when it goes badly. The flexibility cuts both ways: it lets the manufacturer fix mistakes, and it lets the manufacturer break things that previously worked, often in service of a different organizational priority than what the original engineers had optimized.
Platform framing. Investors prefer companies whose products are platforms — recurring engagement, app-store dynamics, analytics, future revenue streams — over companies whose products are single-purpose objects. A Sonos speaker is a single-purpose object. A Sonos system with a phone app is a platform. The financial framing of the company shifts when the appliance layer is software: it becomes a software company that sells hardware, rather than a hardware company that sells hardware. Software companies trade at higher multiples than hardware companies. The architectural decision to render every control as a touchscreen menu in a phone app is also a positioning decision for the equity market.
These three incentives produced a generation of consumer hardware whose substrate engineering is excellent and whose appliance layer is awful. The Sonos S2 case is the most public instance because the substrate quality is conspicuously high (the speakers really are good) and the appliance failure is conspicuously bad (the app really is broken). The pattern is general.
V. The Mercantile Thesis applied — what owning the appliance layer means
The argument from the Mercantile Thesis is that durable commercial position flows to whoever owns the bottleneck the rest of the economy must route through, and that the bottleneck in consumer hardware is the appliance layer — the surface the user touches4. Sonos's failure is precisely diagnosed by that lens.
Sonos owns its substrate: drivers, tuning, mesh firmware, room calibration. Sonos does not own its appliance layer. The appliance layer for Sonos is a touchscreen-app paradigm whose conventions are set by Apple and Google. The phone's notification design language, the gestural conventions, the App Store gating, the iOS/Android version-compatibility windows, the platform-mandated UI patterns — none of that is Sonos's. Sonos rents that surface from the phone platform. When the platform changes (new iOS version, new Material Design conventions, new App Store guidelines), Sonos has to chase. When the platform's UI patterns are wrong for control tasks (because they were designed for reading-and-typing, not control), Sonos cannot fix the patterns. They can only work within them.
That is the spread-scalper position from the Mercantile Thesis, applied to a hardware company. Sonos buys appliance-layer surface wholesale from Apple/Google, repackages it with its substrate, and sells the bundle retail. The margin lives in the gap. The gap closes when the substrate commoditizes (and it is commoditizing: KEF LSX, Bluesound, even Apple's own HomePod are credible substrate alternatives) or when the appliance-layer rent becomes intolerable (and it has, with the May 2024 rewrite). The Mercantile lens predicts what happens next: either Sonos moves down into the substrate they already own and builds an appliance layer they control, or they get repriced.
The same diagnosis applies to:
- Modern Tesla buyers who tolerate climate-control-in-a-menu because there is no alternative inside the Tesla appliance layer. Tesla rents the appliance layer from "touchscreens are how cars work now." When Euro NCAP's 2026 rule lands, the rent goes up.
- Bose / Bang & Olufsen / KEF buyers who tolerate phone-app control for speakers whose drivers cost more than the iPhone controlling them. The substrate is excellent; the appliance layer is rented from the same Apple/Google touchscreen paradigm.
- Modern stove buyers who navigate three layers of capacitive menu to turn on a back burner. The substrate (induction coils, burner timing) is excellent; the appliance layer is rented from the white-goods industry's collective decision to switch to capacitive panels.
The structural pattern: substrate-excellent companies that rent their appliance layer to the touchscreen paradigm are all in the spread-scalper position relative to whoever ends up owning the tactile-control alternative.
VI. The Phonograph thesis — what Stax Edition I instantiates
The Phonograph is Stax Edition I of the Stax Editions drop-house catalog3. The Edition is a four-component capsule: object, essay, software, audio. The object's role is to instantiate the thesis this essay argues.
Concretely, the Phonograph is an anodized-aluminum BLE 5.3 remote, single tactile knob (volume by rotation, play/pause by push), three programmable buttons, ESP32-C6 inside, CR2032 cell, etched serial. It is the anti-touchscreen for an audio system. It controls one substrate (Sonos via UPnP, AirPlay, Spotify Connect, or the Stax NATS substrate) through one knob and three buttons. It is opinionated about its scope: it does not browse libraries, does not display album art, does not show notifications, does not require an account. It does control — volume, transport, scene — and nothing else.
The argument I want to make about this object is not aesthetic. It is structural. The Phonograph is the appliance layer that the Sonos S2 app should have been: tactile, fast, eyes-free, single-purpose, opinionated. The phone app stays — for setup, library browsing, occasional deep configuration. The Phonograph handles the 90% of interactions that are control. The capsule splits the workload across the right surfaces. The touchscreen does what touchscreens are good at (reading and typing); the knob does what knobs are good at (control).
The financial model of Stax Editions is what makes the Phonograph economically viable as a counter-pattern. The capsule is priced as a four-component bundle — object plus essay plus software plus audio — and that bundle justifies a premium ($280 for Edition I) that pure-hardware-as-commodity could not support. The customer is not buying a remote control. The customer is buying a thesis (this essay), an editorial program (the Lineage Mode audio companion), an open-source substrate (the AGPL firmware and client), and a piece of jewelry-grade engineering (the object). The bundle is the moat. Without the bundle, a tactile remote control is a $40 commodity that competes against a $0 phone app and loses. With the bundle, it is a $280 capsule that competes against the Sonos S2 app and the customer experience around it, and wins.
The Edition is the proof of concept for an architectural posture, not a one-off product. The posture, in one sentence: ship single-purpose tactile objects as premium capsules, financed by their editorial and software companions, sold direct to the audience that has run out of patience with the touchscreen-on-everything monoculture. The Phonograph is Edition I. The Almanac is Edition II. The Lineage Album is Edition III. Each Edition extends the same financial model to a different surface.
VII. The other examples — beyond Sonos
The case for tactile control as a premium category does not rest on Sonos alone. The instructive companions in the same architectural tradition:
Leica M-series cameras. The Leica M11, retailing in the $9,000 range, is a 60-megapixel rangefinder with manual focus, a physical aperture ring on every lens, a physical shutter-speed dial on the body, and a single film-advance-style ISO dial. The camera deliberately lacks autofocus and many automation features that cheaper competitors include24. It is sold and bought, in significant volume, precisely because the appliance layer is exquisitely tuned: the aperture ring clicks at the right resistance, the focus tab on every lens lands in the same place, the shutter dial has the right detent. Leica's M-series sales have been stable to growing through the 2010s and 2020s despite the absence of every feature the smartphone-camera category considers table-stakes. The Phonograph is in this architectural lineage.
Anordain watches. Anordain, the Glasgow-based independent watchmaker, produces roughly 200 watches per year25. Their Model 1 sells in the $2,000–$4,000 range — a single mechanical complication, hand-enameled dial, no electronics, no app. The financial model is identical to what Stax Editions extends to electronics: small-run, opinionated, single-purpose objects sold to customers who have organized themselves around no longer tolerating the mass-market category's compromises. Anordain's watches do not compete with the Apple Watch. They compete with the absence of the Apple Watch. Stax Edition I competes with the absence of the Sonos S2 app.
The original TiVo Peanut remote (1999). TiVo's original remote — distinctive peanut shape, oversized dedicated transport buttons, a clear "thumbs up / thumbs down" rating cluster — has been retroactively recognized as one of the best-designed consumer remotes of the digital era. The IDEO retrospective on the design, and Don Norman's later writings about it, document the design choices that made it work: button differentiation by shape and size (so the user could find the right button by touch), prominent dedicated controls for the most-used operations (no menu navigation for play/pause), and a deliberately limited feature surface (the remote did not try to be a universal controller for the whole living room)26. The Peanut was discontinued when TiVo's broader business shifted away from standalone hardware. Its design principles were never seriously revived by Roku, Apple TV, or Amazon Fire, all of which moved toward smaller, fewer-button remotes that route through on-screen menus.
The early Nest Learning Thermostat (2011). The original Nest, designed by Tony Fadell's team at Nest Labs and released in 2011, replaced the touchscreen-and-button paradigm of contemporary smart thermostats with a single rotating ring that controlled the setpoint and a single press-to-confirm interaction27. The ring was the appliance layer; the touchscreen on the front was strictly the display. Reviews at the time, and the device's subsequent commercial success, validated the design. After Google's 2014 acquisition, subsequent Nest generations shifted more functionality to the touchscreen and to the phone app; the ring's primacy was diluted. The architectural reading: the appliance layer was the ring, and Google's organizational gravity moved away from ring-as-primary because Google is a software company whose default surface is a screen. The original Nest is the closest direct ancestor of what Stax Edition I tries to do.
The pattern across these four cases is consistent. Single-purpose, tactile-control-first objects sold to customers willing to pay a premium for an exquisitely-tuned appliance layer occupy a defensible commercial position. The category is not large in unit terms. It is large in margin-per-customer terms. The Mercantile Thesis predicts that the category compounds as the touchscreen-on-everything monoculture continues to fail its customers.
VIII. The objection — "isn't this just nostalgia?"
The strongest objection to the argument so far is that it sounds like nostalgia: that the appeal of physical knobs over touchscreens is an emotional preference for a previous era rather than a structural argument about better usability. The objection deserves a direct answer.
Nostalgia is the preference for an inferior earlier version of something for emotional reasons. Vinyl records sound objectively worse than digital audio on every measurable axis (dynamic range, noise floor, frequency response), and vinyl-buyers usually concede this and prefer vinyl anyway because of the ritual and the artwork and the memory. That is nostalgia, and it is a legitimate position for a consumer to hold — but it is not the position this essay defends.
The Phonograph is not a previous-version remote control. It is a 2026-vintage device built on a 2024 BLE 5.3 chipset, with OTA security updates, a 14-month CR2032 battery life enabled by the chip's deep-sleep modes, Matter and Thread protocol support on the same silicon, an AGPL-3.0-licensed firmware stack that the customer can audit and fork, and a NATS-based substrate that integrates with modern multi-room audio systems via UPnP/AirPlay/Spotify Connect. It is not the 1999 TiVo remote. It is the architectural posture of the 1999 TiVo remote — tactile-control-first, single-purpose, opinionated — implemented on 2026 substrate.
The same point applies category-wide. Euro NCAP's 2026 rule is not asking automakers to ship 1995-vintage dashboards. It is asking them to keep five specific functions (turn signals, hazard, wipers, horn, e-call) on physical controls because those functions have a measurable safety profile that touchscreens degrade. The rule is structural, not nostalgic.
A secondary version of the objection: "the touchscreen revolution was real and the gains were significant; you are romanticizing the limitations of the previous era." This is also worth answering directly. The touchscreen was the right interface for the smartphone — a device whose primary tasks are reading messages, browsing the web, navigating maps, and capturing media. Those are tasks the touchscreen does well. The argument is not that the touchscreen revolution was wrong. The argument is that the touchscreen was generalized into product categories where the underlying tasks were poorly matched to its strengths, and that the generalization was driven by industrial-design economics rather than by usability gains. The smartphone-and-tablet category should keep its touchscreens. The car, the stove, the audio system, the thermostat, and the living-room remote should not have been so aggressively re-tooled to imitate the smartphone.
The third version of the objection is about gradient-of-choice. Most consumer-hardware buyers do not care enough about the appliance-layer-architecture question to pay a premium for the tactile-control alternative. Even granting the full structural argument above, the addressable market for a $280 BLE remote is small relative to the addressable market for a $400 Sonos speaker. This is correct, and the financial model of Stax Editions is sized accordingly. The Phonograph's run is 100 units in its first quarter, not 100,000. The category does not need to be mass-market to be commercially viable; it needs to be premium-margin and audience-aligned, which is a different game. Anordain ships 200 watches a year and earns the right margin per unit. Leica ships M-series in the low thousands per year. The Stax Editions financial model is built on the same posture, not on competing with Sonos at Sonos's volume.
IX. The forward-pointer — what this means for the next ten years
The structural causes of the touchscreen-on-everything monoculture are persistent: industrial-design tooling economics, OTA-flexibility incentives, platform-framing preferences from the equity market. None of those have weakened in 2026. They will probably weaken slowly, asymmetrically, and category by category, rather than uniformly.
Four predictions for the 2026–2030 window. Each carries a capability-graded confidence label on the same scale as the Mercantile Thesis essay4: likely = better-than-even with concrete evidence; likely-to-near-certain = 70–90% with narrow named failure scenarios; uncertain but plausible = 30–50%, can't rule out the opposite.
It is likely-to-near-certain that at least one major automotive OEM will market physical climate controls as a competitive feature within the 2026–2028 window. Euro NCAP's 2026 rule18 makes the regulatory direction concrete; the next step is competitive marketing differentiation. Volkswagen has already announced (mid-2024) that the ID. line will return to physical buttons for primary controls28; the public acknowledgment by a major OEM that touchscreens-only was a mistake is a leading indicator that the broader industry will follow. Falsification: if by 2028-12-31 no major automaker has marketed physical-control return as a positive feature, the prediction missed.
It is likely that the smart-home product category fragments into two architectural postures: one segment serving the "single-app-controls-everything" mass market (Amazon Alexa, Google Home, Apple HomeKit), one segment serving the "every-room-has-its-knob" tactile-control audience (Stax Editions, Anordain-adjacent newcomers, indie hardware drop-houses). The two segments will sell to different customers at different price points. Falsification: if by 2030 the tactile-control segment does not exist as a distinct, named commercial category with multiple shipping products and identifiable customer base, the prediction missed.
It is likely that Sonos either acquires its way out of the appliance-layer problem (purchases a tactile-controls company or a non-cloud-mesh-protocol vendor) or is acquired by a larger consumer-electronics company that supplies the appliance layer (most plausibly Apple, possibly Amazon). The Mercantile lens predicts that substrate-excellent companies in the spread-scalper position get either repriced or absorbed; Sonos's substrate is too good for it to disappear, but its position is too weak for it to continue at current margins. Falsification: if Sonos is still independently operating, with stable or growing margins, with the S2-app paradigm unchanged, by 2029-12-31, the prediction missed.
It is uncertain but plausible that a new drop-house cluster — call it the Stax cluster or independently — emerges within 2026–2030, selling tactile-control-first consumer hardware at premium prices to the audience that no longer trusts the touchscreen-first incumbents. The financial model — four-component capsules at $200–$1,200, small runs, direct-to-customer, editorial-and-audio companions to justify the premium — is replicable. Whether multiple builders independently arrive at the same posture is the open question. The prediction is uncertain but plausible rather than likely because the posture requires both an editorial discipline and an engineering discipline that few teams have at once. Falsification: if by 2030 the Stax cluster is the only example of the four-component-capsule drop-house format, the prediction missed.
Dating the predictions removes the rhetorical comfort of structural-not-tactical hedging. The structural argument is correct or it is not. The predictions are the form in which it can be falsified.
X. Honest limitations
This essay is not the whole argument. Five limitations the essay does not pretend to have resolved.
1. The evidence base on appliance-layer failure rates is uneven. The automotive case is well-cited (Euro NCAP rule, Vi Bilägare benchmark, NHTSA guidelines). The white-goods case is weaker — the failure-rate evidence is from repair-industry trade press and aggregated user reports, not from a controlled study. The audio case rests heavily on published journalism (The Verge, Wired, Washington Post) and the Sonos CEO transition; I have not run a usability study. The argument is strongest where the evidence is strongest, weakest where the evidence is thinnest. A future revision should commission a usability benchmark of common audio-control workflows across the Sonos S2 app, the legacy Sonos Controller, a Bluesound system, and the Phonograph capsule. The benchmark does not exist as of this writing.
2. The premium-tactile-controls market is small. Even granting the structural argument, the addressable market for $280 BLE remotes is bounded. The Phonograph at 100-unit runs is a profitable small business; it is not Sonos at 13-million-speakers-per-year scale. The Mercantile lens predicts margin-per-customer, not volume. A reader interested in scaled-mass-market consumer electronics will find the argument less useful than a reader interested in premium niche categories. The honest framing is that the argument is sharpest at the high end of the consumer hardware market and weaker in the mass-market middle.
3. The substrate alternatives to Sonos are not fully proven. I cite Bluesound, KEF LSX, and the Stax NATS substrate as alternatives to the Sonos S2 paradigm. Each has its own limitations. Bluesound's mesh is smaller-scale than Sonos's and the third-party music-service integrations are narrower. KEF LSX is a stereo-pair product, not a multi-room system. The Stax NATS substrate is, as of May 2026, a working prototype rather than a fully-shipped consumer product. The argument that "tactile control plus a non-cloud-locked substrate is buildable" is true; the argument that "the substrate is already there off-the-shelf for the mass-market customer" is overstated. The Phonograph capsule includes the substrate work as part of the software component, which closes that gap for Edition I customers; it does not close the gap for the broader market.
4. The four-component-capsule format has not been proven across multiple Editions yet. The Phonograph is Edition I. The financial model that justifies a $280 premium for a tactile-control object — the bundle of essay, software, and audio that accompanies the object — is a thesis as of this essay's publication date (2026-05-15). The thesis will be tested by the Phonograph's actual reception. Editions II (the Almanac) and III (the Lineage Album) are committed but not yet shipped3. A reader sizing the financial argument should treat the four-component capsule as a hypothesis whose evidence is forthcoming, not as a proven category.
5. The Mercantile Thesis lens is structural, not predictive of timing. The lens predicts that substrate-excellent-appliance-layer-rented companies get repriced or absorbed eventually. It does not predict the quarter in which that happens. Sonos may continue at current margins for years before the structural pressure resolves. The Mercantile lens is the right frame for ten-to-twenty-year capital allocation, not for next-quarter trading. A reader sizing a short-term bet on the touchscreen-on-everything monoculture should use a different frame.
XI. The next essay
This is the editorial component of Stax Edition I. The other three components ship alongside it on the 2026 Q2 drop window:
- The object — the Phonograph itself — ships with full hardware schematics published, AGPL firmware, and SBOM-grade supply-chain disclosure. Each unit is etched-serial-numbered. Run of 100.
- The software — the AGPL companion macOS/iOS client plus the ESP32-C6 firmware — is published to the Stax
/labGitHub repository before drop day. It includes the NATS substrate, the Sonos UPnP shim, the AirPlay client, and the Spotify Connect bridge. - The audio — "The Phonograph as Object Lesson", a 60-minute Lineage Mode program tracing the history of single-purpose audio remotes from telegraph keys to studio fader handles to the Sonos volume knob — ships to
/canon/audio/before drop day.
The four components together constitute the Edition. The capsule is the unit; the object alone is a remote control, the essay alone is a critique, the software alone is a substrate, the audio alone is a podcast. Together they are the thesis.
The next essay in this arc is Stax Edition II's editorial companion, "The Daily-Page as Computation", which extends the appliance-layer doctrine from electronics to printed objects. The argument there: the printed wall-calendar, treated as an ambient-information surface, is the appliance layer that smart-display devices have failed to occupy. The Almanac is Stax Edition II, releasing 2026 Q3.
If this argument lands, the merchant lens has a third surface to work on after AI and audio: the entire category of consumer hardware whose substrate engineering has outrun its appliance layer. The list is not short. The work is not done.
Footnotes
This essay is the editorial component of Stax Edition I — the Phonograph. The other three components of the capsule (object, software, audio) ship to the Stax direct surface on the 2026 Q2 drop window.
Cross-references:
- Mercantile Thesis (the appliance-layer doctrine this essay applies to consumer hardware): mercantile-thesis
- Stax Editions drop-house charter (the four-component capsule format): /canon/methods/stax-editions-drop-house-charter
- Phonograph object specification: /objects/edition-i-phonograph
- Companion audio program: /canon/audio/phonograph-as-object-lesson
- Patrick Spence's resignation as CEO of Sonos was announced January 13, 2025. See The Verge, "Sonos CEO Patrick Spence steps down after app catastrophe," January 13, 2025, and the Sonos investor-relations press release of the same date. The board appointed Tom Conrad, a Sonos board member, as interim CEO; Conrad was confirmed as permanent CEO in subsequent months. ↩
- Sonos's hardware support cadence is documented at the company's support pages and in The Verge's coverage of the 2020 "S2 platform" transition, where Sonos extended software updates for "Sonos Connect" generation hardware originally released 2006–2011 through May 2020 — roughly nine to fourteen years of post-purchase support per device. The 2020 transition itself was controversial because devices that did not meet S2 requirements were placed on the legacy "S1" software branch; the controversy is a separate episode from the May 2024 app rewrite. ↩
- The Stax Editions drop-house charter is published at
/canon/methods/stax-editions-drop-house-charter(in the canon-on-stax.dev surface) and at~/codex/methods/stax-editions-drop-house-charter.mdin the canonical Codex tree. The charter establishes the four-component capsule format (object + essay + software + audio), the quarterly cadence, the numbering convention (Roman numerals, never reused), the pricing discipline (cost-plus, no scarcity-as-marketing), and the negative-space rules (no Web3, no fast-fashion graphics, no founder-edition tiers, no celebrity collaborations in year one). Amendment 0 (2026-05-14) is the initial charter; subsequent amendments will be numbered. ↩ - The Mercantile Thesis is published at
/posts/mercantile-thesisand applies the Quantitative Mercantilism framework to the AI market: foundation models as the utility layer, sovereign deployment as the appliance layer, and the merchant-principle audit as the rubric for distinguishing durable bottleneck-ownership from spread-scalping. The appliance-layer doctrine the present essay extends to consumer hardware is the same doctrine the Mercantile Thesis applies to AI. The five-question merchant-principle audit (flow, bottleneck, who-owns-it, who-taxes-the-spread, durable position) is the methodological backbone. ↩ - The Verge, "Sonos's new app is missing key features and Sonos owners are mad," May 2024 — the initial reporting on the post-launch regressions. The article documents the specific feature losses (local library, queue management, alarms) and the customer-service backlog that built across the first weeks after launch. ↩
- Wired, "Sonos's terrible app update is a cautionary tale for the smart home," July 2024 — the extended retrospective documenting the local-library regression, the accessibility-feature loss, and the company's evolving response across May–July 2024. ↩
- The Verge, "Sonos finally restored the alarms feature," July 2024 — the specific report on the alarms-feature restoration timeline, including the user-guidance to roll back to the legacy Controller app during the interim. ↩
- Coverage of the accessibility regression appeared in AppleVis (the blind/low-vision technology community site) in May–June 2024, and was acknowledged in Spence's August 2024 apology. The specific features affected were VoiceOver compatibility with the queue and now-playing views, and the loss of larger-touch-target accessibility modes. ↩
- Patrick Spence's public apology was published as a Sonos community-blog post on August 6, 2024, titled "An update from our CEO." The post acknowledged the rewrite's failures, named specific regressions, and committed to a recovery timeline. Spence's resignation followed approximately five months later. ↩
- Tom Conrad was appointed interim CEO of Sonos on January 13, 2025, the same day Spence's resignation was announced. Conrad was a Sonos board member at the time of appointment and had previously been a co-founder of Pandora. He was confirmed as permanent CEO in mid-2025. ↩
- Washington Post, technology section, "Sonos broke its own app. Now it's trying to fix everything," September 2024 — the extended retrospective from the Post's tech-policy desk, documenting the customer revolt across the May–September 2024 window. ↩
- The mandatory Sonos account requirement for local speaker control is documented in Sonos's own support pages ("Your Sonos account is required to set up and use Sonos") and is a consistent point of complaint in the Sonos community forum and r/Sonos. The architectural decision predates the 2024 app rewrite and was unchanged by it; it is a foundational property of the S2 platform. ↩
- Bluesound's BluOS platform supports local-network control of multi-room audio without mandatory cloud authentication. The architectural difference is documented in Bluesound's product literature and in third-party comparisons such as What Hi-Fi? "Bluesound vs Sonos: which is the better multi-room system?" (multiple revisions, 2022–2024). The point of this footnote is not to advocate for Bluesound but to establish that the architectural property "local multi-room control without mandatory cloud auth" is buildable; Sonos's choice is a design decision, not a technical necessity. ↩
- The latency timings I describe ("4–7 seconds on cold launch", "2 seconds on the legacy Controller app") are my own informal observations across roughly two weeks of personal use of the Sonos S2 app on an iPhone 15 Pro in May–June 2024. They are not a controlled benchmark and should not be cited as systematic evidence. The qualitative direction (the post-rewrite app is noticeably slower than the pre-rewrite Controller) is consistent with reports across the Sonos community forum and The Verge's coverage, but the specific numbers are mine and are unaudited. ↩
- The tap-count comparisons (pre-rewrite 2–3 taps vs. post-rewrite 4–6 taps for speaker grouping) are aggregated from r/Sonos threads and the Sonos community forum across May–July 2024. They are not a randomized usability study. The structural pattern — that common operations regressed in tap count — is consistent across multiple user reports but is not formally measured. ↩
- The Sonos Controller CR100 was discontinued circa 2012 and software-deprecated in 2018. The discontinuation announcement and the community response are documented in the Sonos forum archives and in The Verge's 2018 coverage of the "bricking" controversy (Sonos's term for the deprecation, which the customer base interpreted as bricking working hardware). The CR100's design — a screen plus a click-wheel plus dedicated transport buttons — was the closest Sonos product to what the Phonograph capsule aims at, and its discontinuation marks the company's architectural commitment to phone-app-as-only-controller. ↩
- For the broader automotive-touchscreen-adoption pattern across 2017–2024, see Car and Driver, "How the touchscreen took over the dashboard," 2022, and the MotorTrend 2023 retrospective on touchscreen-first interior design. Tesla's Model 3 (2017) and Model Y (2020) are the most-cited inflection-point vehicles; Volvo, Volkswagen ID.4, Ford Mustang Mach-E, and several Hyundai/Kia models followed across the next four years. ↩
- Euro NCAP, "Rating system update for 2026," March 2024 — the announcement that physical controls for five specific functions (direction indicators, hazard warning, wipers, horn, e-call) would be required for five-star ratings starting in 2026. The announcement explicitly cites driver-distraction research as the basis for the rule. ↩
- Vi Bilägare (Swedish: "We Car Owners" — Sweden's largest motoring magazine), "Best in test: the car with the fastest infotainment," August 2022. The closed-track test compared 11 modern vehicles (2020–2022 model years) against a 2005 Volvo V70, timing a four-task workflow at 110 km/h. The 2005 Volvo completed the four tasks in 10 seconds; the slowest modern vehicle (MG Marvel R) took 44.6 seconds. The full results are published in the original article and have been widely re-cited. ↩
- U.S. National Highway Traffic Safety Administration, Visual-Manual NHTSA Driver Distraction Guidelines for In-Vehicle Electronic Devices, April 2013 (originally proposed February 2012). The guidelines specify a 2-second-per-glance and 12-second-total-task ceiling for visual-manual interfaces. The guidelines are non-binding but are widely treated as the U.S. baseline for in-vehicle interface design. ↩
- iFixit's repair guides and community discussions document the higher failure rates and harder-to-repair nature of capacitive-touch appliance panels relative to mechanical-knob predecessors. The repair-industry trade press (notably Appliance Service News) has covered the pattern across 2020–2024. The evidence is qualitative and aggregated rather than from a single controlled study; I cite it with that caveat. ↩
- The Apple TV Siri Remote's design history is documented in Apple's own product-page archives and in The Verge's coverage of each generation. The 2015 redesign removed dedicated transport buttons in favor of a touch surface; the 2021 redesign partially restored the directional pad. Don Norman's writings on remote-control design — The Design of Everyday Things (1988, revised 2013) — provide the human-factors background. ↩
- The human-factors literature on tactile-feedback advantages in safety-critical control tasks dates to Alphonse Chapanis's work on aircraft cockpit design in the 1940s and 1950s. The canonical reference is Chapanis, Research Techniques in Human Engineering (Johns Hopkins Press, 1959). The principle that operators performing safety-critical tasks while attending to a primary workload (flying, driving) benefit from tactile-distinguishable controls over visually-identical controls is one of the foundational findings of the discipline. ↩
- Leica's M-series design philosophy is documented in the company's product literature and in retrospectives such as Popular Photography's history of the M-mount system. The M11 (2022) maintains the manual-focus, physical-aperture-ring, physical-shutter-dial architecture of every M-series body since the M3 in 1954. Annual production volumes are not publicly disclosed at fine granularity, but multiple sources (Leica's annual reports, industry-trade-press estimates) place M-series sales in the thousands-per-year range across the 2010s and 2020s. ↩
- Anordain (Glasgow, Scotland) is a small independent watchmaker founded in 2015. Their published production cadence is approximately 200 watches per year, with longer waitlists than the production cycle. The Model 1 retails in the £1,800–£3,000 range depending on dial configuration. The company's public commitments are documented at anordain.com and in Hodinkee's coverage of the brand across 2018–2024. ↩
- The TiVo "Peanut" remote (1999) was designed by Paul Newby and Steve Perlman's team at TiVo. The design is discussed in IDEO's design retrospectives, in Don Norman's later writings, and in The Atlantic's "How the TiVo remote conquered the living room" (2018, retrospective). The remote was discontinued when TiVo's business pivoted away from standalone DVR hardware. ↩
- The original Nest Learning Thermostat (2011) was designed by a team led by Tony Fadell, previously of Apple's iPod division. The design — a single rotating ring, a press-to-confirm interaction, a touchscreen front used strictly as a display — is documented in Fadell's own Build (2022), in Wired's 2011 launch coverage, and in subsequent retrospectives across 2012–2020. Google's 2014 acquisition for $3.2 billion and the subsequent dilution of the ring-as-primary paradigm are documented in The Verge's ongoing coverage of the Nest product line. ↩