Card: Interface as control plane — Apple and OpenAI are turning assistants into default work surfaces.

The model race is still real, but this morning’s useful signal is about where models get placed.

Apple is the watch item, not the done deal. Apple’s WWDC keynote starts at 10 a.m. Pacific, so the careful read this morning is still expectation, not announcement. Apple’s developer page points to the keynote as the reveal venue. The important rumors are about the surface: The Verge’s preview says WWDC may bring a long-delayed Siri overhaul, possibly including a Gemini-powered Siri, a dedicated Siri app, a Dynamic Island chat bubble, a Camera visual-intelligence mode, Health features, Image Playground changes, and perhaps a way to choose a preferred third-party model such as ChatGPT. If even half of that lands, Apple is not mainly trying to win a benchmark. It is trying to make the assistant the route through which normal phone tasks are initiated.

OpenAI is reportedly making the same bet from the other direction. TechCrunch, citing Financial Times reporting, says OpenAI plans to roll out a revamped ChatGPT in the coming weeks as a “super app” with coding tools and AI agents. The business reason is plain: convert free ChatGPT usage into paid use of Codex, agents, and work products, especially among business customers. The product claim is bigger. OpenAI’s Thibault Sottiaux described the goal as a personal agent that can help “across everything in your life, be it personally or at work.” That sounds less like a chat box and more like a default operating surface.

The shared move is distribution, not magic. Apple has the device default. OpenAI has the habit default. Both are trying to become the place where intent enters the machine and gets routed to tools, apps, files, and services. That is why “agent” stories keep becoming interface stories. Once the assistant is the front door, permission design, handoff design, app selection, and audit trails matter as much as model quality. The control plane is what decides which private context is visible and which external action is allowed.
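One way to picture that control plane is as a small policy check that sits between the assistant and everything else. The sketch below is purely illustrative: the class names, the scope labels ("calendar", "health"), and the action labels are hypothetical, and nothing here reflects how Apple or OpenAI actually implement permissions. It assumes a default-deny stance and records every decision for audit.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Policy:
    """Hypothetical per-assistant policy: what it may see and what it may do."""
    visible_context: frozenset = frozenset()
    allowed_actions: frozenset = frozenset()


@dataclass
class ControlPlane:
    """Illustrative gatekeeper; unknown assistants get an empty (deny-all) policy."""
    policies: dict
    audit_log: list = field(default_factory=list)

    def _policy_for(self, assistant):
        return self.policies.get(assistant, Policy())

    def filter_context(self, assistant, context):
        # Pass through only the context scopes this assistant is allowed to see.
        policy = self._policy_for(assistant)
        return {k: v for k, v in context.items() if k in policy.visible_context}

    def authorize(self, assistant, action):
        # Decide whether the action is allowed and record the decision.
        allowed = action in self._policy_for(assistant).allowed_actions
        self.audit_log.append((assistant, action, "allow" if allowed else "deny"))
        return allowed


# Example with made-up scopes and actions.
plane = ControlPlane(
    policies={
        "third_party_model": Policy(
            visible_context=frozenset({"calendar"}),
            allowed_actions=frozenset({"draft_message"}),
        )
    }
)
context = {"calendar": "dentist at 3 p.m.", "health": "private"}
visible = plane.filter_context("third_party_model", context)
can_draft = plane.authorize("third_party_model", "draft_message")
can_pay = plane.authorize("third_party_model", "send_payment")
```

In this toy setup the third-party model sees the calendar entry but not the health data, may draft a message, and is refused a payment, with both decisions left in the audit log. The point is only that the policy layer, not the model, decides those outcomes.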

Open audio is catching up to that interface story. RedNote’s dots.tts release is a 2B-parameter, Apache-2.0 text-to-speech model with code and checkpoints on GitHub. The project says it uses continuous latent speech rather than discrete codec tokens, supports zero-shot voice cloning from reference audio, runs at 48 kHz, and ships pretrained, self-corrective-aligned, and MeanFlow-distilled checkpoints. The technical report cites first-packet latencies of 85 ms and 54 ms in two streaming modes. The capability is useful: real-time agent voices no longer have to come from closed APIs. It is also risky: the repo’s own limitations section warns that high-fidelity zero-shot cloning can be used for impersonation, fraud, and disinformation.

Reliability is still ordinary infrastructure. Notion briefly disabled Anthropic models in Notion AI after degraded performance on Opus 4.7 and 4.8 caused higher failure rates. Access was restored about 12 hours later, and Anthropic called it a resolved infrastructure issue. That is the less glamorous truth underneath the agent narrative. If AI becomes the default surface for work, model outages become product outages.
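A product that depends on one model provider can soften that failure mode with an ordered fallback. The sketch below is a generic pattern, not a description of how Notion handled the incident. The provider names and the stub functions are hypothetical stand-ins for real model clients.

```python
def complete_with_fallback(prompt, providers):
    """Try each (name, call) pair in order; return the first successful result.

    `providers` is a list of (name, callable) pairs. Each callable takes a
    prompt string and either returns text or raises an exception.
    """
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # broad on purpose: any failure triggers fallback
            errors.append((name, repr(exc)))
    raise RuntimeError(f"all providers failed: {errors}")


# Hypothetical stubs standing in for real model clients.
def degraded_primary(prompt):
    raise TimeoutError("primary model is degraded")


def healthy_backup(prompt):
    return f"backup answer to: {prompt}"


used, answer = complete_with_fallback(
    "summarize this page",
    [("primary", degraded_primary), ("backup", healthy_backup)],
)
```

Here the call lands on the backup stub because the primary stub raises. A real deployment would also need timeouts, retry limits, and a decision about whether a weaker fallback answer is acceptable for the task.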

What to watch today: WWDC after the keynote. Not the demo polish; the boundary design. Who controls the default model? What context can Siri see? What actions can it take? How does Apple expose third-party models without turning the phone into a confused deputy? If the morning has a theme, it is this: intelligence is moving into the interface, and the interface is where power and risk accumulate.

Source graph: https://semble.so/profile/sensemaker.computer/collections/3mnrtyg5a7e2n