Designing continuous music experiences for Alexa
How rapid experimentation evolved into a cross-device product vision and shipped experiences.

Experiment · Envision · Ship
Music was one of Alexa's most widely used capabilities, but usage was often narrower than the product's possibilities. Customers did not always know what they could ask, how to discover new functionality, or how to continue listening as their location, device, or account context changed.
I worked across a progression of Alexa Music initiatives: first building a high-velocity experimentation practice, then translating those learnings into a longer-term product vision, and finally designing experiences that made music more personal and continuous across shared and connected devices.
Learn quickly
A high-velocity practice that combined customer evidence, focused hypotheses, voice and visual prototypes, limited releases, and iterative evaluation.
Look beyond features
Repeated signals about discovery, context, and continuity opened a broader question: when could music become useful or meaningful throughout a customer's day?
Design for continuity
The work expanded into experiences that let customers access personal content on shared devices and move active media between compatible devices.
Discoverability
Customers often used the most familiar voice request while remaining unaware of broader music capabilities.
Engagement
Voice products have no persistent visual navigation. A feature can exist without customers knowing what to ask for.
Shared devices
Echo devices live in communal spaces, friends' homes, and temporary locations. The owner's identity is not always the listener's.
Cross-device continuity
Music may begin on one speaker, continue in another room, or move from a group to a single device. Customers need a clear model of what is playing and where.
A learning system, not a collection of tests.
The experimentation program was created to explore customer problems through smaller, faster product changes rather than relying only on large annual releases. I supported the work end to end: combining customer evidence, defining experience problems, designing voice and visual interactions, partnering through implementation, and evaluating what should be expanded, revised, or discontinued.
Customer feedback, support themes, interaction failures, behavioral patterns, prior research.
Convert engagement concerns into a focused customer problem and a testable hypothesis.
Voice, visual UI, mobile configuration, smart-display interaction, or a coordinated multimodal flow.
Introduce to a limited eligible audience with appropriate safeguards.
Review aggregate behavioral and qualitative evidence.
Discontinuing a weak idea is a valid result.
Sanitized representation of the experimentation process. Internal measurement and business results are omitted.
A representative experiment
One recurring question was whether Alexa could introduce a relevant music capability during another natural interaction, rather than waiting for the customer to know the exact request. I designed contextual voice concepts, defined where a suggestion would and would not be appropriate, and worked with partners on safeguards around timing, repetition, device state, and control.
- Step 1 · Customer
Completes a common Alexa interaction.
- Step 2 · Alexa
Provides the expected response.
- Step 3 · Alexa
When appropriate, offers one concise, relevant music suggestion.
Portfolio-native abstraction. No internal prompts, experiment percentages, or business results are reproduced.
Relevance before promotion
A suggestion should relate to the customer's immediate context rather than feel like advertising.
Restraint is part of the interaction
The experience must account for repetition, interruption, device state, and situations where silence is better.
Voice and visual must agree
Customers should not encounter capabilities they can start in one modality but cannot understand or manage in another.
Turning experiment learning into a product vision.
Individual experiments answered narrow questions, but together they revealed a broader opportunity: music could feel more continuous and context-aware without becoming intrusive. I organized and co-facilitated a five-day cross-functional vision sprint to explore that opportunity, align multiple Alexa teams, create testable concepts, and gather customer feedback. My role spanned framing, participant alignment, leadership review, co-facilitation, research partnership, scenario writing, and socializing the direction across the organization.
Align around customer moments, constraints, risks, and opportunities.
Generate multiple product directions independently before converging.
Select concepts according to customer value, clarity, and testability.
Create coordinated voice and visual scenarios at sufficient fidelity for feedback.
Use research to understand expectations, control, trust, and comprehension.
Sanitized sprint structure. Specific participants, research sessions, and internal artifacts are omitted.
Context should be understandable
Customers should understand why Alexa is making a suggestion or continuing an experience.
Proactivity requires control
A system may reduce effort without removing the customer's ability to accept, decline, edit, or stop.
Continuity should feel intentional
Moving between devices should preserve what is playing, where it is playing, and what will happen next.
A storyboard or video could explain what a screen could not.
Many of the most ambitious Alexa Music concepts did not begin with a screen. Guest Connect depended on Alexa recognizing and applying the right identity on a shared device. Media-portability concepts involved music continuing or moving as a customer changed rooms or devices. Future-of-music explorations focused on contextual behavior rather than a conventional graphical interface. That made the work difficult to communicate through static wireframes alone.
Guest Connect in particular required two cross-functional product areas to agree on an experience involving identity, permissions, shared devices, and personal content. The written proposal contained the answers, but the experience remained too abstract to reconstruct mentally. Leadership review surfaced repeated questions the document technically addressed.
I proposed making the experience visible as a short scenario video. I wrote the script, coordinated teammates as participants, arranged access to a simulated home environment, filmed the end-to-end experience, and edited the final narrative. In the next review, leaders could understand the experience as a whole. It created the alignment the written document had not achieved and helped the project move forward.
The video was not marketing added after the design. It was a functional design artifact used to validate the end-to-end model and enable a product decision.
The customer arrives in a space with a device they do not own.
Alexa recognizes the relevant identity and conditions.
With permission, the customer's content becomes accessible.
The customer understands which identity or device state is active.
The experience concludes or moves with no ambiguity left behind.
Illustrative storyboard reconstructed for this portfolio.
Your music on a device you do not own.
Amazon publicly introduced Guest Connect as a way for customers, with permission, to access their Alexa account and personal music or news from compatible Echo devices they do not own.
I designed account-connection and personalization flows that helped a guest establish identity, receive permission, access eligible personal content, and leave the host device without ambiguity about whose account was active. The scenario-video work described in the previous chapter unblocked the cross-team alignment this experience depended on.

An Echo belonging to someone else.
The guest asks to connect their account.
Both sides approve the connection.
Personal music or news plays on the host's device.
The host's account returns to active state.
Simplified public explanation of the Guest Connect account journey. Authentication implementation details are omitted.
Your music as you move between devices.
A listening experience can span several devices and rooms. Customers needed to understand what was playing, choose an appropriate destination, and move or extend playback without reconstructing the session. I led experience design for short- and medium-term media-portability concepts across voice and multimodal surfaces, defining the information architecture, interaction flows, visual patterns, edge states, and the relationship between active streams and available devices.
Show which content is active and where it is playing.
Identify a compatible device or group through voice or touch.
Explain what will happen if the destination is already active or unavailable.
Make the new playback location clear without forcing the customer to infer state.
Portfolio-native flow. Internal wireframes and unshipped grouping concepts are not published.
Direct intent
Best when the customer already knows where the music should go.
Inspect and resolve
Best for inspecting active playback, comparing destinations, and resolving more complicated household states.
Voice and touch were not parallel versions of the same interface. Each modality handled a different part of the decision.
What the progression established.
Proactivity is an interaction contract
A proactive assistant must be contextual, understandable, restrained, and easy to stop.
Multimodal design is division of labor
Voice, touch, screens, and mobile confirmation should each handle the part of the task they are best suited to.
Match the artifact to the invisibility of the experience
For ambient and multimodal systems, scenarios, comics, videos, and state models can communicate the experience more accurately than a polished mockup.
- Alexa Guest Connect was publicly introduced as a way to access personal content on supported Echo devices with permission.
- Alexa supports moving audio between compatible Echo devices and groups.
- The broader product direction connected identity, personalization, and cross-device continuity.
- Established an end-to-end experimentation practice that integrated customer evidence, voice design, visual design, implementation, and iteration.
- Organized and co-facilitated a cross-functional vision sprint that turned experiment learning into a longer-term product direction.
- Designed personalization and account-state experiences for Alexa Guest Connect.
- Led information architecture and interaction design for media-portability experiences across voice, touch, smart speakers, smart displays, and the Alexa app.
- Helped shift the product question from isolated music features toward continuous listening across context and devices.
This work taught me to design across distributed systems where identity, context, modality, and platform boundaries shape the customer experience. I later carried those lessons into enterprise AI — first by designing Workday inside Microsoft Copilot, and then by founding the product design for Workday Agent System of Record.
In a live walkthrough, I can discuss additional iteration and collaboration detail while preserving confidential Amazon product information.
This case study uses public Amazon product information, official public imagery, and process diagrams recreated specifically for this portfolio. Internal metrics, customer quotations, proprietary tools, raw research materials, unreleased concepts, and confidential implementation details have been omitted.