Case Study · 03

Designing continuous music experiences for Alexa

How rapid experimentation evolved into a cross-device product vision and shipped experiences.

Amazon Alexa·2018–2022·UX Designer II·Consumer AI · Voice · Multimodal
Public Amazon product imagery showing an Echo Show displaying an Alexa music experience.
Public Amazon product imagery
Overview

Experiment · Envision · Ship

Music was one of Alexa's most widely used capabilities, but usage was often narrower than the product's possibilities. Customers did not always know what they could ask, how to discover new functionality, or how to continue listening as their location, device, or account context changed.

I worked across a progression of Alexa Music initiatives: first building a high-velocity experimentation practice, then translating those learnings into a longer-term product vision, and finally designing experiences that made music more personal and continuous across shared and connected devices.

Stage 1 · Experiment

Learn quickly

A high-velocity practice that combined customer evidence, focused hypotheses, voice and visual prototypes, limited releases, and iterative evaluation.

Stage 2 · Envision

Look beyond features

Repeated signals about discovery, context, and continuity opened a broader question: when could music become useful or meaningful throughout a customer's day?

Stage 3 · Ship

Design for continuity

The work expanded into experiences that let customers access personal content on shared devices and move active media between compatible devices.

Challenge 01

Discoverability

Customers often used the most familiar voice request while remaining unaware of broader music capabilities.

Challenge 02

Engagement

Voice products have no persistent visual navigation. A feature can exist without customers knowing what to ask for.

Challenge 03

Shared devices

Echo devices live in communal spaces, friends' homes, and temporary locations. The owner's identity is not always the listener's.

Challenge 04

Cross-device continuity

Music may begin on one speaker, continue in another room, or move from a group to a single device. Customers need a clear model of what is playing and where.

Learning through experimentation

A learning system, not a collection of tests.

The experimentation program was created to explore customer problems through smaller, faster product changes rather than relying only on large annual releases. I supported the work end to end: combining customer evidence, defining experience problems, designing voice and visual interactions, partnering through implementation, and evaluating what should be expanded, revised, or discontinued.

Experiment Loop
01
Observe

Customer feedback, support themes, interaction failures, behavioral patterns, prior research.

02
Frame

Convert engagement concerns into a focused customer problem and a testable hypothesis.

03
Design

Voice, visual UI, mobile configuration, smart-display interaction, or a coordinated multimodal flow.

04
Release carefully

Introduce to a limited eligible audience with appropriate safeguards.

05
Evaluate

Review aggregate behavioral and qualitative evidence.

06
Expand, revise, or stop

Discontinuing a weak idea is a valid result.

Sanitized representation of the experimentation process. Internal measurement and business results are omitted.

A representative experiment

One recurring question was whether Alexa could introduce a relevant music capability during another natural interaction, rather than waiting for the customer to know the exact request. I designed contextual voice concepts, defined where a suggestion would and would not be appropriate, and worked with partners on safeguards around timing, repetition, device state, and control.

  1. Step 1 · Customer

    Completes a common Alexa interaction.

  2. Step 2 · Alexa

    Provides the expected response.

  3. Step 3 · Alexa

    When appropriate, offers one concise, relevant music suggestion.

Portfolio-native abstraction. No internal prompts, experiment percentages, or business results are reproduced.

Relevance before promotion

A suggestion should relate to the customer's immediate context rather than feel like advertising.

Restraint is part of the interaction

The experience must account for repetition, interruption, device state, and situations where silence is better.

Voice and visual must agree

Customers should not encounter capabilities they can start in one modality but cannot understand or manage in another.

Future-of-music direction

Turning experiment learning into a product vision.

Individual experiments answered narrow questions, but together they revealed a broader opportunity: music could feel more continuous and context-aware without becoming intrusive. I organized and co-facilitated a five-day cross-functional vision sprint to explore that opportunity, align multiple Alexa teams, create testable concepts, and gather customer feedback. My role spanned framing, participant alignment, leadership review, co-facilitation, research partnership, scenario writing, and socializing the direction across the organization.

Five-Day Vision Sprint
Day 1
Map

Align around customer moments, constraints, risks, and opportunities.

Day 2
Sketch

Generate multiple product directions independently before converging.

Day 3
Decide

Select concepts according to customer value, clarity, and testability.

Day 4
Prototype

Create coordinated voice and visual scenarios at sufficient fidelity for feedback.

Day 5
Learn

Use research to understand expectations, control, trust, and comprehension.

Sanitized sprint structure. Specific participants, research sessions, and internal artifacts are omitted.

Context should be understandable

Customers should understand why Alexa is making a suggestion or continuing an experience.

Proactivity requires control

A system may reduce effort without removing the customer's ability to accept, decline, edit, or stop.

Continuity should feel intentional

Moving between devices should preserve what is playing, where it is playing, and what will happen next.

Making invisible experiences tangible

A storyboard or video could explain what a screen could not.

Many of the most ambitious Alexa Music concepts did not begin with a screen. Guest Connect depended on Alexa recognizing and applying the right identity on a shared device. Media-portability concepts involved music continuing or moving as a customer changed rooms or devices. Future-of-music explorations focused on contextual behavior rather than a conventional graphical interface. That made the work difficult to communicate through static wireframes alone.

Guest Connect in particular required two cross-functional product areas to agree on an experience involving identity, permissions, shared devices, and personal content. The written proposal contained the answers, but the experience remained too abstract to reconstruct mentally. Leadership review surfaced repeated questions the document technically addressed.

I proposed making the experience visible as a short scenario video. I wrote the script, coordinated teammates as participants, arranged access to a simulated home environment, filmed the end-to-end experience, and edited the final narrative. In the next review, leaders could understand the experience as a whole. It created the alignment the written document had not achieved and helped the project move forward.

The video was not marketing added after the design. It was a functional design artifact used to validate the end-to-end model and enable a product decision.

Reconstructed ambient-experience storyboard
Frame 1
Enter shared environment

The customer arrives in a space with a device they do not own.

Frame 2
Establish context

Alexa recognizes the relevant identity and conditions.

Frame 3
Personal content available

With permission, the customer's content becomes accessible.

Frame 4
Active state is clear

The customer understands which identity or device state is active.

Frame 5
End or transfer cleanly

The experience concludes or moves with no ambiguity left behind.

Illustrative storyboard reconstructed for this portfolio.

Alexa Guest Connect

Your music on a device you do not own.

Amazon publicly introduced Guest Connect as a way for customers, with permission, to access their Alexa account and personal music or news from compatible Echo devices they do not own.

I designed account-connection and personalization flows that helped a guest establish identity, receive permission, access eligible personal content, and leave the host device without ambiguity about whose account was active. The scenario-video work described in the previous chapter unblocked the cross-team alignment this experience depended on.

Amazon publicly introducing Alexa Guest Connect at an Amazon event in September 2019.
Public Amazon product imagery
Guest Connect account journey
01
Host device

An Echo belonging to someone else.

02
Guest requests access

The guest asks to connect their account.

03
Mutual confirmation

Both sides approve the connection.

04
Guest content available

Personal music or news plays on the host's device.

05
Guest disconnects

The host's account returns to active state.

Simplified public explanation of the Guest Connect account journey. Authentication implementation details are omitted.

Media portability

Your music as you move between devices.

A listening experience can span several devices and rooms. Customers needed to understand what was playing, choose an appropriate destination, and move or extend playback without reconstructing the session. I led experience design for short- and medium-term media-portability concepts across voice and multimodal surfaces, defining the information architecture, interaction flows, visual patterns, edge states, and the relationship between active streams and available devices.

Core interaction model
01
Understand current state

Show which content is active and where it is playing.

02
Choose a destination

Identify a compatible device or group through voice or touch.

03
Resolve conflicts

Explain what will happen if the destination is already active or unavailable.

04
Confirm the result

Make the new playback location clear without forcing the customer to infer state.

Portfolio-native flow. Internal wireframes and unshipped grouping concepts are not published.

Voice

Direct intent

Best when the customer already knows where the music should go.

Touch

Inspect and resolve

Best for inspecting active playback, comparing destinations, and resolving more complicated household states.

Voice and touch were not parallel versions of the same interface. Each modality handled a different part of the decision.

Principles, outcomes, and reflection

What the progression established.

Proactivity is an interaction contract

A proactive assistant must be contextual, understandable, restrained, and easy to stop.

Multimodal design is division of labor

Voice, touch, screens, and mobile confirmation should each handle the part of the task they are best suited to.

Match the artifact to the invisibility of the experience

For ambient and multimodal systems, scenarios, comics, videos, and state models can communicate the experience more accurately than a polished mockup.

Public product outcomes
  • Alexa Guest Connect was publicly introduced as a way to access personal content on supported Echo devices with permission.
  • Alexa supports moving audio between compatible Echo devices and groups.
  • The broader product direction connected identity, personalization, and cross-device continuity.
My design contribution
  • Established an end-to-end experimentation practice that integrated customer evidence, voice design, visual design, implementation, and iteration.
  • Organized and co-facilitated a cross-functional vision sprint that turned experiment learning into a longer-term product direction.
  • Designed personalization and account-state experiences for Alexa Guest Connect.
  • Led information architecture and interaction design for media-portability experiences across voice, touch, smart speakers, smart displays, and the Alexa app.
  • Helped shift the product question from isolated music features toward continuous listening across context and devices.

This work taught me to design across distributed systems where identity, context, modality, and platform boundaries shape the customer experience. I later carried those lessons into enterprise AI — first by designing Workday inside Microsoft Copilot, and then by founding the product design for Workday Agent System of Record.

In a live walkthrough, I can discuss additional iteration and collaboration detail while preserving confidential Amazon product information.

Confidentiality note

This case study uses public Amazon product information, official public imagery, and process diagrams recreated specifically for this portfolio. Internal metrics, customer quotations, proprietary tools, raw research materials, unreleased concepts, and confidential implementation details have been omitted.

Public sources

Where the public information comes from.