AI Roleplay

Orum

Overview

I led design for Orum's AI Roleplay feature — a 0→1 product built on a lean three-person team. The feature gives sales reps realistic, on-demand practice conversations so they can build confidence, refine their approach, and get consistent feedback without waiting on a manager's calendar.

My biggest design challenge wasn't the interface. It was making AI interactions feel trustworthy and human to users who'd never practiced with a bot before — some found the idea unsettling at first — while still delivering a meaningful, scalable training experience.

AI design focus: Prompt architecture, AI-generated content visual language, model constraint tradeoffs, trust and adoption patterns for AI-skeptical users.

Strategic rationale and business impact

Team

We were a dedicated experimental team focused on fast-paced exploration, prototyping, and validation of new product concepts, with the goal of pinpointing market fit, de-risking big bets, and uncovering high-impact opportunities for future business growth. The team operated leanly: one product manager, one full-stack engineer, and one designer. That structure let us move fast and iterate with minimal overhead.

I owned end-to-end design: research synthesis, information architecture, interaction design, content strategy, and the visual language system for AI-generated content. I also contributed to scenario prompt architecture alongside the PM and engineering lead, helping define how inputs like call type, objection difficulty, and rep goals shaped what the AI produced.

Documentation

Timeline

Role

Goal

2 months (Design completion)

Design lead

Ship V1 of a brand new feature called AI Roleplays
Test, validate and iterate on scenarios

Research + Discovery

User pain points and needs

To understand the problem space, our team visited several customer on-sites in NYC — observing workflows and speaking directly with reps and managers in their environment.

Managers were relying on Excel sheets to track training and onboarding materials: a lot of manual admin work that provided little visibility into how reps were actually performing or engaging with content.

On AI specifically: Users were generally open to the idea of AI Roleplay, but had real concerns about the experience not feeling human enough. Some found the AI avatar a bit unsettling at first.

Reps told us they often felt unprepared and pressured going into calls. Many wanted more structured practice to refine their voice and methods, especially those working toward AE promotions.

Without a better option, some watched recordings of peer AEs and manually took notes: a workaround that highlighted just how little structured, interactive learning existed.

Managers/team leads

Reps

Based on these insights, we hypothesized that AI Roleplay could offer the kind of realistic, on-demand practice that traditional training couldn't match: faster ramp times, more consistent coaching, and a shift from passive learning to active skill development. The challenge was designing an experience that felt approachable enough for skeptical users while still delivering meaningful practice aligned with real-world needs.

Strategic rationale and business impact

Competitive analysis:
None of our competitors were exploring AI Roleplay within the context of an AI dialing tool, and the broader AI Roleplay space was still relatively niche — which made this a real differentiation opportunity.

For Orum:
Orum had also recently made a significant investment in a broader AI Coaching Suite, positioning the product to deliver a more comprehensive, end-to-end training experience.

Adding AI Roleplay felt like a natural next step: it deepened the product's value and gave the Coaching Suite a practice layer that made the whole suite more compelling.

Exploration + iteration

Understanding user workflows and industry best practices

Before designing anything, the team needed to ground ourselves in real sales funnel processes, trends, and best practices — so the scenarios we built would reflect realistic, high-impact situations for SDRs rather than generic practice content.

Copy considerations

I owned the content strategy for this feature, and copy decisions were as consequential as any interaction decision. We made a deliberate choice to avoid technical terms like "bot" — not just for tone, but because the word itself was a trust barrier for skeptical users. Language throughout the experience emphasized guidance and support rather than replacement.

Avoiding the word "bot" was a strategic call, not a style preference. Users who felt the AI was replacing something human were less likely to engage authentically with the practice. The copy needed to reframe the experience before the UI even loaded.

Clarity, reassurance, and tone were especially important given that some users were unfamiliar with AI tools — or actively skeptical of them.

Prompt architecture for AI-generated scenarios

I worked with the PM and engineering lead to design the prompt structure that powered scenario generation. Rather than leaving this to chance, we defined a repeatable framework:

Identify common call types based on sales funnel stages and typical sales processes
Assign difficulty using topic complexity, likely objections, and anticipated rep questions
Define rep goals aligned with industry standards for handling each scenario type

Each generated scenario included a greeting, title, short summary, specific prompt, rep goals, and avatar assignment.After the initial release, we collected user feedback to understand how scenarios resonated across different teams. Those insights informed iterative improvements — gradually moving from broadly applicable scenarios toward more tailored, team-specific practice.

Design considerations

Page navigation

I explored several approaches to organizing the AI Roleplay landing page so reps could quickly find relevant scenarios. This included grouping by scenario type, role, and difficulty level. User feedback helped determine which structure made the page feel clearest and most intuitive.

Recommended scenarios based on call history and role

Organized by scenario difficulty

Flatter structure with more specific detail surfaced within cards

Scenario cards

Designing the scenario cards was one of the harder interaction problems on this project. The real tension was between discoverability and visual noise — too little information and reps couldn't tell scenarios apart; too much and they'd stop reading the cards entirely.

My priorities were to make the page feel lively and inviting while surfacing what reps cared about most: scenario type, difficulty, AI avatar personality, and goals. I ran rapid exploration cycles testing thumbnail treatments, responsive layouts, text length, badges and chips for quick scanning, and hover states for additional context.

Each iteration helped clarify what was essential versus distracting. The final cards were designed to be visually appealing, quickly scannable, and accessible — with legible text, sufficient contrast, and intuitive interactions across devices.

A more guided experience — and why we moved away from it

I explored a version of the roleplay experience that included a criteria checklist visible on-screen as reps moved through a scenario — essentially scaffolding the conversation in real time.

We ultimately decided against it. The best salespeople don't rely on scripts — they depend on intuition, adaptability, and their own communication style. An on-screen criteria panel risked training a crutch rather than a skill, and would likely become a distraction rather than a help once the conversation got going. The lighter, more streamlined experience better supported natural conversation flow.
Any guidance needed to be introduced thoughtfully — not plastered across the screen mid-call.

Navigate scenarios in a side panel

Criteria checklist in panel

Scenarios broken into chapters so users can retry and skip to relevant sections

Visual language for AI-generated content

One of the more nuanced design problems was figuring out how to visually distinguish AI-generated content from fields that needed manual input. Without clear cues, users weren't sure what to trust, review, or act on.

I explored several approaches: an AI gradient color treatment for auto-filled fields, a star icon as a visual cue within text field labels, and label copy adjustments to explicitly surface when content was AI-generated. Defining these consistent visual cues helped users immediately understand what the AI produced versus what they needed to review or customize — a pattern that became a reusable part of the broader design system.

Customization and scalability

After the initial release, it became clear that reps needed more than standardized scenarios — they wanted to tailor practice to the factors that actually shape their calls: team size, business quarter, market conditions, shifting economic context. The design challenge was balancing flexibility with ease of use. Reps needed to drill down into relevant details without being burdened by a complex setup or too much upfront input. Too many fields and they'd abandon the flow; too few and the output wouldn't feel relevant to their situation.

High level flow:

The user provides a basic prompt describing the business context or scenario they need.

The tool uses Orum's company and user profile data to fill in missing details and generate a tailored output.

The user reviews the generated scenario.

The user confirms and finalizes it.

I explored several UI approaches for the creation flow, each representing a different point on the effort-vs-control spectrum.

Option A (Minimal effort): Users provide a basic prompt and Orum handles the rest — assigning avatars, generating summaries, filling gaps. Lower friction, faster to complete.

Option B (Full control): Users can customize down to avatar selection and avatar personality. More investment upfront, but the output feels more specifically theirs.

Option C (Admin-level): Admins and managers can set custom metrics and criteria that shape how reps' scenarios are evaluated — useful for teams with specific coaching frameworks or performance goals.

The entry point had to work for all three — so rather than defaulting to the most complex version, I designed a single upfront choice: generate automatically or enter manually. That one decision routed users to the right level of involvement before they ever saw a form field.

Constraints with AI tooling

As we rolled out V1, we continuously adapted to evolving user needs and the real constraints of our AI provider — navigating model limitations while also optimizing for cost and performance. Some features we'd initially designed had to be scoped back or redesigned entirely based on what the model could reliably do within budget.This was a meaningful part of the project: designing within constraint rather than ideal conditions, and making explicit tradeoffs between capability, cost, and user experience.

Analytics for managers: A post-launch insight

After launch, a pattern emerged in manager feedback: they could see reps were doing roleplays, but had no visibility into how. They couldn't tell which scenarios were being used, which reps were practicing consistently, or whether the tool was actually driving skill improvement.I scoped and designed an analytics layer that surfaced scenario usage by team and by rep. It was a relatively contained addition, but it reframed what AI Roleplay was — not just a practice tool for individual reps, but a coaching asset managers could actively use. That shift was important for adoption at the team level, not just individual usage.

Quality and feedback loop

We implemented a feedback dialog using Appcues to capture user insights directly within the product — in context, not after the fact.Collecting feedback this way gave us more actionable, real-time data than post-session surveys would have. It let us iterate faster, prioritize improvements based on actual user needs, and keep the product aligned with how SDRs were working day to day

Unifying the platform: Moving AI Roleplay into the Coaching Suite

We implemented a feedback dialog using Appcues to capture user insights directly within the product — in context, not after the fact.Collecting feedback this way gave us more actionable, real-time data than post-session surveys would have. It let us iterate faster, prioritize improvements based on actual user needs, and keep the product aligned with how SDRs were working day to day

Final design

Home

Analytics: Overview

Analytics: History

Create custom AI Roleplay: Add info

Create custom AI Roleplay: Generating state

Impact

Picked up by 150+ customer orgs
Power users averaging 100+ minutes of practice, with steady month-over-month usage across core teams
Became a real piece of Orum's AI Coaching story — a coaching pitch tied to actual time spent practicing, not just a feature being turned on
Teams using Orum's AI-powered stack saw 55–62% increases in connects and meetings booked across large BDR organizations. One customer doubled their meetings booked with fewer reps.

Reflection

This project pushed me in a few directions I hadn't fully anticipated.

Designing for AI isn't just about the interface. I's about understanding what the model can reliably do, what it can't, and how to set user expectations honestly without undermining their confidence in the tool. Navigating those tradeoffs in real time, with real cost constraints, was one of the more genuinely novel parts of the work.
The research insight that stuck with me most was how much the framing mattered before users ever touched the product. Users who came in skeptical of AI didn't need a better feature; they needed a different story about what they were doing. Getting that right in the copy and the onboarding flow was as important as anything in the interaction design.

AI Roleplay

Orum

Overview

Jump to

Team

Documentation

Timeline

Role

Goal

Research + Discovery

User pain points and needs

To understand the problem space, our team visited several customer on-sites in NYC — observing workflows and speaking directly with reps and managers in their environment.

Strategic rationale and business impact

Competitive analysis: None of our competitors were exploring AI Roleplay within the context of an AI dialing tool, and the broader AI Roleplay space was still relatively niche — which made this a real differentiation opportunity.

Exploration + iteration

Understanding user workflows and industry best practices

Before designing anything, the team needed to ground ourselves in real sales funnel processes, trends, and best practices — so the scenarios we built would reflect realistic, high-impact situations for SDRs rather than generic practice content.

Copy considerations

Prompt architecture for AI-generated scenarios

I worked with the PM and engineering lead to design the prompt structure that powered scenario generation. Rather than leaving this to chance, we defined a repeatable framework:

​

​Identify common call types based on sales funnel stages and typical sales processes

Assign difficulty using topic complexity, likely objections, and anticipated rep questions

Define rep goals aligned with industry standards for handling each scenario type

​

Design considerations

Page navigation

I explored several approaches to organizing the AI Roleplay landing page so reps could quickly find relevant scenarios. This included grouping by scenario type, role, and difficulty level. User feedback helped determine which structure made the page feel clearest and most intuitive.

Scenario cards

Designing the scenario cards was one of the harder interaction problems on this project. The real tension was between discoverability and visual noise — too little information and reps couldn't tell scenarios apart; too much and they'd stop reading the cards entirely.

Each iteration helped clarify what was essential versus distracting. The final cards were designed to be visually appealing, quickly scannable, and accessible — with legible text, sufficient contrast, and intuitive interactions across devices.

A more guided experience — and why we moved away from it

Visual language for AI-generated content

Customization and scalability

High level flow:​​​​

The user provides a basic prompt describing the business context or scenario they need.

The tool uses Orum's company and user profile data to fill in missing details and generate a tailored output.

The user reviews the generated scenario.

The user confirms and finalizes it.

I explored several UI approaches for the creation flow, each representing a different point on the effort-vs-control spectrum.

The entry point had to work for all three — so rather than defaulting to the most complex version, I designed a single upfront choice: generate automatically or enter manually. That one decision routed users to the right level of involvement before they ever saw a form field.

Constraints with AI tooling

Analytics for managers: A post-launch insight

Quality and feedback loop

Unifying the platform: Moving AI Roleplay into the Coaching Suite

Final design

Impact

Teams using Orum's AI-powered stack saw 55–62% increases in connects and meetings booked across large BDR organizations. One customer doubled their meetings booked with fewer reps.

Reflection

Thanks for viewing :-)

Competitive analysis:
None of our competitors were exploring AI Roleplay within the context of an AI dialing tool, and the broader AI Roleplay space was still relatively niche — which made this a real differentiation opportunity.

Identify common call types based on sales funnel stages and typical sales processes

High level flow: