Skip to content

Commit 5f55c18

Browse files
committed
Updates for new voice-agent feature
1 parent 4d881f8 commit 5f55c18

2 files changed

Lines changed: 8 additions & 7 deletions

File tree

learn-pr/wwl-data-ai/get-started-speech-azure/includes/4-voice-live.md

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -46,13 +46,14 @@ Azure speech-to-speech solutions utilize:
4646
- **Agents or application logic** which makes decisions on responses.
4747
- **Foundry Tools or MCP servers** which can expose speech as callable tools so agents don’t manage SDKs or APIs directly.
4848

49-
>[!NOTE]
50-
> The **Azure Speech in Foundry Tools MCP server** exposes speech-to-text and text-to-speech as callable tools, allowing AI agents to work with spoken language without directly managing Azure Speech APIs. The Azure Speech MCP server makes it easier to build voice‑enabled AI agents that can listen, think, and speak.
51-
52-
You can explore *VoiceLive* in a playground in Foundry portal. The Foundry playground includes some preconfigured voice samples that you can try out, or you can create a new solution of your own. When you create a solution, importantly, you need to choose a generative AI model for your agent to use. Azure Speech Voice Live uses the generative AI model *alongside* its own acoustic models to have a live conversation with the user. You can configure many settings in the playground. For example, you can enable proactive engagement, so the agent can initiate conversations.
49+
You can explore *Voice Live* in a playground in Foundry portal. The Foundry playground includes some preconfigured voice samples that you can try out, or you can create a new solution of your own. When you create a solution, importantly, you need to choose a generative AI model for your agent to use. Azure Speech Voice Live uses the generative AI model *alongside* its own acoustic models to have a live conversation with the user. You can configure many settings in the playground. For example, you can enable proactive engagement, so the agent can initiate conversations.
5350

5451
:::image type="content" source="../media/voice-live-playground.png" alt-text="Screenshot of voice live in the Foundry playground." lightbox="../media/voice-live-playground.png":::
5552

53+
You can also enable **Voice mode** for a Microsoft Foundry agent in the playground, which integrates Azure Speech Voice Live into the agent definition. This approach means that speech configuration is encapsulated in the agent itself, reducing the client code required to use it.
54+
55+
:::image type="content" source="../media/voice-live-agent.png" alt-text="Screenshot of a voice-enabled agent." lightbox="../media/voice-live-agent.png":::
56+
5657
## Using Voice Live in an application
5758

5859
To develop a custom app that uses the agent, we need to write some code. To create an application in Python, you need the `azure-ai-voicelive` package.
@@ -66,14 +67,14 @@ pip install azure-ai-voicelive
6667
>[!NOTE]
6768
> You also need to install `pyaudio`, `python-dotenv`, and `azure-identity` in order to run your Voice Live application.
6869
69-
You can find sample code for a speech-to-speech application in the Foundry playground. The sample code handles all of the logic needed to initiate the session, connect to audio devices like mics and speakers, process the incoming and outgoing streams of audio, handle interruptions, and so on. The sample code is a good starting point for building your own application.
70+
You can find sample code for a speech-to-speech application in the Foundry portal. The sample code handles all of the logic needed to initiate the session, connect to audio devices like mics and speakers, process the incoming and outgoing streams of audio, handle interruptions, and so on. The sample code is a good starting point for building your own application.
7071

7172
:::image type="content" source="../media/voice-live-playground-code.png" alt-text="Screenshot of voice live in the Foundry playground sample code." lightbox="../media/voice-live-playground-code.png":::
7273

73-
You can take the sample code into your own code editor and install the proper packages. When you run the application, a real‑time voice assistant streams your microphone audio to Azure VoiceLive, receives the assistant’s spoken audio response back, and plays it through your speakers.
74+
You can take the sample code into your own code editor and install the proper packages. When you run the application, a real‑time voice assistant streams your microphone audio to Azure Voice Live, receives the assistant’s spoken audio response back, and plays it through your speakers.
7475

7576
![Screenshot of the result of the VoiceLive Python script.](../media/voice-live-result.png)
7677

77-
VoiceLive in Azure Speech offers an effective way to build speech-capable conversational agents that engage naturally with users. Next, try out Azure Speech - Voice Live in Foundry yourself.
78+
Voice Live in Azure Speech offers an effective way to build speech-capable conversational agents that engage naturally with users. Next, try out Azure Speech - Voice Live in Foundry yourself.
7879

7980
::: zone-end
469 KB
Loading

0 commit comments

Comments
 (0)