You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-**Agents or application logic** which makes decisions on responses.
47
47
-**Foundry Tools or MCP servers** which can expose speech as callable tools so agents don’t manage SDKs or APIs directly.
48
48
49
-
>[!NOTE]
50
-
> The **Azure Speech in Foundry Tools MCP server** exposes speech-to-text and text-to-speech as callable tools, allowing AI agents to work with spoken language without directly managing Azure Speech APIs. The Azure Speech MCP server makes it easier to build voice‑enabled AI agents that can listen, think, and speak.
51
-
52
-
You can explore *VoiceLive* in a playground in Foundry portal. The Foundry playground includes some preconfigured voice samples that you can try out, or you can create a new solution of your own. When you create a solution, importantly, you need to choose a generative AI model for your agent to use. Azure Speech Voice Live uses the generative AI model *alongside* its own acoustic models to have a live conversation with the user. You can configure many settings in the playground. For example, you can enable proactive engagement, so the agent can initiate conversations.
49
+
You can explore *Voice Live* in a playground in Foundry portal. The Foundry playground includes some preconfigured voice samples that you can try out, or you can create a new solution of your own. When you create a solution, importantly, you need to choose a generative AI model for your agent to use. Azure Speech Voice Live uses the generative AI model *alongside* its own acoustic models to have a live conversation with the user. You can configure many settings in the playground. For example, you can enable proactive engagement, so the agent can initiate conversations.
53
50
54
51
:::image type="content" source="../media/voice-live-playground.png" alt-text="Screenshot of voice live in the Foundry playground." lightbox="../media/voice-live-playground.png":::
55
52
53
+
You can also enable **Voice mode** for a Microsoft Foundry agent in the playground, which integrates Azure Speech Voice Live into the agent definition. This approach means that speech configuration is encapsulated in the agent itself, reducing the client code required to use it.
54
+
55
+
:::image type="content" source="../media/voice-live-agent.png" alt-text="Screenshot of a voice-enabled agent." lightbox="../media/voice-live-agent.png":::
56
+
56
57
## Using Voice Live in an application
57
58
58
59
To develop a custom app that uses the agent, we need to write some code. To create an application in Python, you need the `azure-ai-voicelive` package.
> You also need to install `pyaudio`, `python-dotenv`, and `azure-identity` in order to run your Voice Live application.
68
69
69
-
You can find sample code for a speech-to-speech application in the Foundry playground. The sample code handles all of the logic needed to initiate the session, connect to audio devices like mics and speakers, process the incoming and outgoing streams of audio, handle interruptions, and so on. The sample code is a good starting point for building your own application.
70
+
You can find sample code for a speech-to-speech application in the Foundry portal. The sample code handles all of the logic needed to initiate the session, connect to audio devices like mics and speakers, process the incoming and outgoing streams of audio, handle interruptions, and so on. The sample code is a good starting point for building your own application.
70
71
71
72
:::image type="content" source="../media/voice-live-playground-code.png" alt-text="Screenshot of voice live in the Foundry playground sample code." lightbox="../media/voice-live-playground-code.png":::
72
73
73
-
You can take the sample code into your own code editor and install the proper packages. When you run the application, a real‑time voice assistant streams your microphone audio to Azure VoiceLive, receives the assistant’s spoken audio response back, and plays it through your speakers.
74
+
You can take the sample code into your own code editor and install the proper packages. When you run the application, a real‑time voice assistant streams your microphone audio to Azure Voice Live, receives the assistant’s spoken audio response back, and plays it through your speakers.
74
75
75
76

76
77
77
-
VoiceLive in Azure Speech offers an effective way to build speech-capable conversational agents that engage naturally with users. Next, try out Azure Speech - Voice Live in Foundry yourself.
78
+
Voice Live in Azure Speech offers an effective way to build speech-capable conversational agents that engage naturally with users. Next, try out Azure Speech - Voice Live in Foundry yourself.
0 commit comments