
Commit c183890

Merge branch 'main' into reliability-bastion-edits
2 parents f019cfd + 1d08857 commit c183890

652 files changed

Lines changed: 12088 additions & 5204 deletions


.openpublishing.publish.config.json

Lines changed: 6 additions & 0 deletions
@@ -632,6 +632,12 @@
       "branch": "main",
       "branch_mapping": {}
     },
+    {
+      "path_to_root": "app-service-agentic-langgraph-foundry-python",
+      "url": "https://github.com/Azure-Samples/app-service-agentic-langgraph-foundry-python",
+      "branch": "main",
+      "branch_mapping": {}
+    },
     {
       "path_to_root": "app-service-agentic-semantic-kernel-java",
       "url": "https://github.com/Azure-Samples/app-service-agentic-semantic-kernel-java",

.openpublishing.redirection.json

Lines changed: 10 additions & 0 deletions
@@ -7152,7 +7152,17 @@
     {
       "source_path": "articles/reliability/migrate-storage.md",
       "redirect_url": "/azure/storage/common/redundancy-migration",
+      "redirect_document_id": false
+    },
+    {
+      "source_path": "articles/reliability/reliability-operator-nexus.md",
+      "redirect_url": "/azure/reliability/overview-reliability-guidance",
+      "redirect_document_id": false
     }
+
+
+
+
   ]
 }

articles/active-directory-b2c/service-limits.md

Lines changed: 10 additions & 4 deletions
@@ -8,7 +8,7 @@ manager: CelesteDG
 ms.service: azure-active-directory

 ms.topic: reference
-ms.date: 07/29/2025
+ms.date: 08/19/2025
 ms.subservice: b2c
 zone_pivot_groups: b2c-policy-type

@@ -169,14 +169,20 @@ The following table lists the administrative configuration limits in the Azure A
 |Number of sign-out URLs per application  |1 |
 |String Limit per Attribute |250 Chars |
 |Number of B2C tenants per subscription |20 |
-|Total number of objects (user accounts and applications) per tenant (default limit)|1.25 million |
-|Total number of objects (user accounts and applications) per tenant (using a verified custom domain). If you want to increase this limit, please contact [Microsoft Support](find-help-open-support-ticket.md).|5.25 million |
+|Number of objects (user accounts and applications) per tenant (default limit) <sup>2</sup>|1.25 million|
+|Number of objects (user accounts and applications) per tenant (using a verified custom domain) <sup>3</sup>. If you want to increase this limit, please contact [Microsoft Support](find-help-open-support-ticket.md).|5.25 million|
+|Number of objects per tenant for Japan Go-Local Azure AD B2C tenants (default limit) <sup>4</sup>|310K|
+|Number of objects per tenant for Japan Go-Local Azure AD B2C tenants (using a verified custom domain) <sup>5</sup>. If you want to increase this limit, please contact [Microsoft Support](find-help-open-support-ticket.md).|570K|
 |Levels of [inheritance](custom-policy-overview.md#inheritance-model) in custom policies |10 |
 |Number of policies per Azure AD B2C tenant (user flows + custom policies) |200 |
 |Maximum policy file size |1024 KB |
 |Number of API connectors per tenant |20 |

-<sup>1</sup> See also [Microsoft Entra service limits and restrictions](../active-directory/enterprise-users/directory-service-limits-restrictions.md).
+- <sup>1</sup> See also [Microsoft Entra service limits and restrictions](../active-directory/enterprise-users/directory-service-limits-restrictions.md).
+- <sup>2</sup> 1M user accounts and 250K applications.
+- <sup>3</sup> 5M user accounts and 250K applications.
+- <sup>4</sup> 60K user accounts and 250K applications.
+- <sup>5</sup> 320K user accounts and 250K applications.

 ## Region specific service limits

articles/api-management/TOC.yml

Lines changed: 2 additions & 0 deletions
@@ -252,6 +252,8 @@
       href: azure-openai-enable-semantic-caching.md
     - name: Authenticate and authorize to Azure OpenAI
       href: api-management-authenticate-authorize-azure-openai.md
+    - name: Log LLM tokens, requests, and responses
+      href: api-management-howto-llm-logs.md
     - name: Manage MCP servers
       items:
       - name: MCP server capabilities
articles/api-management/api-management-howto-llm-logs.md

Lines changed: 124 additions & 0 deletions

@@ -0,0 +1,124 @@
---
title: Set Up Logging for LLM APIs in Azure API Management
titleSuffix: Azure API Management
description: Enable logging for LLM APIs in Azure API Management to track token usage, prompts, and completions for billing and auditing.
#customer intent: As a system administrator, I want to enable logging of LLM request and response messages so that I can track API interactions for billing or auditing purposes.
author: dlepow
ms.service: azure-api-management
ms.topic: how-to
ms.date: 08/22/2025
ms.author: danlep
ai-usage: ai-assisted
ms.collection: ce-skilling-ai-copilot
ms.custom:
---

# Log token usage, prompts, and completions for LLM APIs

In this article, you learn how to set up Azure Monitor logging for LLM API requests and responses in Azure API Management.

The API Management administrator can use LLM API request and response logs along with API Management gateway logs for scenarios such as the following:

* **Calculate usage for billing** - Calculate usage metrics for billing based on the number of tokens consumed by each application or API consumer (for example, segmented by subscription ID or IP address).

* **Inspect messages** - Inspect and analyze prompts and completions to help with debugging, auditing, and model evaluation.

Learn more about:

* [AI gateway capabilities in API Management](genai-gateway-capabilities.md)
* [Monitoring API Management](monitor-api-management.md)

## Prerequisites

- An Azure API Management instance.
- A managed LLM chat completions API integrated with Azure API Management. For example, [Import an Azure AI Foundry API](azure-ai-foundry-api.md).
- Access to an Azure Log Analytics workspace.
- Appropriate permissions to configure diagnostic settings and access logs in API Management.

## Enable diagnostic setting for LLM API logs

Enable a diagnostic setting to log requests that the gateway processes for large language model REST APIs. For each request, Azure Monitor receives data about token usage (prompt tokens, completion tokens, and total tokens), the name of the model used, and optionally the request and response messages (prompt and completion). Large requests and responses are split into multiple log entries with sequence numbers for later reconstruction if needed.

The following are brief steps to enable a diagnostic setting that directs LLM API logs to a Log Analytics workspace. For more information, see [Enable diagnostic setting for Azure Monitor logs](monitor-api-management.md#enable-diagnostic-setting-for-azure-monitor-logs).

1. In the [Azure portal](https://portal.azure.com), navigate to your Azure API Management instance.
1. In the left menu, under **Monitoring**, select **Diagnostic settings** > **+ Add diagnostic setting**.
1. Configure the setting to send AI gateway logs to a Log Analytics workspace:
    - Under **Logs**, select **Logs related to generative AI gateway**.
    - Under **Destination details**, select **Send to Log Analytics workspace**.
1. Review or configure other settings and make changes if needed.
1. Select **Save**.

:::image type="content" source="media/api-management-howto-llm-logs/diagnostic-setting.png" alt-text="Screenshot of diagnostic setting for AI gateway logs in the portal.":::

## Enable logging of requests or responses for LLM API

You can enable diagnostic settings for all APIs or customize logging for specific APIs. The following are brief steps to log both LLM request and response messages for an API. For more information, see [Modify API logging settings](monitor-api-management.md#modify-api-logging-settings).

1. In the left menu of your API Management instance, select **APIs > APIs** and then select the name of the API.
1. Select the **Settings** tab from the top bar.
1. Scroll down to the **Diagnostic Logs** section, and select the **Azure Monitor** tab.
1. In **Log LLM messages**, select **Enabled**.
1. Select **Log prompts** and enter a size in bytes, such as *32768*.
1. Select **Log completions** and enter a size in bytes, such as *32768*.
1. Review other settings and make changes if needed. Select **Save**.

:::image type="content" source="media/api-management-howto-llm-logs/enable-llm-api-logging.png" alt-text="Screenshot of enabling LLM logging for an API in the portal.":::

> [!NOTE]
> If you enable collection, LLM request or response messages up to 32 KB in size are sent in a single entry. Messages larger than 32 KB are split and logged in 32 KB chunks with sequence numbers for later reconstruction. Request messages and response messages can't exceed 2 MB each.
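After you send a few requests to the LLM API, you can confirm that entries are arriving in the Log Analytics workspace. The following is a minimal sketch; `CorrelationId` is described later in this article, while `DeploymentName`, `PromptTokens`, `CompletionTokens`, and `TotalTokens` are assumed column names based on the data the log is described as carrying. Confirm them against the [ApiManagementGatewayLlmLog](/azure/azure-monitor/reference/tables/apimanagementgatewayllmlog) reference.

```Kusto
// Show the most recent LLM log entries with their token counts
ApiManagementGatewayLlmLog
| where TimeGenerated > ago(1h)
| project TimeGenerated, CorrelationId, DeploymentName, PromptTokens, CompletionTokens, TotalTokens
| take 10
```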
## Review analytics workbook for LLM APIs

The Azure Monitor-based **Analytics** dashboard provides insights into LLM API usage and token consumption using data aggregated in a Log Analytics workspace. [Learn more](monitor-api-management.md#get-api-analytics-in-azure-api-management) about Analytics in API Management.

1. In the left menu of your API Management instance, select **Monitoring** > **Analytics**.
1. Select the **Language models** tab.
1. Review metrics and visualizations for LLM API token consumption and requests in a selected **Time range**.

:::image type="content" source="media/api-management-howto-llm-logs/analytics-workbook-small.png" alt-text="Screenshot of analytics for language model APIs in the portal." lightbox="media/api-management-howto-llm-logs/analytics-workbook.png":::
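If you want a token-consumption view outside the built-in workbook, a Log Analytics query can approximate it. The following sketch uses the same assumed column names as elsewhere in this article (`TotalTokens`, `DeploymentName`); the max-per-`CorrelationId` step guards against counting a chunked request more than once.

```Kusto
// Hourly token consumption per model deployment over the past week
ApiManagementGatewayLlmLog
| where TimeGenerated > ago(7d)
// Take one token count per request; chunked entries share a CorrelationId
| summarize TokensPerRequest = max(TotalTokens), Timestamp = min(TimeGenerated) by CorrelationId, DeploymentName
| summarize TotalTokens = sum(TokensPerRequest) by bin(Timestamp, 1h), DeploymentName
| render timechart
```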
## Review Azure Monitor logs for requests and responses

Review the [ApiManagementGatewayLlmLog](/azure/azure-monitor/reference/tables/apimanagementgatewayllmlog) log for details about LLM requests and responses, including token consumption, model deployment used, and other details over specific time ranges.

Requests and responses (including chunked messages for large requests and responses) appear in separate log entries that you can correlate by using the `CorrelationId` field.
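If you log large prompts or completions, you might first need to reassemble messages that were split into 32 KB chunks. Here's a hedged sketch of one way to do that; it assumes the chunk order is exposed in a sequence-number column (called `SequenceNumber` here), so verify the actual column name in the table reference.

```Kusto
// Reassemble chunked request messages for each request
ApiManagementGatewayLlmLog
| where isnotempty(RequestMessages)
// Sort first so that make_list() receives the chunks in order
| sort by CorrelationId asc, SequenceNumber asc
| summarize FullRequest = strcat_array(make_list(RequestMessages), "") by CorrelationId
```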
For auditing purposes, use a Kusto query similar to the following query to join each request and response in a single record. Adjust the query to include the fields that you want to track.

```Kusto
ApiManagementGatewayLlmLog
| extend RequestArray = parse_json(RequestMessages)
| extend ResponseArray = parse_json(ResponseMessages)
| mv-expand RequestArray
| mv-expand ResponseArray
| project
    CorrelationId,
    RequestContent = tostring(RequestArray.content),
    ResponseContent = tostring(ResponseArray.content)
| summarize
    Input = strcat_array(make_list(RequestContent), " . "),
    Output = strcat_array(make_list(ResponseContent), " . ")
    by CorrelationId
| where isnotempty(Input) and isnotempty(Output)
```
:::image type="content" source="media/api-management-howto-llm-logs/llm-log-query-small.png" alt-text="Screenshot of query results for LLM logs in the portal." lightbox="media/api-management-howto-llm-logs/llm-log-query.png":::
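The same log can drive the billing scenario described at the start of this article. The following sketch attributes token consumption to API Management subscriptions by joining to the gateway request log on `CorrelationId`. It assumes the gateway log table is [ApiManagementGatewayLogs](/azure/azure-monitor/reference/tables/apimanagementgatewaylogs) and that it exposes the caller's subscription in an `ApimSubscriptionId` column, so confirm both names before relying on the numbers.

```Kusto
// Approximate token consumption per API Management subscription, last 30 days
ApiManagementGatewayLlmLog
| where TimeGenerated > ago(30d)
// One token count per request; chunked entries share a CorrelationId
| summarize TokensPerRequest = max(TotalTokens) by CorrelationId
| join kind=inner (
    ApiManagementGatewayLogs
    | project CorrelationId, ApimSubscriptionId
) on CorrelationId
| summarize TotalTokens = sum(TokensPerRequest) by ApimSubscriptionId
| order by TotalTokens desc
```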
## Upload data to Azure AI Foundry for model evaluation

You can export LLM logging data as a dataset for [model evaluation](/azure/ai-foundry/concepts/observability) in Azure AI Foundry. With model evaluation, you can assess the performance of your generative AI models and applications against a test model or dataset using built-in or custom evaluation metrics.

To use LLM logs as a dataset for model evaluation:

1. Join LLM request and response messages into a single record for each interaction, as shown in the [previous section](#review-azure-monitor-logs-for-requests-and-responses). Include the fields you want to use for model evaluation.
1. Export the dataset to CSV format, which is compatible with Azure AI Foundry.
1. In the Azure AI Foundry portal, create a new evaluation to upload and evaluate the dataset.

For details about how to create and run a model evaluation in Azure AI Foundry, see [Evaluate generative AI models and applications by using Azure AI Foundry](/azure/ai-foundry/how-to/evaluate-generative-ai-app).

## Related content

* [Learn more about monitoring API Management](monitor-api-management.md)
* [Azure Monitor reference for API Management](monitor-api-management-reference.md)
* [Tutorial: Monitor published APIs](api-management-howto-use-azure-monitor.md)

articles/api-management/genai-gateway-capabilities.md

Lines changed: 3 additions & 3 deletions
@@ -109,14 +109,14 @@ In API Management, enable semantic caching by using Azure Redis Enterprise, Azur

 ## Logging token usage, prompts, and completions

-Enable a [diagnostic setting](monitor-api-management.md#enable-diagnostic-setting-for-azure-monitor-logs) in your API Management instance to log requests processed by the gateway for large language model REST APIs. For each request, data is sent to Azure Monitor including token usage (prompt tokens, completion tokens, and total tokens), name of the model used, and optionally the request and response messages (prompt and completion). Large requests and responses are split into multiple log entries that are sequentially numbered for later reconstruction if needed.
+You can enable logging for requests processed by the gateway for large language model REST APIs. For each request, data is sent to Azure Monitor including token usage (prompt tokens, completion tokens, and total tokens), name of the model used, and optionally the request and response messages (prompt and completion). Large requests and responses are split into multiple log entries that are sequentially numbered for later reconstruction if needed.

 The API Management administrator can use LLM gateway logs along with API Management gateway logs for scenarios such as the following:

 * **Calculate usage for billing** - Calculate usage metrics for billing based on the number of tokens consumed by each application or API consumer (for example, segmented by subscription ID or IP address).
 * **Inspect messages** - To help with debugging or auditing, inspect and analyze prompts and completions.

-Learn more about [monitoring API Management with Azure Monitor](monitor-api-management.md).
+Learn more: [Log token usage, prompts, and completions for LLM APIs](api-management-howto-llm-logs.md)

 ## Content safety policy

@@ -135,7 +135,7 @@ To help safeguard users from harmful, offensive, or misleading content, you can

 * [AI gateway reference architecture using API Management](/ai/playbook/technology-guidance/generative-ai/dev-starters/genai-gateway/reference-architectures/apim-based)
 * [AI hub gateway landing zone accelerator](https://github.com/Azure-Samples/ai-hub-gateway-solution-accelerator)
-* [Designing and implementing a gateway solution with Azure OpenAI resources](/ai/playbook/technology-guidance/generative-ai/dev-starters/genai-gateway/)
+* [Designing and implementing a gateway solution with Azure OpenAI resources](/ai/playbook/technology-guidance/generative-ai/dev-starters/genai-gateway/)
 * [Use a gateway in front of multiple Azure OpenAI deployments or instances](/azure/architecture/ai-ml/guide/azure-openai-gateway-multi-backend)

 ## Related content
Four binary image files added under media/api-management-howto-llm-logs/ (112 KB, 112 KB, 88.7 KB, 69.6 KB); previews not shown.
