MicrosoftDocs
diff --git a/‎articles/microsoft-discovery/concept-azure-container-registry.md‎
Lines changed: 1 addition & 1 deletion b/‎articles/microsoft-discovery/concept-azure-container-registry.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/microsoft-discovery/concept-bookshelf-and-knowledgebases.md‎
Lines changed: 1 addition & 1 deletion b/‎articles/microsoft-discovery/concept-bookshelf-and-knowledgebases.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/microsoft-discovery/concept-bookshelf-knowledge-bases.md‎
Lines changed: 80 additions & 0 deletions b/‎articles/microsoft-discovery/concept-bookshelf-knowledge-bases.md‎
Lines changed: 80 additions & 0 deletions
diff --git a/‎articles/microsoft-discovery/concept-discovery-engine.md‎
Lines changed: 1 addition & 1 deletion b/‎articles/microsoft-discovery/concept-discovery-engine.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/microsoft-discovery/concept-projects-investigations.md‎
Lines changed: 1 addition & 1 deletion b/‎articles/microsoft-discovery/concept-projects-investigations.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/microsoft-discovery/concept-resource-provider-registration.md‎
Lines changed: 2 additions & 2 deletions b/‎articles/microsoft-discovery/concept-resource-provider-registration.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎articles/microsoft-discovery/how-to-data-handling-with-tools-agents.md‎
Lines changed: 1 addition & 1 deletion b/‎articles/microsoft-discovery/how-to-data-handling-with-tools-agents.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/microsoft-discovery/how-to-deploy-network-hardened-stack.md‎
Lines changed: 1 addition & 1 deletion b/‎articles/microsoft-discovery/how-to-deploy-network-hardened-stack.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/microsoft-discovery/how-to-manage-supercomputers.md‎
Lines changed: 4 additions & 4 deletions b/‎articles/microsoft-discovery/how-to-manage-supercomputers.md‎
Lines changed: 4 additions & 4 deletions
diff --git a/‎…create-supercomputer-nodepool-create.jpg‎ ‎…reate-supercomputer-node-pool-create.jpg‎articles/microsoft-discovery/media/how-to-manage-supercomputers/create-supercomputer-nodepool-create.jpg renamed to articles/microsoft-discovery/media/how-to-manage-supercomputers/create-supercomputer-node-pool-create.jpg b/‎…create-supercomputer-nodepool-create.jpg‎ ‎…reate-supercomputer-node-pool-create.jpg‎articles/microsoft-discovery/media/how-to-manage-supercomputers/create-supercomputer-nodepool-create.jpg renamed to articles/microsoft-discovery/media/how-to-manage-supercomputers/create-supercomputer-node-pool-create.jpg
@@ -221,5 +221,5 @@ When you develop and publish tools for Microsoft Discovery, the containerized to
 - [What is Microsoft Discovery?](overview-what-is-microsoft-discovery.md)
 - [Virtual networks and subnets in Microsoft Discovery](concept-virtual-networks.md)
 - [Role assignments in Microsoft Discovery](concept-role-assignments.md)
-- [Azure Container Registry documentation](https://learn.microsoft.com/azure/container-registry/)
+- [Azure Container Registry documentation](/azure/container-registry/)
 - [Azure Private Endpoint overview](../private-link/private-endpoint-overview.md)
@@ -35,7 +35,7 @@ Currently, the Bookshelf supports indexing unstructured (text-based) file format
 * JSON
 * CSV
 
-The Bookshelf uses Azure AI Search Enrichment to process supported file formats. Images embedded in supported file formats are processed using Azure AI Search's built-in [Vision skill](https://learn.microsoft.com/azure/ai-services/computer-vision/overview), which automatically generates alt-text for embedded images. See [Azure AI Search's documentation](https://microsoftapc.sharepoint.com/:x:/t/ProjectParagon/IQDSrYORrkMDSa9OME93rcyYAc2EV_jDqr9aD3jYTThB7Cs?e=DfOv2t)for the full list of supported file formats.
+The Bookshelf uses Azure AI Search Enrichment to process supported file formats. Images embedded in supported file formats are processed using Azure AI Search's built-in [Vision skill](/azure/ai-services/computer-vision/overview), which automatically generates alt-text for embedded images. See [Azure AI Search's documentation](https://microsoftapc.sharepoint.com/:x:/t/ProjectParagon/IQDSrYORrkMDSa9OME93rcyYAc2EV_jDqr9aD3jYTThB7Cs?e=DfOv2t)for the full list of supported file formats.
 
 The knowledge graph and vector database that results from indexing, collectively known as a Knowledge Base (KB), are stored in an Azure SQL DB in your subscription.
 
 
@@ -0,0 +1,80 @@
+---
+ms.service: azure
+ms.author: reburkea
+author: reburkea
+title: Microsoft Discovery Bookshelf & Knowledge Bases
+description: Conceptual overview of Microsoft Discovery Bookshelf service and Knowledge Bases. 
+ms.topic: concept-article
+ms.date: 03/23/2026
+---
+
+# Microsoft Discovery Bookshelf
+Microsoft Discovery includes the Bookshelf, a service that enables customers to convert their data into curated graphs known as Knowledge Bases (KBs). The key components of the Bookshelf service are the Bookshelf resource and Knowledge Bases within each Bookshelf. A Knowledge Base contains a vector database and knowledge graph of your indexed artifacts. KBs can be used by Discovery agents as grounding skills and queried by Discovery agents for various use cases, including answering questions, summarization, and reasoning.
+
+## When to use the Bookshelf
+The Bookshelf is best for reasoning over your curated, proprietary data. Knowledge Bases are especially effective when their scoped contents are thematically related and directly applicable to your Discovery workflow. For example, an Application-Specific Integrated Circuit (ASIC) design team could create a Knowledge Base with their project's hardware specifications, simulation result reports, and the latest relevant literature from the field. Querying this Knowledge Base during design workflows ensures Discovery's reasoning is grounded with previous engineering content and scientific literature. 
+
+For using data in a tool call or otherwise directly using data in Discovery, creating a Knowledge Base is often not necessary. Similarly, to search over vast repositories of data or to find resources that might be relevant to your workflow, we suggest using Azure AI Search, SharePoint Search, or similar general purpose search tools. Once you have identified the data that is most relevant to your workflow, a Knowledge Base including this curated data can help ground your Discovery workflows and derive new insights in context.
+
+## Features
+At a high level, the Bookshelf works by converting diverse file formats to text, then generating a graphical representation of that text, which can be queried using natural language.
+
+The Bookshelf uses an advanced technique developed by Microsoft Research called Graph Retrieval-Augmented Generation (GraphRAG) to transform customer data into graph-based representations and generate responses to queries. Unlike traditional RAG methods, GraphRAG-based algorithms not only create an indexed vector database of the source content but also constructs a knowledge graph that captures entity relationships within the data. Research from Microsoft demonstrates that GraphRAG delivers more accurate and comprehensive grounding information than standard RAG or vector-based techniques, leading to higher-quality responses.
+
+### Indexing
+Currently, the Bookshelf supports indexing unstructured (text-based) file formats stored in Azure Blob Storage. Supported file formats include:
+
+* Text (.txt)
+* PDF (.pdf)
+* Word (.docx)
+* PowerPoint (.pptx)
+* Excel (.xlsx)
+* Markdown
+* XML
+* HTML
+* JSON
+* CSV
+
+The Bookshelf uses Azure AI Search Enrichment to process supported file formats. Images embedded in supported file formats are processed using Azure AI Search's built-in [Vision skill](/azure/ai-services/computer-vision/overview), which automatically generates alt-text for embedded images. See [Azure AI Search's documentation](/azure/search/cognitive-search-skill-document-intelligence-layout#supported-file-formats) for the full list of supported file formats.
+
+The knowledge graph and vector database that results from indexing, collectively known as a Knowledge Base (KB), are stored in an Azure SQL DB in your subscription.
+
+### Query
+The Bookshelf provides the query function that can be invoked by any agent running on the Microsoft Discovery platform, including your own agent.
+
+## Known limitations
+
+### Unsupported file types 
+Encrypted, password-protected, or sensitivity-labeled files aren't supported for indexing. Any unsupported file types are skipped during indexing.
+
+### Cross-project sharing
+
+Bookshelves can't be shared across projects. Each project must have its own dedicated Bookshelves and Knowledge Bases.
+
+> [!NOTE]
+> The ability to share Bookshelves across projects is a planned feature for future releases.
+
+### One knowledge base per Bookshelf
+
+Each Bookshelf can only contain one Knowledge Base. However, Projects can contain many Bookshelves.
+
+> [!NOTE]
+> The ability to create multiple Knowledge Bases within the same Bookshelf is a planned feature for future releases.
+
+### Incremental indexing
+
+Incremental indexing isn't currently supported. To update Knowledge Bases, you must delete them and re-index.
+
+> [!NOTE]
+> Incremental indexing is a planned feature for future releases.
+
+### Scale 
+
+The Bookshelf currently supports Small (<200 MB of text), Medium (<500 MB of text, default size), and Large (<1 GB of text)-sized deployments. For more information on supported index sizes and the resources required to support each size, see the Bookshelf creation How To guide.
+
+### Best practices
+
+The Bookshelf is an evolving feature. Over the course of future releases, we'll improve the costs and time associated with creating Bookshelf deployments and indexing and searching over KBs. We'll also support incremental indexing and we'll take advantage of newer GPT models for search. Currently, for the best performance and to minimize costs of re-deployment, re-indexing, re-enrichment, or search, we recommend the following best practices:
+
+* Limit each Knowledge Base to Small or Medium (default)-sized deployments
+* Ensure each KB's content is thematically coherent and directly applicable to your Discovery workflow. 
@@ -119,7 +119,7 @@ When cognition executes tasks, it draws on the full Microsoft Discovery platform
 
 - **[Agents](concept-discovery-agent.md)**: Specialized AI systems that execute specific types of work. Cognition selects the agent whose capabilities best match each task. An agent is associated to the best model for the type of work required. 
 - **Tools**: Containerized executables that run on the [supercomputer](how-to-manage-supercomputers.md) for computation, data processing, and analysis. Tools handle work that requires specialized software or significant compute resources.
-- **[Bookshelf](concept-bookshelf-and-knowledgebases.md)**: Knowledge bases built from your documents and scientific literature. Agents query bookshelves to ground their reasoning in relevant context.
+- **[Bookshelf](concept-bookshelf-knowledge-bases.md)**: Knowledge bases built from your documents and scientific literature. Agents query bookshelves to ground their reasoning in relevant context.
 
 You configure these resources when you set up your workspace and project. Cognition then orchestrates them automatically based on what each task requires.
 
 
@@ -102,4 +102,4 @@ Subscription
 - [Add agents using bundles](quickstart-agents-bundles.md)
 - [Microsoft Discovery agents](concept-discovery-agent.md)
 - [Agent types in Microsoft Discovery](concept-discovery-agent-types.md)
-- [Bookshelf & Knowledge Bases](concept-bookshelf-and-knowledgebases.md)
+- [Bookshelf & Knowledge Bases](concept-bookshelf-knowledge-bases.md)
@@ -86,7 +86,7 @@ Refresh the **Resource providers** page and confirm that all the Resource Provid
 
 #### Prerequisites
 
-- [Azure CLI installed](https://learn.microsoft.com/cli/azure/install-azure-cli)
+- [Azure CLI installed](/cli/azure/install-azure-cli)
 - Authenticated to your Azure account (`az login`)
 
 #### Register the resource provider
@@ -113,7 +113,7 @@ az provider list --query "[].{Provider:namespace, Status:registrationState}" --o
 
 #### Prerequisites
 
-- [Azure PowerShell module installed](https://learn.microsoft.com/powershell/azure/install-azure-powershell)
+- [Azure PowerShell module installed](/powershell/azure/install-azure-powershell)
 - Authenticated to your Azure account (`Connect-AzAccount`)
 
 #### Register the resource provider
 
@@ -409,5 +409,5 @@ If a resource doesn't appear in your conversation, use the following steps:
 
 - [Microsoft Discovery agents](concept-discovery-agent.md)
 - [Agent types in Microsoft Discovery](concept-discovery-agent-types.md)
-- [Bookshelf and knowledge bases](concept-bookshelf-and-knowledgebases.md)
+- [Bookshelf and knowledge bases](concept-bookshelf-knowledge-bases.md)
 - [Storage assets and storage containers in Microsoft Discovery](concept-storage-account.md)
@@ -246,4 +246,4 @@ When you complete this deployment, you have:
 ## Next steps
 
 - [Configure network security](how-to-configure-network-security.md) - detailed network hardening and PE setup
-- [Bookshelf and Knowledge Bases](concept-bookshelf-and-knowledgebases.md)
+- [Bookshelf and Knowledge Bases](concept-bookshelf-knowledge-bases.md)
@@ -99,7 +99,7 @@ Nodepools define the compute capacity (VMs) attached to a Supercomputer. You can
 2. Under **Settings**, select **Nodepools**.
 3. Select **Create**.
 
-   :::image type="content" source="./media/how-to-manage-supercomputers/create-supercomputer-nodepool-create.jpg" alt-text="Screenshot of Azure portal showing Supercomputer create nodepool page." lightbox="./media/how-to-manage-supercomputers/create-supercomputer-nodepool-create.jpg":::
+   :::image type="content" source="./media/how-to-manage-supercomputers/create-supercomputer-node-pool-create.jpg" alt-text="Screenshot of Azure portal showing Supercomputer create nodepool page." lightbox="./media/how-to-manage-supercomputers/create-supercomputer-node-pool-create.jpg":::
 
 ### Configure basic settings
 
@@ -123,7 +123,7 @@ Nodepools define the compute capacity (VMs) attached to a Supercomputer. You can
 
 1. Choose a **Virtual Machine type** for the Node Pool.
 
-   :::image type="content" source="./media/how-to-manage-supercomputers/create-supercomputer-nodepool-vm-configuration.jpg" alt-text="SCreenshot of Azure portal showing Nodepool select VM type." lightbox="./media/how-to-manage-supercomputers/create-supercomputer-nodepool-vm-configuration.jpg":::
+   :::image type="content" source="./media/how-to-manage-supercomputers/create-supercomputer-node-pool-vm-configuration.jpg" alt-text="SCreenshot of Azure portal showing Nodepool select VM type." lightbox="./media/how-to-manage-supercomputers/create-supercomputer-node-pool-vm-configuration.jpg":::
 
 > [!NOTE]
 > The selected Virtual Machine type must be available and quota-approved in the selected region.
@@ -134,7 +134,7 @@ Nodepools define the compute capacity (VMs) attached to a Supercomputer. You can
 
 Specify the **maximum node count**, which defines the upper bound for automatically scaling.
 
-   :::image type="content" source="./media/how-to-manage-supercomputers/create-supercomputer-nodepool-scaling.jpg" alt-text="Screenshot of Azure portal showing Nodepool scaling options." lightbox="./media/how-to-manage-supercomputers/create-supercomputer-nodepool-scaling.jpg":::
+   :::image type="content" source="./media/how-to-manage-supercomputers/create-supercomputer-node-pool-scaling.jpg" alt-text="Screenshot of Azure portal showing Nodepool scaling options." lightbox="./media/how-to-manage-supercomputers/create-supercomputer-node-pool-scaling.jpg":::
 
 ### Create the Nodepool
 
@@ -158,7 +158,7 @@ To delete the nodepools, follow these steps:
     - Select the Supercomputer that owns the nodepool.
 3. Select the **Nodepool** under **Settings** in the left pane.
 
-   :::image type="content" source="./media/how-to-manage-supercomputers/delete-nodepool.jpg" alt-text="Screenshot of Azure portal showing nodepools." lightbox="./media/how-to-manage-supercomputers/delete-nodepool.jpg":::
+   :::image type="content" source="./media/how-to-manage-supercomputers/delete-node-pool.jpg" alt-text="Screenshot of Azure portal showing nodepools." lightbox="./media/how-to-manage-supercomputers/delete-node-pool.jpg":::
 
 4. Select the nodepool or nodepools that you want to delete and select **Delete**
 1. Wait for all the nodepools to get deleted, then navigate to the supercomputer and select the **Overview** section in the left pane