Merge pull request #13873 from MicrosoftDocs/main

learn-build-service-prod[bot] · web-flow · commit c544936f0f4c · 2026-04-12T11:01:56.000Z
Auto Publish – main to live - 2026-04-12 11:00 UTC
diff --git a/docs/graph/design-graph-schema.md b/docs/graph/design-graph-schema.md
@@ -0,0 +1,140 @@
+---
+title: Design a Graph Schema for Graph in Microsoft Fabric
+description: Learn best practices for designing a graph schema in Microsoft Fabric, including how to choose node types, edge types, key columns, and properties.
+ms.date: 04/10/2026
+ms.topic: how-to
+ms.reviewer: wangwilliam
+ai-usage: ai-assisted
+---
+
+# Design a graph schema in Microsoft Fabric
+
+[!INCLUDE [feature-preview](./includes/feature-preview-note.md)]
+
+A graph schema is the collection of node types, edge types, and their properties that define the structure of your graph. A well-designed graph schema makes your data easier to query, maintain, and extend. This article provides best practices for turning tabular data in a lakehouse into an effective [labeled property graph](graph-data-models.md) in Microsoft Fabric.
+
+Use these guidelines before you start modeling in the graph model editor. For step-by-step instructions on creating nodes and edges, see the [graph tutorial](tutorial-introduction.md). Examples in this article use the [Adventure Works sample dataset](sample-datasets.md).
+
+> [!IMPORTANT]
+> Graph currently doesn't support schema evolution. After you model your data, the structure of nodes, edges, and properties is fixed. Structural changes, such as adding properties, modifying labels, or changing relationship types, require you to create a new graph model and reload all data. This process takes time and consumes capacity, so plan your schema thoroughly before you start modeling.
+
+## Prerequisites
+
+- A [Fabric workspace](../fundamentals/create-workspaces.md) with a lakehouse that contains your source tables.
+- Familiarity with the [graph model editor](tutorial-introduction.md).
+- Optional: The [Adventure Works sample dataset](sample-datasets.md) to follow the examples in this article.
+
+## Understand node types and edge types
+
+Before you design a schema, understand these core concepts:
+
+A **node type** defines a kind of entity in your graph, such as a customer, product, or order. It consists of:
+
+- A **label**, which is the name that identifies this category of node. For example, `Customer`. You use the label in queries to refer to nodes of this type.
+- A **mapping table**, which is the lakehouse table that provides the source data for the node type. For example, the *adventureworks_customers* table.
+- A **key column** that uniquely identifies each node (labeled **ID** in the graph model editor). For example, `CustomerID_K`.
+- **Properties**, which are columns from the table that become attributes on each node. For example, `FirstName`, `LastName`, and `EmailAddress`.
+
+A **node** is an individual instance of a node type: one row in the mapping table. For example, each row in *adventureworks_customers* becomes a `Customer` node.
+
+An **edge type** defines a kind of relationship between two node types. It consists of:
+
+- A **label**, which is the name that identifies this category of relationship. For example, `purchases`.
+- A **mapping table** that contains the relationship data between the source and target nodes. For example, the *adventureworks_orders* table.
+- A **source node type** and a **target node type** that the edge connects. For example, `Customer` as the source and `Order` as the target.
+
+An **edge** is an individual instance of an edge type: one row in the mapping table that connects two specific nodes.
+
+> [!NOTE]
+> In the graph model editor, the **Add node** and **Add edge** buttons create node types and edge types, not individual nodes or edges.
+
+## Identify entities and relationships
+
+Start by identifying the *entities* (things) and *relationships* (connections) in your data. Entities become node types. Connections between entities become edge types.
+
+Ask these questions about your source tables:
+
+- **What are the primary entities?** Rows that represent distinct real-world things are candidates for node types. For example, customers, products, orders, and employees.
+- **How do these entities relate to each other?** Columns that reference rows in another table (foreign keys) suggest edge types. For example, `CustomerID_FK` in an `orders` table points to the `customers` table, which suggests modeling a `purchases` edge.
+- **Are there embedded entities?** A column inside a table might represent a distinct entity worth extracting into its own node type. For an example, see [Choose node types](#choose-node-types). For a step-by-step walkthrough, see [Add multiple node and edge types from one mapping table](tutorial-model-node-edge-from-same-table.md).
+
+## Choose node types
+
+Create a node type for each entity that you need to query or traverse independently. Use these guidelines:
+
+| Make the entity a **node type** when... | Keep it as a **property** when... |
+| --- | --- |
+| You need to traverse to or through it. | It's descriptive metadata you only read, not traverse. |
+| Multiple entities share a relationship with it. | It's unique to the entity it belongs to. |
+| You need to match or group by it directly in queries. | You only filter by it as a property of another entity. |
+
+**Example:** In the Adventure Works dataset, `Country` starts as a column on the `employees` table. If you need to query "which employees live in the same country?" or "which countries have the most employees?", extract `Country` into its own node type. If you only need to display an employee's country as a descriptive value, leave it as a property.
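+
+As a rough sketch of the second question, the following GQL query counts employees per country once `Country` is its own node type. It assumes an `Employee` node type, a `livesIn` edge type, and a `Country` property on the `Country` node type. The property name is hypothetical, so adjust it to match your model.
+
+```gql
+// Sketch only: `c.Country` is an assumed property name.
+MATCH (e:Employee)-[:livesIn]->(c:Country)
+LET countryName = c.Country
+RETURN countryName, count(*) AS employeeCount
+GROUP BY countryName
+ORDER BY employeeCount DESC
+```
+
+With `Country` left as a property on `Employee`, there's no `Country` node to match or traverse to.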
+
+## Choose key columns
+
+Every node type requires a key column (or compound key) that uniquely identifies each node. Choose keys carefully:
+
+- **Use existing unique identifiers** from your source tables. For example, `CustomerID_K` or `ProductID_K`.
+- **Avoid surrogate keys that lack business meaning** unless no natural key exists. For example, prefer `CustomerID` over an auto-incrementing row number.
+- **Use compound keys** when a single column doesn't guarantee uniqueness. For example, a `ProductVersion` node might need both `ProductID` and `VersionNumber` as its key.
+- **Match data types** between key columns and the foreign key columns used in edge mappings. Mismatched types cause edge creation failures.
+
+> [!TIP]
+> Define [node key constraints](gql-graph-types.md#set-up-node-key-constraints) to enable the query engine to perform direct lookups on key properties. This optimization speeds up queries that look up specific nodes by key.
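+
+For illustration, a lookup by key might look like the following GQL query. The label and property names come from the examples earlier in this article. The key value `11000` is a made-up placeholder, and the sketch assumes `CustomerID_K` is stored as a numeric type.
+
+```gql
+// Sketch only: 11000 is a placeholder key value.
+MATCH (c:Customer)
+WHERE c.CustomerID_K = 11000
+RETURN c.FirstName, c.LastName, c.EmailAddress
+```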
+
+## Choose edge types
+
+Edge types define the relationships between node types. Each edge type connects a source node type to a target node type through a mapping table.
+
+Follow these guidelines:
+
+- **Use descriptive labels** that read as verbs or verb phrases. For example, `purchases`, `sells`, `livesIn`, and `belongsTo`. A well-named edge makes queries easier to read.
+- **Consider direction carefully.** Edges in graph are directed. Choose the direction that best represents the real-world relationship. For example, `Customer` --*purchases*--> `Order` reads more naturally than `Order` --*purchasedBy*--> `Customer`.
+- **Give distinct names to edge types that connect different node type pairs.** If both "employee sells order" and "customer purchases order" connect to `Order`, name them `sells` and `purchases` rather than giving both the same label. For more information, see [edge creation limitations](limitations.md#edge-creation).
+- **Add properties to edge types** when the relationship itself has attributes. For example, a `quantity` on a `contains` edge or an `orderDate` on a `purchases` edge.
+
+> [!IMPORTANT]
+> The mapping table for an edge must contain columns that match the key columns of both the source and target node types in values and data type. Tables that you use to create node types can also serve as edge mapping tables if they meet this requirement.
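+
+The following GQL sketch shows how these naming and direction choices surface in a query. It assumes the `sells` and `purchases` edge types described in this section and an `orderDate` property on `purchases`. Treat the names as illustrative rather than as the exact Adventure Works model.
+
+```gql
+// Sketch only: assumes an `orderDate` property exists on the `purchases` edge type.
+MATCH (e:Employee)-[:sells]->(o:Order)<-[p:purchases]-(c:Customer)
+RETURN c.FirstName, c.LastName, p.orderDate
+```
+
+Because the two edge types have distinct labels and a clear direction, the pattern reads close to the sentence "an employee sells an order that a customer purchases."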
+
+## Remove unnecessary properties
+
+When you create a node type from a mapping table, every column in the table becomes a property by default. Remove properties that you don't need for queries or analysis.
+
+Excessive properties increase storage, slow queries, and make the graph harder to maintain. For each node type, keep only properties that are:
+
+- Required for the uniqueness of the node (key columns)
+- Used in `WHERE` filters or `RETURN` projections in your queries
+- Needed for downstream analysis or visualization
+
+For more information on how property count affects query performance, see [Return only the properties you need](gql-query-performance.md#return-only-the-properties-you-need).
+
+## Choose data types
+
+Select the most specific data type for each property. The right types improve both storage efficiency and query performance:
+
+- Use `INT` or `UINT64` for numeric identifiers and counts. Numeric comparisons are faster than string comparisons.
+- Use `ZONED DATETIME` for timestamps instead of string-formatted dates.
+- Use `BOOLEAN` for true/false flags instead of string values like `"yes"` or `"no"`.
+
+For the full list of supported types, see [Current limitations — Data types](limitations.md#data-types).
+
+## Common tabular-to-graph patterns
+
+The following table summarizes how some common tabular data structures translate to graph elements:
+
+| Tabular structure | Graph result | Example |
+| --- | --- | --- |
+| **One-to-many:** Parent table + child table with foreign key | Two node types connected by an edge type. | `Customer` --*purchases*--> `Order` |
+| **Many-to-many:** Junction table linking two tables | Edge type between two node types. | `Vendor` --*produces*--> `Product` |
+| **Embedded entity:** Column representing a shared entity | Extracted node type with edge. | `Employee` --*livesIn*--> `Country` |
+| **Hierarchy:** Chain of parent-child tables | Node types linked by edges at each level. | `Product` --*isOfType*--> `Subcategory` --*belongsTo*--> `Category` |
+
+For a step-by-step walkthrough of the embedded entity pattern, see [Add multiple node and edge types from one mapping table](tutorial-model-node-edge-from-same-table.md).
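+
+As an example of the hierarchy pattern, the following GQL sketch walks from products up to their categories in a single pattern. The `SubcategoryName` and `CategoryName` properties are assumed names for illustration, so substitute the properties in your own model.
+
+```gql
+// Sketch only: `SubcategoryName` and `CategoryName` are assumed property names.
+MATCH (p:Product)-[:isOfType]->(s:Subcategory)-[:belongsTo]->(c:Category)
+RETURN p.ProductID_K, s.SubcategoryName, c.CategoryName
+```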
+
+## Related content
+
+- [Tutorial: Introduction to graph](tutorial-introduction.md)
+- [GQL graph types](gql-graph-types.md)
+- [Optimize GQL query performance](gql-query-performance.md)
+- [Labeled property graphs](graph-data-models.md)
+- [Current limitations](limitations.md)
diff --git a/docs/graph/gql-query-performance.md b/docs/graph/gql-query-performance.md
@@ -2,7 +2,7 @@
 title: Optimize GQL Query Performance for graph in Microsoft Fabric
 description: Learn how to write efficient GQL queries for graph in Microsoft Fabric. Apply filtering, traversal, and key constraint strategies to improve query performance.
 ms.topic: how-to
-ms.date: 03/12/2026
+ms.date: 04/10/2026
 ms.reviewer: splantikow
 ---
 
@@ -12,7 +12,7 @@ ms.reviewer: splantikow
 
 This article provides guidance for writing GQL (Graph Query Language) queries that perform predictably and efficiently when working with graph in Microsoft Fabric. The recommendations are based on current platform behavior and documented constraints.
 
-For hard limits on graph size, result size, and query timeout, see [Current limitations](limitations.md).
+For hard limits on graph size, result size, and query timeout, see [Current limitations](limitations.md). Several recommendations in this article also relate to how you design your graph schema. For more information, see [Design a graph schema](design-graph-schema.md).
 
 ## Filter early in patterns
 
diff --git a/docs/graph/graph-data-models.md b/docs/graph/graph-data-models.md
@@ -4,7 +4,7 @@ description: Learn how the Labeled Property Graph (LPG) model in graph in Micros
 #customer intent: As a data professional, I want to understand the labeled property graph model used by graph in Microsoft Fabric so that I can effectively model my connected data.
 ai-usage: ai-assisted
 ms.topic: concept-article
-ms.date: 03/31/2026
+ms.date: 04/10/2026
 ms.reviewer: wangwilliam
 ---
 
@@ -40,10 +40,14 @@ For most customers, LPG provides the best balance of performance, usability, and
 - **Simplicity and intuitiveness:** Nodes and edges map closely to how people think about networks. LPG is less complex than RDF. You don't need to define ontologies or manage global identifiers.
 - **Properties on edges:** Model weighted, temporal, or labeled relationships on edges. This feature supports advanced analytics like recommendations and fraud detection.
 - **Performance and storage efficiency:** LPG-based graph databases store data compactly and enable fast traversals, even for large, complex graphs.
-- **Flexible schema:** Evolve your graph model as your business needs change, without rigid constraints.
+- **Flexible schema:** Evolve your graph model as your business needs change, without rigid constraints. Graph currently doesn't support schema evolution in place, so structural changes require you to create a new graph model and reload your data. For more information, see [Design a graph schema](design-graph-schema.md).
 - **Integration with Fabric:** Graph works with OneLake and Power BI, enabling seamless analytics and visualization.
 
+For details on how node types and edge types map to lakehouse tables in Fabric, see [Understand node types and edge types](design-graph-schema.md#understand-node-types-and-edge-types).
+
 ## Related content
 
+- [Design a graph schema](design-graph-schema.md)
+- [Tutorial: Introduction to graph](tutorial-introduction.md)
 - [Try Microsoft Fabric for free](../fundamentals/fabric-trial.md)
 - [End-to-end tutorials in Microsoft Fabric](../fundamentals/end-to-end-tutorials.md)
diff --git a/docs/graph/how-graph-works.md b/docs/graph/how-graph-works.md
@@ -3,7 +3,7 @@ title: How graph in Microsoft Fabric works
 description: Learn how data flows through graph in Microsoft Fabric, from data ingestion and storage in OneLake to graph modeling, querying, and returning results.
 #customer intent: As a data analyst or engineer, I want to understand how graph in Microsoft Fabric processes and queries data so that I can evaluate whether it fits my analytical needs.
 ms.topic: concept-article
-ms.date: 03/31/2026
+ms.date: 04/10/2026
 ms.reviewer: wangwilliam
 ai-usage: ai-assisted
 ---
@@ -40,7 +40,7 @@ In the graph modeling step, you define the graph schema by specifying:
 - **Edge types:** Relationships between entities, such as "purchases," "contains," or "produces."
 - **Table mappings:** How node and edge definitions map to the underlying source tables.
 
-This step creates the [labeled property graph](graph-data-models.md) structure. Complete graph modeling before you query the graph.
+This step creates the [labeled property graph](graph-data-models.md) structure. Complete graph modeling before you query the graph. For guidance on making these modeling decisions, see [Design a graph schema](design-graph-schema.md).
 
 > [!NOTE]
 > Graph currently doesn't support schema evolution. If you need to make structural changes—such as adding new properties, modifying labels, or changing relationship types—reingest the updated source data into a new model.
diff --git a/docs/graph/overview.md b/docs/graph/overview.md
@@ -2,7 +2,7 @@
 title: What is graph in Microsoft Fabric?
 description: Learn about the core purpose, architecture, and benefits of graph in Microsoft Fabric, including integration and feature highlights.
 ms.topic: overview
-ms.date: 03/26/2026
+ms.date: 04/10/2026
 ms.reviewer: wangwilliam
 ms.custom: references_regions
 ms.search.form: graph overview
@@ -45,7 +45,7 @@ By using graph, you can:
 - Create a labeled property graph over structured data in OneLake by defining its nodes and edges in terms of underlying tabular data. To learn how to load and refresh source data, see [Manage and refresh data](manage-data.md).
 
     > [!IMPORTANT]
-    > Graph currently doesn't support schema evolution. After you ingest and model your data, the structure of nodes, relationships, and properties is fixed. If you need to make structural changes - such as adding new properties, modifying labels, or changing relationship types - you must reingest the updated source data into a new model.
+    > Graph currently doesn't support schema evolution. After you ingest and model your data, the structure of nodes, relationships, and properties is fixed. If you need to make structural changes - such as adding new properties, modifying labels, or changing relationship types - you must reingest the updated source data into a new model. For guidance on planning your schema, see [Design a graph schema](design-graph-schema.md).
 
 - Query by using GQL (Graph Query Language), including pattern matching, path constructs, aggregations, and other features as they're released. The official International Standard for GQL is [ISO/IEC 39075 Information technology — Database languages — GQL](https://www.iso.org/standard/76120.html).
 
diff --git a/docs/graph/toc.yml b/docs/graph/toc.yml
@@ -43,6 +43,8 @@ items:
         href: security-overview.md
   - name: How-tos
     items:
+      - name: Design a graph schema
+        href: design-graph-schema.md
       - name: Share and manage permissions
         href: share-graph-manage-permissions.md
       - name: Manage data
diff --git a/docs/graph/tutorial-model-node-edge-from-same-table.md b/docs/graph/tutorial-model-node-edge-from-same-table.md
@@ -2,7 +2,7 @@
 title: "Tutorial: Create Node and Edge Types from One Mapping Table"
 description: Learn how to create multiple node types and edge types from a single mapping table in your graph model in Microsoft Fabric.
 ms.topic: tutorial
-ms.date: 03/24/2026
+ms.date: 04/10/2026
 ms.reviewer: wangwilliam
 ms.search.form: Tutorial - Add nodes and edges from one mapping table
 ai-usage: ai-assisted
@@ -115,6 +115,9 @@ In this tutorial step, you derived two node types and one edge type from the sin
 
 This pattern is useful whenever a relational table contains embedded entities that you want to represent as separate nodes in your graph. Look for columns that represent distinct real-world entities, such as countries, cities, or departments, as candidates for extraction into their own node types.
 
+> [!TIP]
+> For more modeling patterns and decision guidance, see [Design a graph schema](design-graph-schema.md).
+
 ## Next step
 
 > [!div class="nextstepaction"]
diff --git a/docs/graph/tutorial-model-nodes.md b/docs/graph/tutorial-model-nodes.md
@@ -2,7 +2,7 @@
 title: "Tutorial: Add node types to your graph"
 description: Learn how to add node types to your graph model in Microsoft Fabric by mapping source tables and configuring node properties.
 ms.topic: tutorial
-ms.date: 03/24/2026
+ms.date: 04/10/2026
 ms.reviewer: wangwilliam
 ms.search.form: Tutorial - Add nodes to your graph
 ai-usage: ai-assisted
@@ -44,7 +44,7 @@ To add node types to your graph, follow these steps:
     - **ID** of the mapping column: `CustomerID_K`
 
    > [!TIP]
-   > You can set compound keys (IDs consisting of multiple columns).
+   > You can set compound keys (IDs consisting of multiple columns). After you select a mapping table, choose one column from the **ID** dropdown, then use the dropdown again to add another column.
 
 1. Select **Confirm** to add the node type to your graph.
 1. Repeat the process for all remaining node types listed in the [Adventure Works node mappings](#adventure-works-node-mappings) table.