Skip to content

Commit 5fadce4

Browse files
authored
Merge pull request #53791 from ivorb/info-extract-new
Add consolidated doc intel module
2 parents 54fc013 + 487636f commit 5fadce4

23 files changed

Lines changed: 618 additions & 0 deletions
Lines changed: 46 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,46 @@
1+
### YamlMime:LearningPath
2+
uid: learn.extract-insights-visual-data-azure
3+
metadata:
4+
title: Extract insights from visual data on Azure
5+
description: Use generative AI vision and Content Understanding in Azure to extract insights from visual data.
6+
ms.date: 03/11/2026
7+
author: ivorb
8+
ms.author: berryivor
9+
ms.topic: learning-path
10+
title: Extract insights from visual data on Azure
11+
summary: |
12+
Use generative AI, computer vision, and Content Understanding capabilities in Azure to extract insights from visual data, supporting scenarios like:
13+
- Image and video generation
14+
- Image and video analysis
15+
- Content Understanding and enrichment
16+
- Visual search and classification
17+
- Digital asset management (DAM)
18+
- Multimodal AI solutions
19+
prerequisites: |
20+
Before starting this learning path, you should already have:
21+
- Familiarity with Azure and Microsoft Foundry.
22+
- Programming experience.
23+
iconUrl: /training/achievements/generic-badge.svg
24+
levels:
25+
- intermediate
26+
roles:
27+
- ai-engineer
28+
- developer
29+
subjects:
30+
- artificial-intelligence
31+
products:
32+
- foundry-tools
33+
- microsoft-foundry
34+
- azure-cognitive-search
35+
modules:
36+
- learn.wwl.generate-images-azure-openai
37+
- learn.wwl.generate-video-with-foundry
38+
- learn.wwl.analyze-images-with-content-understanding
39+
- learn.wwl.develop-generative-ai-vision-apps
40+
- learn.wwl.analyze-content-ai
41+
- learn.wwl.analyze-content-ai-api
42+
- learn.wwl.extract-data-with-document-intelligence
43+
- learn.wwl.ai-knowledge-mining
44+
trophy:
45+
uid: learn.extract-insights-visual-data-azure.trophy
46+
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
### YamlMime:ModuleUnit
2+
uid: learn.wwl.extract-data-with-document-intelligence.introduction
3+
title: Introduction
4+
metadata:
5+
title: Introduction
6+
description: Introduction to extracting data with Azure Document Intelligence.
7+
ms.date: 03/11/2026
8+
author: ivorb
9+
ms.author: berryivor
10+
ms.topic: unit
11+
ai-usage: ai-assisted
12+
durationInMinutes: 3
13+
content: |
14+
[!include[](includes/1-introduction.md)]
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
### YamlMime:ModuleUnit
2+
uid: learn.wwl.extract-data-with-document-intelligence.what-is-document-intelligence
3+
title: What is Azure Document Intelligence?
4+
metadata:
5+
title: What is Azure Document Intelligence?
6+
description: Learn about the capabilities, components, and access options of Azure Document Intelligence.
7+
ms.date: 03/11/2026
8+
author: ivorb
9+
ms.author: berryivor
10+
ms.topic: unit
11+
ai-usage: ai-assisted
12+
durationInMinutes: 8
13+
content: |
14+
[!include[](includes/2-what-is-document-intelligence.md)]
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
### YamlMime:ModuleUnit
2+
uid: learn.wwl.extract-data-with-document-intelligence.use-document-intelligence-studio
3+
title: Use the Document Intelligence Studio
4+
metadata:
5+
title: Use the Document Intelligence Studio
6+
description: Learn how to use the Document Intelligence Studio to analyze, test, and build document processing solutions.
7+
ms.date: 03/11/2026
8+
author: ivorb
9+
ms.author: berryivor
10+
ms.topic: unit
11+
ai-usage: ai-assisted
12+
durationInMinutes: 6
13+
content: |
14+
[!include[](includes/3-use-document-intelligence-studio.md)]
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
### YamlMime:ModuleUnit
2+
uid: learn.wwl.extract-data-with-document-intelligence.use-prebuilt-models
3+
title: Use prebuilt models
4+
metadata:
5+
title: Use prebuilt models
6+
description: Learn about document analysis models and prebuilt models for extracting data from common document types.
7+
ms.date: 03/11/2026
8+
author: ivorb
9+
ms.author: berryivor
10+
ms.topic: unit
11+
ai-usage: ai-assisted
12+
durationInMinutes: 10
13+
content: |
14+
[!include[](includes/4-use-prebuilt-models.md)]
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
### YamlMime:ModuleUnit
2+
uid: learn.wwl.extract-data-with-document-intelligence.train-custom-models
3+
title: Train and use custom models
4+
metadata:
5+
title: Train and use custom models
6+
description: Learn how to train custom neural and template models and use them to extract data from your documents.
7+
ms.date: 03/11/2026
8+
author: ivorb
9+
ms.author: berryivor
10+
ms.topic: unit
11+
ai-usage: ai-assisted
12+
durationInMinutes: 8
13+
content: |
14+
[!include[](includes/5-train-custom-models.md)]
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
### YamlMime:ModuleUnit
2+
uid: learn.wwl.extract-data-with-document-intelligence.exercise
3+
title: Exercise - Analyze documents with Document Intelligence
4+
metadata:
5+
title: Exercise - Analyze documents with Document Intelligence
6+
description: Practice using Azure Document Intelligence to analyze documents with prebuilt models.
7+
ms.date: 03/11/2026
8+
author: ivorb
9+
ms.author: berryivor
10+
ms.topic: unit
11+
ai-usage: ai-assisted
12+
durationInMinutes: 30
13+
content: |
14+
[!include[](includes/6-exercise.md)]
Lines changed: 52 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,52 @@
1+
### YamlMime:ModuleUnit
2+
uid: learn.wwl.extract-data-with-document-intelligence.knowledge-check
3+
title: Module assessment
4+
metadata:
5+
title: Module assessment
6+
description: Knowledge check for extracting data with Azure Document Intelligence.
7+
ms.date: 03/11/2026
8+
author: ivorb
9+
ms.author: berryivor
10+
ms.topic: unit
11+
ai-usage: ai-assisted
12+
module_assessment: true
13+
durationInMinutes: 3
14+
content: |
15+
[!include[](includes/7-knowledge-check.md)]
16+
quiz:
17+
title: "Check your knowledge"
18+
questions:
19+
- content: "You need to extract text and table structure from a set of documents that have varying formats. You don't need to identify specific labeled fields. Which Document Intelligence model should you use?"
20+
choices:
21+
- content: "The read model."
22+
isCorrect: false
23+
explanation: "Incorrect. The read model extracts text and languages but doesn't extract tables or structural information."
24+
- content: "The layout model."
25+
isCorrect: true
26+
explanation: "Correct. The layout model extracts text, tables, selection marks, and document structure information, making it ideal for documents with varying formats where you need structural data."
27+
- content: "The invoice model."
28+
isCorrect: false
29+
explanation: "Incorrect. The invoice model is a prebuilt model designed specifically for invoices, not general documents with varying formats."
30+
- content: "You're building a custom model in Azure Document Intelligence. What training artifacts are required when training with the REST API?"
31+
choices:
32+
- content: "Only the sample form documents in a blob container."
33+
isCorrect: false
34+
explanation: "Incorrect. In addition to sample forms, you need JSON files that describe the fields, labels, and OCR data for each form."
35+
- content: "Sample forms along with ocr.json, labels.json, and fields.json files in a blob container."
36+
isCorrect: true
37+
explanation: "Correct. You need an ocr.json file for each sample form, a labels.json file for each form mapping fields to locations, and a single fields.json file describing the fields to extract."
38+
- content: "A minimum of 100 labeled forms and a trained classifier."
39+
isCorrect: false
40+
explanation: "Incorrect. You can train a custom model with as few as five to six sample forms. A classifier is optional and separate from the extraction model."
41+
- content: "A company processes both invoices and receipts. They want a single endpoint that routes each document to the correct extraction model. What should they use?"
42+
choices:
43+
- content: "A custom neural model."
44+
isCorrect: false
45+
explanation: "Incorrect. A custom neural model extracts data from one type of document. It doesn't automatically route between different document types."
46+
- content: "A prebuilt read model."
47+
isCorrect: false
48+
explanation: "Incorrect. The read model extracts text but doesn't classify documents or extract domain-specific fields."
49+
- content: "A composed model or a custom classifier paired with extraction models."
50+
isCorrect: true
51+
explanation: "Correct. A composed model combines multiple custom models into a single endpoint and classifies each document to the appropriate component model. Alternatively, a custom classifier can identify the document type before routing to the correct extraction model."
52+
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
### YamlMime:ModuleUnit
2+
uid: learn.wwl.extract-data-with-document-intelligence.summary
3+
title: Summary
4+
metadata:
5+
title: Summary
6+
description: Summary of extracting data with Azure Document Intelligence.
7+
ms.date: 03/11/2026
8+
author: ivorb
9+
ms.author: berryivor
10+
ms.topic: unit
11+
ai-usage: ai-assisted
12+
durationInMinutes: 3
13+
content: |
14+
[!include[](includes/8-summary.md)]
Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,19 @@
1+
Forms and documents are used to communicate information in every industry, every day. Many organizations still manually extract data from forms to exchange information, whether it's filing claims, enrolling patients, processing receipts for expense reports, or reviewing operations data.
2+
3+
Imagine you work for a company that processes thousands of invoices, receipts, and tax forms each month. You want to automate the extraction of key data from these documents to reduce manual effort and improve accuracy. Azure Document Intelligence provides the AI-powered tools you need to build this kind of solution.
4+
5+
Azure Document Intelligence is a cloud-based service in Microsoft Foundry that uses optical character recognition (OCR) and deep learning models to extract text, key-value pairs, tables, and structured data from forms and documents. It offers prebuilt models for common document types, document analysis models for general text extraction, and the ability to train custom models for your specific forms.
6+
7+
## Learning objectives
8+
9+
By the end of this module, you'll be able to:
10+
11+
- Describe the capabilities and components of Azure Document Intelligence.
12+
- Use the Document Intelligence Studio to explore and test models.
13+
- Use prebuilt models to extract data from common document types.
14+
- Train and use custom models for industry-specific forms.
15+
16+
## Prerequisites
17+
18+
- Familiarity with Azure and the Azure portal.
19+
- Programming experience with C# or Python.

0 commit comments

Comments
 (0)