Skip to content

Commit 47385ff

Browse files
committed
Fixing some words
1 parent 8d05730 commit 47385ff

1 file changed

Lines changed: 36 additions & 35 deletions

File tree

learn-pr/wwl/design-alm-process-ai-powered-business-solutions/includes/2-design-alm-process-data-used-ai-models-agents.md

Lines changed: 36 additions & 35 deletions
Original file line numberDiff line numberDiff line change
@@ -1,28 +1,28 @@
11
## Overview
22

3-
This expert unit equips solution architects to design an application lifecycle management (ALM) process for **all data that powers AI models and agents**. You'll define data-centric stages, controls, roles, and promotion gates that keep datasets, prompts, knowledge sources, and telemetry **governed, reproducible, and compliant** from inception through retirement. The approach integrates enterprise data governance, regional data movement considerations, and operational monitoring so that AI solutions scale safely across environments.
3+
This expert unit equips solution architects to design an application lifecycle management (ALM) process for **all data that powers AI models and agents**. You define data-centric stages, controls, roles, and promotion gates that keep datasets, prompts, knowledge sources, and telemetry **governed, reproducible, and compliant** from inception through retirement. The approach integrates enterprise data governance, regional data movement considerations, and operational monitoring so that AI solutions scale safely across environments.
44

55
## Learning objectives
66

7-
By the end of this unit, learners will be able to:
7+
By the end of this unit, learners are be able to:
88

9-
Map the **AI data lifecycle** to ALM stages and define artifacts, owners, and promotion gates.
9+
* Map the **AI data lifecycle** to ALM stages and define artifacts, owners, and promotion gates.
1010

11-
Design **environment strategies** (Dev/Test/PreProd/Prod) that isolate data and enforce guardrails.
11+
* Design **environment strategies** (Dev/Test/PreProd/Prod) that isolate data and enforce guardrails.
1212

13-
Specify **data residency and movement** policies for Copilot and agent features.
13+
* Specify **data residency and movement** policies for Copilot and agent features.
1414

15-
Operationalize **quality, lineage, and drift** checks with go/nogo criteria and rollback plans.
15+
* Operationalize **quality, lineage, and drift** checks with go/nogo criteria and rollback plans.
1616

17-
Establish a **RACI** and change workflows for datasets, knowledge sources, and evaluation telemetry.
17+
* Establish a **RACI** and change workflows for datasets, knowledge sources, and evaluation telemetry.
1818

1919
## ALM foundations for AI data
2020

2121
### Scope the "data" in AI ALM
2222

2323
#### Treat the following as versioned, promotable ALM artifacts:
2424

25-
**Training & finetuning datasets** (raw, curated, feature/embedding sets).
25+
**Training & fine-tuning datasets** (raw, curated, feature/embedding sets).
2626

2727
**Evaluation/Testing datasets** and "golden" sets for regression testing.
2828

@@ -36,19 +36,19 @@ Establish a **RACI** and change workflows for datasets, knowledge sources, and e
3636

3737
### Environment strategy
3838

39-
#### Create separate data planes per environment—Dev → Test → PreProd → Prod—with:
39+
#### Create separate data planes per environment—Dev → Test → Pre-Prod → Prod—with:
4040

41-
* **Leastprivilege access**; no crossenv shared identity.
41+
* **Least privilege access**; no crossing shared identity.
4242

43-
* **Redgold datasets** pattern: _red_ (mutable, experimental) vs. _gold_ (frozen, promoted).
43+
* **Red gold datasets** pattern: _red_ (mutable, experimental) vs. _gold_ (frozen, promoted).
4444

4545
* **Promotion gates** that require evidence: quality reports, bias checks, lineage, and security signoff.
4646

47-
* **Professional visualEnvironment & data flow (text diagram)**<br>Dev (Red data → feature builds) → Test (Repro runs, eval sets) → PreProd (Gold candidates) → Prod (Gold only)<br>Controls at each hop: validation → approval → immutable snapshot → catalog update
47+
* **Professional visualEnvironment & data flow (text diagram)**<br>Dev (Red data → feature builds) → Test (Repro runs, evaluation sets) → Pre-Prod (Gold candidates) → Prod (Gold only)<br>Controls at each hop: validation → approval → immutable snapshot → catalog update
4848

4949
## The AI data ALM process (endtoend)
5050

51-
### Phase APlan & Catalog
51+
### Phase APlan & Catalog
5252

5353
* Identify business scenarios and **data contracts** (purpose, fields, retention, owners).
5454

@@ -58,7 +58,7 @@ Establish a **RACI** and change workflows for datasets, knowledge sources, and e
5858

5959
* **Gate A → B:** Data contract approved; assets discoverable with owners and tags.
6060

61-
### Phase BIngest & Prepare
61+
### Phase BIngest & Prepare
6262

6363
* Profile and remediate quality (missingness, outliers, imbalance).
6464

@@ -68,45 +68,46 @@ Establish a **RACI** and change workflows for datasets, knowledge sources, and e
6868

6969
* **Gate B → C:** Quality and lineage reports; signed reproducibility log.
7070

71-
### Phase CDevelop & Evaluate
71+
### Phase CDevelop & Evaluate
7272

7373
* Train/iterate using Dev/Test data; **never** train on production knowledge.
7474

75-
* Run **eval suites** (accuracy, safety, robustness, cost) on golden sets.
75+
* Run **evaluation suites** (accuracy, safety, robustness, cost) on golden sets.
7676

7777
* Store **model/data cards** with dataset references and context limits.
7878

7979
* **Gate C → D:** Evaluation thresholds met; risk & safety findings addressed.
8080

81-
### Phase DStage & Approve
81+
### Phase DStage & Approve
8282

8383
* Perform **privacy, security, and compliance** reviews (DLP, RAI, export controls).
8484

85-
* Execute **canary runs** using masked/representative Prodlike data.
85+
* Execute **canary runs** using masked/representative Prod like data.
8686

87-
* _Canary testing is a low-risk deployment strategy that acts as a form of "production regression testing" by releasing new code to a small, isolated subset of users or servers to identify issues before a full rollout._
87+
* _Canary testing is a low-risk deployment strategy.
88+
It releases new code to a small, isolated subset of users or servers to identify issues before a full rollout._
8889

8990
* Freeze **gold datasets** and sign **immutability attestations**.
9091

9192
* **Gate D → E:** CAB approval; deployment runbooks and rollback plans ready.
9293

93-
### Phase EDeploy & Serve
94+
### Phase EDeploy & Serve
9495

9596
* Promote gold corpora and indexes; **semantic indexing** or retrieval stores refreshed.
9697

9798
* Enforce **region/residency** settings and connector allow/deny lists.
9899

99100
* Register release in **data catalog** and publish **consumer contracts**.
100101

101-
### Phase FOperate & Monitor
102+
### Phase FOperate & Monitor
102103

103104
* Track **latency, cost/token, success rate, safety violations, data access denials**.
104105

105106
* Detect **data drift** vs. baselines; trigger **safeguard actions** (circuit breakers, HITL).
106107

107-
* Run scheduled **reevaluation** with golden sets; file backlog items with trace IDs.
108+
* Run scheduled **re-evaluation** with golden sets; file backlog items with trace IDs.
108109

109-
### Phase GEvolve & Retire
110+
### Phase GEvolve & Retire
110111

111112
* Rotate or retrain on updated gold sets; **retest** before promote.
112113

@@ -116,31 +117,31 @@ Establish a **RACI** and change workflows for datasets, knowledge sources, and e
116117

117118
## Controls & checks (what to verify at each promotion gate)
118119

119-
### Text tableALM gate checks
120+
### Text tableALM gate checks
120121

121122
| **Gate** | **Control area** | **Questions to answer** | **Evidence required** |
122123
|---|---|---|---|
123124
| **A→B** | Catalog & ownership | Is the owner accountable? Is sensitivity labeled? | Data contract; catalog record; label policy proof |
124-
| **B→C** | Data quality & lineage | Is data profiled, balanced, deidentified where needed? | Profiling report; lineage graph; version IDs |
125-
| **C→D** | Evaluation & safety | Do evals meet thresholds? Any bias or unsafe patterns? | Metrics pack; safety report; model/data card |
125+
| **B→C** | Data quality & lineage | Is data profiled, balanced, de-identified where needed? | Profiling report; lineage graph; version IDs |
126+
| **C→D** | Evaluation & safety | Do evaluations meet thresholds? Any bias or unsafe patterns? | Metrics pack; safety report; model/data card |
126127
| **D→E** | Compliance & residency | Do region rules and DLP policies permit use? | Residency mapping; DLP rules; approval memo |
127128
| **E→F** | Runtime readiness | Can we monitor, roll back, and cap costs? | Dashboards; alarms; rollback plan; budget guard |
128129

129-
## Region, residency, and crossborder movement
130+
## Region, residency, and cross-border movement
130131

131-
Document **where** prompts/outputs may be processed for Copilot and Power Platform features, and when **crossregion capacity** is required.
132+
Document **where** prompts/outputs may be processed for Copilot and Power Platform features, and when **cross region capacity** is required.
132133

133134
In regulated scenarios, set the default to **inregion** and require explicit approval to enable **overflow processing**.
134135

135136
Align **mailbox region** (for activity data) and environment geo with your policy; define exceptions and purge schedules.
136137

137-
Professional visualResidency decision tree (text)<br>Inregion capacity available? → Yes: keep local.<br>No: Is overflow allowed for this workload tier? → If yes, enable crossregion under admin control; else block feature or defer.
138+
Professional visualResidency decision tree (text)<br>Inregion capacity available? → Yes: keep local.<br>No: Is overflow allowed for this workload tier? → If yes, enable cross region under admin control; else block feature or defer.
138139

139140
## Roles and RACI
140141

141-
## Text RACIdata used in AI models and agents
142+
## Text RACIdata used in AI models and agents
142143

143-
The below RACI chart is a representative sample for an organization. It is up to the architect on the project to validate and adjust the roles and responsibilities as appropriate for the implementation.
144+
The below RACI chart is a representative sample for an organization. It's up to the architect on the project to validate and adjust the roles and responsibilities as appropriate for the implementation.
144145

145146
| **Activity** | **Data Owner** | **AI Architect** | **Security/Compliance** | **Platform Admin** | **Product Owner** |
146147
|---|---|---|---|---|---|
@@ -161,7 +162,7 @@ Maintain **baseline metrics** per release: latency p95, success %, token/€ per
161162

162163
Compare live to baseline; if drift exceeds thresholds, **autoopen an incident**, route to data owner, and pause affected actions.
163164

164-
Rerun eval suites nightly/weekly against **golden sets**; store timeseries for audit.
165+
Rerun evaluation suites nightly/weekly against **golden sets**; store time series for audit.
165166

166167
## Checklists you can paste into your runbooks
167168

@@ -171,12 +172,12 @@ Rerun eval suites nightly/weekly against **golden sets**; store timeseries for a
171172
[ ] Sensitivity labels/DLP rules applied; connectors approved.
172173
[ ] Lineage graph current; dataset snapshot **immutable** and versioned.
173174
[ ] Evaluation thresholds met; safety risks mitigated.
174-
[ ] Residency decision recorded; cross-region toggle reviewed.
175-
[ ] Dashboards, budgets, alerts, and rollback validated in PreProd.
175+
[ ] Residency decision recorded; cross-regions toggle reviewed.
176+
[ ] Dashboards, budgets, alerts, and rollback validated in Pre-Prod.
176177

177178
### Retirement
178179

179-
[ ] Consumers notified; cutover plan executed.
180+
[ ] Consumers notified; cutover plans executed.
180181
[ ] Snapshots archived/deleted per retention; access revoked.
181182
[ ] Audit and lineage preserved; catalog updated.
182183

0 commit comments

Comments
 (0)