You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -67,53 +68,37 @@ This tutorial uses a premium instance of Azure Container Registry to improve col
67
68
--sku premium
68
69
```
69
70
70
-
## Pull, tag, and push your image
71
+
## Import the NVIDIA NIM image into your Azure Container Registry
71
72
72
-
Next, pull the image from NVIDIA GPU Cloud and push to Azure Container Registry.
73
+
Next, import the image from NVIDIA GPU Cloud to Azure Container Registry.
73
74
74
75
> [!NOTE]
75
76
> NVIDIA NICs each have their own hardware requirements. Make sure the GPU type you select supports the [NIM](https://build.nvidia.com/models?filters=nimType%3Anim_type_run_anywhere&q=llama) of your choice. The Llama3 NIM used in this tutorial can run on NVIDIA A100 GPUs.
76
77
77
-
1. Authenticate to the NVIDIA container registry.
78
-
79
-
```bash
80
-
docker login nvcr.io
81
-
```
82
-
83
-
After you run this command, the sign in process prompts you to enter a username. Enter **$oauthtoken** for your user name value.
84
-
85
-
Then you're prompted for a password. Enter your NVIDIA NGC API key here. Once authenticated to the NVIDIA registry, you can authenticate to the Azure registry.
86
-
87
78
1. Authenticate to Azure Container Registry.
88
79
89
80
```bash
90
81
az acr login --name $ACR_NAME
91
82
```
92
83
93
-
1. Pull the Llama3 NIM image.
94
-
95
-
```azurecli
96
-
docker pull nvcr.io/nim/meta/$CONTAINER_AND_TAG
97
-
```
98
-
99
-
1. Tag the image.
100
-
101
-
```azurecli
102
-
docker tag nvcr.io/nim/meta/$CONTAINER_AND_TAG$ACR_NAME.azurecr.io/$CONTAINER_AND_TAG
## Enable artifact streaming (recommended but optional)
112
97
113
-
When your container app runs, it pulls the container from your container registry. When you have larger images like in the case of AI workloads, this image pull may take some time. By enabling artifact streaming, you reduce the time needed, and your container app can take a longtime to start if you don't enable artifact streaming. Use the following steps to enable artifact streaming.
98
+
When your container app runs, it pulls the container from your container registry. When you have larger images like in the case of AI workloads, this image pull may take some time. By enabling artifact streaming, your container app will load the essential parts of your image first, reducing the amount of time to startup your container. Use the following steps to enable artifact streaming.
114
99
115
100
> [!NOTE]
116
-
> The following commands can take a few minutes to complete.
101
+
> The following commands can take a long time to complete.
117
102
118
103
1. Enable artifact streaming on your container registry.
119
104
@@ -136,7 +121,7 @@ When your container app runs, it pulls the container from your container registr
136
121
137
122
Next you create a container app with the NVIDIA GPU Cloud API key.
138
123
139
-
1. Create the container app.
124
+
1. Create the container app environment.
140
125
141
126
```azurecli
142
127
az containerapp env create \
@@ -177,6 +162,9 @@ Next you create a container app with the NVIDIA GPU Cloud API key.
177
162
178
163
This command returns the URL of your container app. Set this value aside in a text editor for use in a following command.
179
164
165
+
> [!NOTE]
166
+
> Some NIMs have longer startup times. To account for this, you can configure a [health probe](./health-probes.md) or set your container app's min-replica count with `--min-replicas 1` to keep a replica running at all times.
167
+
180
168
## Verify the application works
181
169
182
170
You can verify a successful deployment by sending a request `POST` request to your application.
0 commit comments