OpenDAS / text-generation-inference
Commit 00e6ce44
Authored Oct 17, 2022 by Olivier Dehaene
Update aml deployment
Parent: bcb53903
Showing 4 changed files with 9 additions and 4 deletions (+9 -4)
aml/README.md +1 -0
aml/deployment.yaml +2 -3
aml/endpoint.yaml +1 -1
aml/model.yaml +5 -0
aml/README.md
@@ -2,6 +2,7 @@
 docker build . -t db4c2190dd824d1f950f5d1555fbadf0.azurecr.io/text-generation:0.1
 docker push db4c2190dd824d1f950f5d1555fbadf0.azurecr.io/text-generation:0.1
+az ml model create -f model.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace
 az ml online-endpoint create -f endpoint.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace
 az ml online-deployment create -f deployment.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace
```
```
\ No newline at end of file
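The README now lists three `az ml ... create` steps, and their order matters: the model is registered first, then the endpoint, then the deployment that refers to both. A minimal sketch of that sequence follows. The flags and names are copied from the README; the helper names `build_commands` and `run_all` and the dry-run switch are hypothetical additions for illustration.

```python
import shlex
import subprocess

RESOURCE_GROUP = "HuggingFace-BLOOM-ModelPage"
WORKSPACE = "HuggingFace"

# (az ml subcommand, YAML spec) pairs, in the order the README lists them.
STEPS = [
    ("model", "model.yaml"),
    ("online-endpoint", "endpoint.yaml"),
    ("online-deployment", "deployment.yaml"),
]


def build_commands(resource_group=RESOURCE_GROUP, workspace=WORKSPACE):
    """Return the three `az ml ... create` invocations as argv lists."""
    return [
        ["az", "ml", kind, "create", "-f", spec, "-g", resource_group, "-w", workspace]
        for kind, spec in STEPS
    ]


def run_all(dry_run=True):
    """Print each command; also execute it when dry_run is False."""
    for argv in build_commands():
        print(shlex.join(argv))
        if not dry_run:
            # check=True raises on the first failing step, so later
            # steps never run against a missing model or endpoint.
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    run_all(dry_run=True)
```

With `dry_run=True` nothing is sent to Azure; the script only prints the commands in the order they would run.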
aml/deployment.yaml
 $schema: https://azuremlschemas.azureedge.net/latest/managedOnlineDeployment.schema.json
 name: bloom-deployment
 endpoint_name: bloom-inference
-model:
-  name: bloom
-  path: ./bloom
+model: azureml:bloom:1
 model_mount_path: /var/azureml-model
 environment_variables:
   MODEL_BASE_PATH: /var/azureml-model/bloom
...
@@ -24,6 +22,7 @@ environment:
 instance_type: Standard_ND96amsr_A100_v4
 request_settings:
   request_timeout_ms: 90000
+  max_concurrent_requests_per_instance: 256
 liveness_probe:
   initial_delay: 300
   timeout: 20
...
aml/endpoint.yaml
 $schema: https://azuremlsdk2.blob.core.windows.net/latest/managedOnlineEndpoint.schema.json
 name: bloom-inference
-auth_mode: aml_token
+auth_mode: key
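Switching `auth_mode` from `aml_token` to `key` means clients authenticate with a static endpoint key rather than a short-lived token. A minimal client-side sketch follows, assuming the key is sent as a bearer credential in the `Authorization` header. The scoring URI, the payload shape, and the helper name `build_scoring_request` are hypothetical and do not come from this commit.

```python
import json
import urllib.request


def build_scoring_request(scoring_uri, api_key, payload):
    """Build (but do not send) a POST request for a key-authenticated endpoint."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        scoring_uri,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumption: the endpoint key is passed as a bearer credential.
            "Authorization": f"Bearer {api_key}",
        },
    )


if __name__ == "__main__":
    req = build_scoring_request(
        "https://example.invalid/score",  # hypothetical scoring URI
        "<endpoint-key>",                 # placeholder, not a real key
        {"inputs": "Hello"},              # hypothetical payload shape
    )
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send the request; the sketch stops short of that so it runs without network access.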
aml/model.yaml (new file, mode 0 → 100644)
+$schema: https://azuremlschemas.azureedge.net/latest/model.schema.json
+name: bloom
+version: 1
+path: ./bloom
+type: custom_model
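After this commit, deployment.yaml points at the registered model through the reference `azureml:bloom:1`, which has to agree with the `name` and `version` fields in model.yaml. A minimal sketch of that consistency check follows, assuming the reference always has the form `azureml:<name>:<version>`. The helper names are hypothetical, and the spec is passed as a plain dict rather than parsed from YAML to keep the example dependency-free.

```python
import re

# Assumed reference shape: azureml:<name>:<version>
MODEL_REF = re.compile(r"^azureml:(?P<name>[^:]+):(?P<version>[^:]+)$")


def parse_model_ref(ref):
    """Split 'azureml:<name>:<version>' into a (name, version) pair of strings."""
    match = MODEL_REF.match(ref)
    if match is None:
        raise ValueError(f"not an azureml model reference: {ref!r}")
    return match["name"], match["version"]


def ref_matches_spec(ref, spec):
    """Return True when the reference names the model described by spec."""
    name, version = parse_model_ref(ref)
    # YAML may load `version: 1` as an int, so compare as strings.
    return name == str(spec["name"]) and version == str(spec["version"])


if __name__ == "__main__":
    model_spec = {"name": "bloom", "version": 1}  # values from model.yaml
    print(ref_matches_spec("azureml:bloom:1", model_spec))
```

A check like this could run before `az ml online-deployment create` to catch a version bump in one file that was not mirrored in the other.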