Upgrade#
Warning
Before upgrading, visit the release notes for a description of recent changes. While we try to preserve backward compatibility whenever possible, there are occasionally incompatible changes introduced in new versions of HPE Machine Learning Development Environment – for example, the format of the master and agent configuration files might change.
To upgrade, follow the same steps as you did during the initial installation of HPE Machine Learning Development Environment. For example, if you deployed
your HPE Machine Learning Development Environment cluster on Amazon Web Services (AWS), you would
run det deploy aws up --cluster-id CLUSTER_ID --keypair KEYPAIR_NAME
.
Important
The specific upgrade commands vary by environment. You’ll need to run the same commands (including any flags) that were run when you installed HPE Machine Learning Development Environment.
Before starting an upgrade, first follow the steps below to safely shut down the cluster. Once the upgrade is complete and HPE Machine Learning Development Environment is restarted, all suspended experiments will be resumed automatically.
Disable all HPE Machine Learning Development Environment agents in the cluster:
det -m <MASTER_ADDRESS> agent disable --all
where
MASTER_ADDRESS
is the IP address or host name where the HPE Machine Learning Development Environment master can be found. This will cause all tasks running on those agents to be checkpointed and terminated. The checkpoint process might take some time to complete; you can monitor which tasks are still running viadet slot list
.Take a backup of the HPE Machine Learning Development Environment database using pg_dump. This is a safety precaution in case any problems occur after upgrading HPE Machine Learning Development Environment.
All users should also upgrade the CLI by running
pip install --upgrade determined