Clone the OADP Operator repository:
git clone https://github.com/Amoghrd/oadp-operator.git
Switch to myBranch.
Build the operator image:
podman build -f build/Dockerfile . -t oadp-operator:latest
Push the image to a registry. For example,
podman push localhost/oadp-operator:latest <REGISTRY_URL>
In order to use a locally built image of the operator, please update the operator.yaml file. Update the image of the oadp-operator container with the image registry URL. You can edit the file manually or use the following command( <REGISTRY_URL> is the placeholder for your own registry url in the command):
sed -i 's|quay.io/konveyor/oadp-operator:latest|<REGISTRY_URL>|g' deploy/operator.yaml
Create an oadp-operator-source.yaml
file like below in oadp-operator directory:
apiVersion: operators.coreos.com/v1
kind: OperatorSource
metadata:
name: oadp-operator
namespace: openshift-marketplace
spec:
type: appregistry
endpoint: https://quay.io/cnr
registryNamespace: deshah
displayName: "OADP Operator"
publisher: "deshah@redhat.com"
Remove the deployed resources:
oc delete -f deploy/crds/konveyor.openshift.io_v1alpha1_velero_cr.yaml
oc delete -f deploy/crds/konveyor.openshift.io_veleros_crd.yaml
oc delete -f deploy/
oc delete namespace oadp-operator
oc delete crd $(oc get crds | grep velero.io | awk -F ' ' '{print $1}')
oc delete -f oadp-operator-source.yaml
Run the following commands (note: they should be run from the root of the oadp-operator directory):
oc create namespace oadp-operator
oc project oadp-operator
oc create secret generic <SECRET_NAME> --namespace oadp-operator --from-file cloud=<CREDENTIALS_FILE_PATH>
oc create -f oadp-operator-source.yaml
Navigate to the OpenShift console in your web browser. Under the Administrator view, go to Operators on the left tab and click on OperatorHub. Search for the OADP Operator in the search bar. Click on it to install and subscribe to the operator.
If you go to Installed Operators and select the oadp-operator project, you should see the OADP Operator successfully installed:
Navigate to the OpenShift console. Under the Administrator view, go to Operators on the left tab and click on OperatorHub. Search for the OpenShift Container Storage operator in the search bar. Click on it to install and subscribe to the operator. Make sure to install it under the oadp-operator namespace.
If you go to Installed Operators and select the openshift-operator project, you should see the OpenShift Container Storage operator successfully installed:
In order to use OLM for OADP deployment, you need to change flag olm_managed
in the konveyor.openshift.io_v1alpha1_velero_cr.yaml
to true
. The file is present in deploy/crds folder.
Moreover, to use Nooba, in the konveyor.openshift.io_v1alpha1_velero_cr.yaml
, you need to:
- Set the flag
noobaa
totrue
. - Set the volume snapshot location region to the correct value.
For instance the konveyor.openshift.io_v1alpha1_velero_cr.yaml
file might look something like this:
apiVersion: konveyor.openshift.io/v1alpha1
kind: Velero
metadata:
name: example-velero
spec:
use_upstream_images: true
olm_managed: true
noobaa: true
default_velero_plugins:
- aws
backup_storage_locations:
- name: default
provider: aws
object_storage:
bucket: myBucket
prefix: "velero"
config:
region: us-east-1
profile: "default"
credentials_secret_ref:
name: cloud-credentials
namespace: oadp-operator
volume_snapshot_locations:
- name: default
provider: aws
config:
region: us-west-1
profile: "default"
enable_restic: true
When the installation succeeds, create a Velero CR
oc create -f deploy/crds/konveyor.openshift.io_v1alpha1_velero_cr.yaml
Post completion of all the above steps, you can check if the operator was successfully installed; the expected result for the command oc get all -n oadp-operator
is as follows:
NAME READY STATUS RESTARTS AGE
pod/aws-s3-provisioner-6cdf54b89-8nz7l 1/1 Running 0 162m
pod/noobaa-core-0 1/1 Running 2 160m
pod/noobaa-db-0 1/1 Running 0 160m
pod/noobaa-endpoint-7f4bcfb665-wmfhf 1/1 Running 0 156m
pod/noobaa-operator-7b79bf7c68-wz5jc 1/1 Running 0 163m
pod/oadp-operator-69fc6bfcb4-xjn2l 1/1 Running 1 161m
pod/ocs-operator-7b564dc46f-hzh8g 1/1 Running 0 163m
pod/rook-ceph-operator-6f985689b4-t95k4 1/1 Running 0 163m
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
service/noobaa-db ClusterIP 172.30.29.6 <none> 27017/TCP 160m
service/noobaa-mgmt LoadBalancer 172.30.222.31 a1276fa043dc5469caaa9f6b566b5091-1561678137.us-west-1.elb.amazonaws.com 80:31438/TCP,443:31408/TCP,8445:32375/TCP,8446:31952/TCP 160m
service/oadp-operator-metrics ClusterIP 172.30.171.159 <none> 8383/TCP,8686/TCP 161m
service/s3 LoadBalancer 172.30.184.59 aebf6dedc595e489496daaeb74c65f12-400970079.us-west-1.elb.amazonaws.com 80:31321/TCP,443:30213/TCP,8444:32073/TCP 160m
NAME READY UP-TO-DATE AVAILABLE AGE
deployment.apps/aws-s3-provisioner 1/1 1 1 163m
deployment.apps/noobaa-endpoint 1/1 1 1 156m
deployment.apps/noobaa-operator 1/1 1 1 163m
deployment.apps/oadp-operator 1/1 1 1 161m
deployment.apps/ocs-operator 1/1 1 1 163m
deployment.apps/rook-ceph-operator 1/1 1 1 163m
NAME DESIRED CURRENT READY AGE
replicaset.apps/aws-s3-provisioner-5dd8cccfd8 0 0 0 163m
replicaset.apps/aws-s3-provisioner-6cdf54b89 1 1 1 162m
replicaset.apps/noobaa-endpoint-7f4bcfb665 1 1 1 156m
replicaset.apps/noobaa-operator-7b79bf7c68 1 1 1 163m
replicaset.apps/oadp-operator-69fc6bfcb4 1 1 1 161m
replicaset.apps/ocs-operator-7b564dc46f 1 1 1 163m
replicaset.apps/rook-ceph-operator-6f985689b4 1 1 1 163m
NAME READY AGE
statefulset.apps/noobaa-core 1/1 160m
statefulset.apps/noobaa-db 1/1 160m
NAME REFERENCE TARGETS MINPODS MAXPODS REPLICAS AGE
horizontalpodautoscaler.autoscaling/noobaa-endpoint Deployment/noobaa-endpoint 0%/80% 1 1 1 156m
NAME HOST/PORT PATH SERVICES PORT TERMINATION WILDCARD
route.route.openshift.io/noobaa-mgmt noobaa-mgmt-oadp-operator.apps.cluster-dlu-4-5.dlu-4-5.mg.dog8code.com noobaa-mgmt mgmt-https reencrypt None
route.route.openshift.io/s3 s3-oadp-operator.apps.cluster-dlu-4-5.dlu-4-5.mg.dog8code.com s3 s3-https reencrypt None
Also, oc get noobaa
should give:
NAME MGMT-ENDPOINTS S3-ENDPOINTS IMAGE PHASE AGE
noobaa [https://10.0.185.183:31408] [https://10.0.185.183:30213] registry.redhat.io/ocs4/mcg-core-rhel8@sha256:689c5a109b81190ddc507b17b7b44ae00029951e7e2c80a6e33358a53945dab6 Ready 161m
Note: Please refer the following documentation for more information regarding OADP Operator with NooBaa install, debugging scenarios and cleanup.
Install OADP Operator and use NooBaa as a BackupStoraeLocation
Cleanup OADP Operator with NooBaa
In this step we will walkthrough the steps of installing a complex application that will be used to demonstration the use of velero in a backup and restore scenario.
Available Applications
Please follow the link below for installing Cassandra.
Running ansible-playbook install.yaml
should set everything up.
End results should look like the following in OpenShift:
The output should also have the following when running the command oc get all -n cassandra-stateful
:
pod/cassandra-0 1/1 Running 0 6d1h
pod/cassandra-1 1/1 Running 0 6d1h
pod/cassandra-2 1/1 Running 0 6d1h
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
service/cassandra ClusterIP None <none> 9042/TCP 6d1h
NAME READY AGE
statefulset.apps/cassandra 3/3 6d1h
Please follow this link for installing Postgres: Postgres example.
Running all the commands to install patroni will give results like the following:
Output should also have the looking when running the command oc get all -n patroni
:
NAME READY STATUS RESTARTS AGE
pod/patroni-persistent-0 1/1 Running 0 6d4h
pod/patroni-persistent-1 1/1 Running 0 6d4h
pod/patroni-persistent-2 1/1 Running 0 6d4h
pod/pgbench 1/1 Running 0 6d4h
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
service/patroni-persistent ClusterIP 172.30.146.208 <none> 5432/TCP 6d4h
service/patroni-persistent-master ClusterIP 172.30.44.10 <none> 5432/TCP 6d4h
service/patroni-persistent-replica ClusterIP 172.30.111.87 <none> 5432/TCP 6d4h
NAME READY AGE
statefulset.apps/patroni-persistent 3/3 6d4h
Now that we have a complex application on OpenShift, we can demonstrate a backup on the application.
Click the link based on which application was installed.
First before our backup, lets populate our database in Cassandra with some sample data.
Run the commands in the following order. This will access the shell and populate some data.
oc exec -it cassandra-0 -- cqlsh CREATE KEYSPACE classicmodels WITH REPLICATION = { 'class' : 'SimpleStrategy', 'replication_factor' : 3 }; use classicmodels; CREATE TABLE offices (officeCode text PRIMARY KEY, city text, phone text, addressLine1 text, addressLine2 text, state text, country text, postalCode text, territory text); INSERT into offices(officeCode, city, phone, addressLine1, addressLine2, state, country ,postalCode, territory) values ('1','San Francisco','+1 650 219 4782','100 Market Street','Suite 300','CA','USA','94080','NA');
Optional: You can use SELECT * FROM classicmodels.offices;
to check that your sample data is entered correctly.
Last exit the shell to move onto the backup.
Now for the backup, all we need to do to perform the backup is running the command ansible-playbook backup.yaml
.
Backup will look like the following with the nodetool operations being called in cassandra.
Disable Gossip (Stops communication with other nodes) Disable Thrift (Stops communication with one of the two protocols for listening to client) Disable Binary (Stops communication with the other protocol for listening to client) Nodetool flush is called to flush all memory to disk Perform Backup Enable Gossip, Thrift, and Binary (Start listening to connections again)
Velero hooks look like this
pre.hook.backup.velero.io/command: '["/bin/bash", "-c", "opt/cassandra/bin/nodetool disablegossip && opt/cassandra/bin/nodetool disablethrift && opt/cassandra/bin/nodetool disablebinary && opt/cassandra/bin/nodetool flush"]' post.hook.backup.velero.io/command: '["/bin/bash", "-c", "opt/cassandra/bin/nodetool enablegossip && opt/cassandra/bin/nodetool enablethrift && opt/cassandra/bin/nodetool enablebinary"]'
The Cassandra backup can be checked by running velero get backups
.
Output showing Cassandra in the backup should look like the following:
Output showing description of backup using velero describe backup
Note: This backup is using restic because vsl was not configured.
Used opt-in approach. Annotate Pod with the following.
kubectl -n YOUR_POD_NAMESPACE annotate pod/YOUR_POD_NAME backup.velero.io/backup-volumes=YOUR_VOLUME_NAME_1,YOUR_VOLUME_NAME_2,...
Name: cassandra Namespace: oadp-operator Labels: velero.io/storage-location=default Annotations: velero.io/source-cluster-k8s-gitversion=v1.17.1 velero.io/source-cluster-k8s-major-version=1 velero.io/source-cluster-k8s-minor-version=17+ Phase: Completed Namespaces: Included: cassandra-stateful Excluded: <none> Resources: Included: * Excluded: <none> Cluster-scoped: auto Label selector: <none> Storage Location: default Snapshot PVs: auto TTL: 720h0m0s Hooks: <none> Backup Format Version: 1 Started: 2020-08-05 15:29:29 -0400 EDT Completed: 2020-08-05 15:31:36 -0400 EDT Expiration: 2020-09-04 15:29:29 -0400 EDT Resource List: apps/v1/ControllerRevision: - cassandra-stateful/cassandra-7f6dd487c4 apps/v1/StatefulSet: - cassandra-stateful/cassandra authorization.openshift.io/v1/RoleBinding: - cassandra-stateful/system:deployers - cassandra-stateful/system:image-builders - cassandra-stateful/system:image-pullers rbac.authorization.k8s.io/v1/RoleBinding: - cassandra-stateful/system:deployers - cassandra-stateful/system:image-builders - cassandra-stateful/system:image-pullers v1/Endpoints: - cassandra-stateful/cassandra v1/Event: - cassandra-stateful/cassandra-0.162864baf6d28fab - cassandra-stateful/cassandra-0.162864bcd951d606 - cassandra-stateful/cassandra-0.162864bcf2313a6b - cassandra-stateful/cassandra-0.162864bd16274de5 - cassandra-stateful/cassandra-0.162864bd1b3084c6 - cassandra-stateful/cassandra-1.1628648122760fc3 - cassandra-stateful/cassandra-1.1628648e6bfc1f12 - cassandra-stateful/cassandra-1.162864cdc171b9a3 - cassandra-stateful/cassandra-1.162864cddc6ccce5 - cassandra-stateful/cassandra-1.162864ce0a07b2f1 - cassandra-stateful/cassandra-1.162864ce13ffce3a - cassandra-stateful/cassandra-2.162864baf6d22579 - cassandra-stateful/cassandra-2.162864bcc8b8da2b - cassandra-stateful/cassandra-2.162864bcf2327934 - cassandra-stateful/cassandra-2.162864bd15748bd6 - cassandra-stateful/cassandra-2.162864bd1b348408 v1/Namespace: - cassandra-stateful v1/PersistentVolume: - pvc-3efd688e-96e8-4bf4-8d2e-38548ac06c91 - pvc-91431776-ccf1-42f4-a88c-7d0eb4c8298d - pvc-cf82ea3c-c545-42ec-a7c1-ced47ea557e5 v1/PersistentVolumeClaim: - cassandra-stateful/cassandra-data-cassandra-0 - cassandra-stateful/cassandra-data-cassandra-1 - cassandra-stateful/cassandra-data-cassandra-2 v1/Pod: - cassandra-stateful/cassandra-0 - cassandra-stateful/cassandra-1 - cassandra-stateful/cassandra-2 v1/Secret: - cassandra-stateful/builder-dockercfg-qtvlk - cassandra-stateful/builder-token-9nf7b - cassandra-stateful/builder-token-njwp5 - cassandra-stateful/default-dockercfg-f5wdd - cassandra-stateful/default-token-dhnvg - cassandra-stateful/default-token-jgsx8 - cassandra-stateful/deployer-dockercfg-7gz2z - cassandra-stateful/deployer-token-92snh - cassandra-stateful/deployer-token-zgzw9 v1/Service: - cassandra-stateful/cassandra v1/ServiceAccount: - cassandra-stateful/builder - cassandra-stateful/default - cassandra-stateful/deployer Persistent Volumes: <none included> Restic Backups: Completed: cassandra-stateful/cassandra-0: cassandra-data
To start the backup, start by creating annotation of prehooks and posthooks. These will serve the purpose of enabling the quiesce behavior for Patroni during backup: Detailed Directions
oc annotate pod -n patroni \
pre.hook.backup.velero.io/command= '["/bin/bash", "-c","patronictl pause && pg_ctl stop -D pgdata/pgroot/data"]' \
pre.hook.backup.velero.io/container=patroni-persistent \
post.hook.backup.velero.io/command='["/bin/bash", "-c", "patronictl resume"]'\
post.hook.backup.velero.io/container=patroni-persistent
Then we can run oc create -f postgres-backup.yaml
to create the backup itself.
Then run oc get volumesnapshotcontent
and make sure the output looks similar to the one below.
$ oc get volumesnapshotcontent
NAME READYTOUSE RESTORESIZE DELETIONPOLICY DRIVER VOLUMESNAPSHOTCLASS VOLUMESNAPSHOT AGE
snapcontent-24ef293e-68b1-4f01-8d9b-673d20c6b423 true 5368709120 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-0-6l8rf 46m
snapcontent-bea37537-a585-4c5d-a02d-ce1067f067a8 true 5368709120 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-2-hk4pk 45m
snapcontent-c0e5a49b-e5d8-4235-8f90-96f2eedf2d04 true 5368709120 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-1-q7nj6 46m
snapcontent-d1104635-17d9-4c83-82e4-94032b31054e true 2147483648 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-8gx2g 44m
A disaster scenario of deleting the namespace will be performed to show that the restore functionalty of velero works.
Pick which app that a backup was performed with.
First make sure Step 6 was performed and a Backup of Cassandra exists velero get backups
.
Then following the Cassandra Example, run the
command ansible-playbook delete.yaml
which will delete Cassandra and perform the disaster scenario.
The output should look like the following. Results should also show the Cassandra namespace being terminated and then deleting from OpenShift:
First make sure Step 6 was performed and a Backup of Cassandra exists velero get backups
.
Next we can safely create a disaster scenario and safely delete the namespace.
Run oc delete namespace patroni
to delete the namespace.
Results should look like the following:
We can now move onto restoring the application after the disaster scenario.
Pick which app that a backup was performed with.
Restoring Cassandra by simply running ansible-playbook restore
. This restore should look like:
Should look like the following.
Now run velero describe restore cassandra --details
to check the data being restored and the restore as complete.
Name: cassandra Namespace: oadp-operator Labels: <none> Annotations: <none> Phase: Completed Backup: cassandra Namespaces: Included: * Excluded: <none> Resources: Included: * Excluded: nodes, events, events.events.k8s.io, backups.velero.io, restores.velero.io, resticrepositories.velero.io Cluster-scoped: auto Namespace mappings: <none> Label selector: <none> Restore PVs: true Restic Restores: Completed: cassandra-stateful/cassandra-0: cassandra-data
Restoring Postgres by running oc create -f postgres-restore.yaml
. The output should show:
Then check the CSI snapshots were created after restore.
$ oc get volumesnapshotcontent
NAME READYTOUSE RESTORESIZE DELETIONPOLICY DRIVER VOLUMESNAPSHOTCLASS VOLUMESNAPSHOT AGE
snapcontent-24ef293e-68b1-4f01-8d9b-673d20c6b423 true 5368709120 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-0-6l8rf 46m
snapcontent-bea37537-a585-4c5d-a02d-ce1067f067a8 true 5368709120 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-2-hk4pk 45m
snapcontent-c0e5a49b-e5d8-4235-8f90-96f2eedf2d04 true 5368709120 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-1-q7nj6 46m
snapcontent-d1104635-17d9-4c83-82e4-94032b31054e true 2147483648 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-8gx2g 44m
velero-velero-patroni-8gx2g-hhgd2 true 0 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-8gx2g 40m
velero-velero-patroni-persistent-patroni-persistent-0-6l8rcppgv true 0 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-0-6l8rf 17m
velero-velero-patroni-persistent-patroni-persistent-1-q7njpv44s true 0 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-1-q7nj6 17m
velero-velero-patroni-persistent-patroni-persistent-2-hk4pv8mgz true 0 Retain rook-ceph.rbd.csi.ceph.com csi-rbdplugin-snapclass velero-patroni-persistent-patroni-persistent-2-hk4pk 17m