Upgrading to Riak KV 3.0.11

[use admin riak admin]: https://www.tiot.jp/riak-docs//riak/kv/3.0.11/using/admin/riak admin

Overview

You can upgrade one node or your whole cluster to Riak KV 3.0.11 by following the instructions below.

Tip

KV nodes negotiate with each other to determine supported operating modes. This allows clusters containing mixed-versions of Riak KV to interoperate without special configuration, and simplifies rolling upgrades.

General Process

For every node in the cluster:

  1. Stop Riak KV.
  2. Back up the Riak /etc, /data, and /basho-patches directories.
  3. Remove your /basho-patches directory.
  4. Upgrade Riak KV.
    • If you are upgrading from EE to OSS, uninstall your EE KV package before upgrading.
  5. (Optional) If you would like to potentially downgrade at some point, update your advanced.config file to opt-out of the AAE updates.
  6. If you’re upgrading from EE to OSS, apply your customized settings to vm.args/riak.conf and app.config/advanced.config
  7. If you’re using MDC replication to clusters with versions less than 2.2.0, update your advanced.config file to over-ride the default bucket properties for compatibility.
  8. Start Riak KV.
  9. Verify Riak KV is running the upgraded version.
  10. Wait for the riak_kv service to start.
  11. Wait for any hinted handoffs to complete.

Before starting the rolling upgrade process on your cluster, check out the Upgrading Riak KV: Production Checklist, which covers details and questions to consider before upgrading.

Transitioning to Leveled backend

Riak KV 2.9 introduced a new backend specifically for Riak, Leveled:

The leveled backend is not compatible with other backends in terms of the serialised disk format. There is no in-place transition possible from bitcask/eleveldb/hanoidb to leveled. Transitioning requires a node replace operation. It is recommended to: * First transition to 2.9 with the current backend in-place, minimising the time spent running mis-matched versions in parallel; * Then as a second phase run a rolling series of node transfers to replace the nodes with the previous backend, with nodes with the leveled backend.

Components That Complicate Downgrades

We do our best to make all features that change data formats on disk opt-in; however, some features may be introduced that we either believe are so important that we automatically opt-in users on upgrade or there is no way to provide direct backward compatibility. Downgrading environments with these features can require more effort or might not be possible.

  • Automatic features alter the data format on disk, but are considered important enough for users to be automatically opted-in.
  • Required features must be accepted as a part of the upgrade. Internal Solr version upgrades that change the data format on disk are an example of a required feature upgrade.
  • One Way features, when enabled, will make a clean downgrade of a cluster impossible.
Feature Automatic Required One Way Notes
Active anti-entropy file format changes Can opt-out using a capability.
LZ4 compression in LevelDB
Global expiration in LevelDB
HyperLogLog data type On downgrade data written in HLL format is unreadable.

When Downgrading is No Longer an Option

If you decide to upgrade to version 2.9, you can still downgrade your cluster to an earlier version of Riak KV if you wish, unless you transfer all of your nodes to the new Leveled backend.

If you use other new features, you can still downgrade your cluster, but you will no longer be able to use those features after the downgrade.

Upgrading process

1. Stop Riak KV on the node you are going to upgrade:

riak stop

2. Back up your /etc (app.config and vm.args), /data, and /basho-patches directories.

sudo tar -czf riak_backup.tar.gz /var/lib/riak /etc/riak /usr/lib64/riak/lib/basho-patches
sudo tar -czf riak_backup.tar.gz /var/lib/riak /etc/riak /usr/lib/riak/lib/basho-patches

3. Remove your /basho-patches directory:

sudo rm -rf /usr/lib64/riak/lib/basho-patches/*
sudo rm -rf /usr/lib/riak/lib/basho-patches*

4. Upgrade Riak KV:

Upgrading from KV Enterprise Edition

If you are upgrading from Riak KV EE to Riak KV OSS, you must uninstall your Riak KV EE package right now, before you can install the OSS version.

sudo rpm -Uvh »riak_package_name«.rpm
sudo dpkg -i »riak_package_name«.deb

5.a. (Optional) If you would like to keep your AAE trees in a format that will facilitate downgrading, the capability override should be in the riak_kv proplist of the advanced.config file:

   {riak_kv, [
     {override_capability, [
       {object_hash_version, [{use, legacy}] }
     ]}
   ]}

5.b. (Optional) If you would like to keep your leveldb compression in a format that will facilitate downgrading, the capability override should be in riak.conf:

   leveldb.compression.algorithm=snappy

5.c. (OSS Only)If you are upgrading from Riak KV OSS =< 2.2.3, you must perform the following steps before moving on:

  • A standard package uninstall should not have removed your data directories, but if it did, move your backup to where the data directory should be.
  • Then copy any customizations from your backed-up vm.args/riak.conf to the newly installed vm.args/riak.conf file (these files may be identical).
  • The advanced.config file from the newly installed version will be significantly different from your backed-up file. It will have many new sections along with the original ones. Copy the customizations from your original advanced.config file into the appropriate sections in the new one. Ensure that the following sections are present in advanced.conf:

5.d. (EE Only with MDC)If you need to replicate to EE clusters with versions less than 2.2.0, the capability override for bucket properties should be in the riak_repl proplist of the advanced.config file:

   {riak_repl, [
     {override_capability, [
       {default_bucket_props_hash, [{use, [consistent, datatype, n_val, allow_mult, last_write_wins]}] }
     ]}
   ]}

Once all of the clusters have been upgraded to version 2.2.0 or greater, this override should be removed.

5.e. (EE Only)JMX and SNMP are no longer present in Riak KV. You must remove or comment out all references to them in your riak.conf/advanced.config files for Riak to start successfully post-upgrade.

6. Restart Riak KV:

riak start

7. Verify that Riak KV is running the new version:

riak version

8. Wait for the riak_kv service to start:

riak admin wait-for-service riak_kv »target_node«
  • »target_node« is the node which you have just upgraded (e.g. riak@192.168.1.11)

9. Wait for any hinted handoff transfers to complete:

riak admin transfers
  • While the node was offline, other nodes may have accepted writes on its behalf. This data is transferred to the node when it becomes available.

10. Repeat the process for the remaining nodes in the cluster.

Basho Patches

After upgrading, you should ensure that any custom patches contained in the basho-patches directory are examined to determine their application to the upgraded version. You can find this information in the Release Notes.

If you find that patches no longer apply to the upgraded version, you should remove them from the basho-patches directory prior to operating the node in production.

The following lists locations of the basho-patches directory for each supported operating system:

  • CentOS & RHEL Linux: /usr/lib64/riak/lib/basho-patches
  • Debian & Ubuntu Linux: /usr/lib/riak/lib/basho-patches
  • FreeBSD: /usr/local/lib/riak/lib/basho-patches
  • SmartOS: /opt/local/lib/riak/lib/basho-patches
  • Solaris 10: /opt/riak/lib/basho-patches

Riaknostic

It is a good idea to also verify some basic configuration and general health of the Riak KV node after upgrading by using Riak KV’s built-in diagnostic utility Riaknostic.

Ensure that Riak KV is running on the node, and issue the following command:

riak admin diag

Make the recommended changes from the command output to ensure optimal node operation.