.. _swh-storage-cassandra-migrations: Cassandra migrations ==================== While swh-storage's PostgreSQL backend relies on the generic ``swh db`` CLI provided by :ref:`swh-core` to manage migrations, its Cassandra backend uses a specific design and CLI. Migration model --------------- Due to the potentially long runtime (tables need to be rewritten to apply some changes), and non-transactionality of migrations in Cassandra, we model migrations as a DAG instead of a sequence of updates. This allows, for example, running two migrations in parallel, as long as they don't depend on each other. There are two types of migrations: those that can be done automatically (and are implemented in Python), and those that need to be performed manually (for example, by setting up a :ref:`replayer `, to write the content of a new table from the :ref:`journal `). Each migration has the following properties: ``id`` A unique human-readable name, starting with the date it was written. The date does not have any impact in the order migrations are applied. ``dependencies`` A set of migration ids that need to be applied before this migration can start ``min_read_version`` What minimum version of swh-storage can read the database after this migration started (or after it is completed). ``status`` a string among ``pending``, ``running``, or ``completed`` Version checking ---------------- swh-storage's :meth:`check_config `_ documentation for details. ``swh storage cassandra mark-upgraded`` Tells the database a migration was applied. This is typically used after manually applying a migration, as ``swh storage cassandra upgrade`` does it on its own for automated migrations.