Using Alembic with Lumigator

Alembic is a lightweight database migration tool for usage with the SQLAlchemy Database Toolkit for Python. In Lumigator, whenever we make changes to the database (DB) schemas (via our SQLAlchemy models), we must create a migration path for the DB, removing blockers for developers working on the codebase, and operators upgrading to newer versions of Lumigator.

For new installations of Lumigator, Alembic is enabled by default to create the database and manage migrations. This document covers the ways in which Alembic can be used with Lumigator.

Note

Alembic creates an additional table (alembic_version) in the database which it uses to store the matching revision ID.

Running commands

Please run alembic commands in this document within the backend (lumigator/python/mzai/backend) folder, using uv. For example, where the Lumigator repository is cloned to your $HOME directory:

user@host:~/$ cd lumigator/lumigator/python/mzai/backend
user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic --version

Data model changes

When code changes are made to the data models, or when new models are added, a manual step is required in order to ensure that the models are visible to Alembic.

First, the imports at the top of env.py MUST import your package:

from backend.records.{package} import * # noqa: F403

The reason the wildcard is used is so you don’t have to explicitly import every type in the package, which makes this solution more brittle. The comment is required to ignore the Flake8 F403 rule which causes issues with ruff linting.

This ensures that the types are loaded such that the BaseRecord.metadata fully represents all our types to Alembic. If a package is not imported then changes will not be captured by Alembic and shown in migration revisions.

Environment variables

In order to connect to a “real” database to compare its state against the codified data models, an SQLALCHEMY_DATABASE_URL is required to be present in your environment. You can export the variable in your shell, or provide it on each invocation of alembic:

user@host:~/lumigator/lumigator/python/mzai/backend$ export SQLALCHEMY_DATABASE_URL=sqlite:///local.db

or

user@host:~/lumigator/lumigator/python/mzai/backend$ SQLALCHEMY_DATABASE_URL=sqlite:///local.db uv run alembic history

The rest of the document assumes SQLALCHEMY_DATABASE_URL is exported. If SQLALCHEMY_DATABASE_URL is not present then a default of sqlite:///local.db will be used (see: here).

If you’ve followed the README for backend, you should have sourced the virtual environment. This means you can run the alembic command directly in the terminal. Alternatively, you can also run it using uv:

user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic --version

Pre-existing databases (not currently managed by Alembic)

Existing Lumigator operators/contributors may already have a populated database with data they don’t want to lose. In this scenario, the database must be brought under the management of Alembic using the alembic stamp command.

This is done by stamping the database to indicate the version of the revisions that Alembic should manage going forward.

My database is up to date

If you believe your database already matches the most up-to-date models:

user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic stamp head

My database is in a different state (from a prior release)

This scenario requires manual review of the existing revisions stored in the versions folder, to determine which revision ID represents the current state of the database containing the data.

Revisions are stored in a format resembling a linked-list, with each revision containing a revision ID and down_revision ID (which can be None for the initial revision). In each revision the changes captured in the upgrade() method must be examined.

For information on the important information on see revision structure.

To align your database with a specific revision (migration ID e75fa022c781 aligns with the current state of our database):

user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic stamp e75fa022c781

Other Alembic commands

Let us now look at some other useful commands that can be used with Alembic. These commands are useful for viewing the history of migrations, the current revision, and for upgrading and downgrading the database.

  • Viewing migration history

    user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic history
    
  • Show your current revision

    user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic current
    
  • Upgrading

    • Manually upgrade your database to match the latest models:

      user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic upgrade head
      
    • You can also upgrade ‘relative’ to your current state, so to move forwards 1 revision:

      user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic upgrade +1
      
    • If you know the version you want to migrate to, you can specify it:

      user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic upgrade cb3cf47d9259
      
  • Downgrading

    • To downgrade to the original state (not recommended) use:

      user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic downgrade base
      
    • You can also downgrade ‘relative’ to your current state, so to move backwards 1 revision:

      user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic downgrade -1
      
    • If you know the version you want to migrate to, you can specify it:

      user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic upgrade cb3cf47d9259
      

Creating revisions

When you make a change to a database schema via the models, you should create a migration (revision) that handles upgrading the database, and downgrading too (this allows a linear chain to be followed when moving between migrations).

To create an empty revision that you populate manually:

user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic revision -m "{Explanatory commit-like message}"

This will create a new Python file under [versions/](https://github.com/mozilla-ai/lumigator/tree/main/lumigator/python/mzai/backend/backend/alembic/versions).

For example:

user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic revision -m "added desc field to job"

should see a new Python file created with a commit/ID prepended to your message:

cb3cf47d9259_added_desc_field_to_job.py

Alembic can attempt to work out the changes required to migrate your database if you ask it to create a revision using the --autogenerate flag when creating a revision. This is the recommended way to create revisions in Lumigator.

user@host:~/lumigator/lumigator/python/mzai/backend$ uv run alembic revision --autogenerate -m {Explanatory commit-like message}

Please note that “automatic” doesn’t mean this can be completely automated, as manual steps are still required in verifying the output in the generated upgrade() and downgrade() methods.

Revision structure

The main parts of the migration Python files Alembic creates are:

Name

Purpose

revision

ID of this migration

down_revision

ID of the previous revision (None when this is the first migration)

upgrade()

Function which handles changes required when upgrading to this migration

downgrade()

Function which handles changes required when downgrading from this migration