First steps
Once you have followed the steps in the Installation section to install the Operator and its dependencies, deploy an Airflow cluster and its dependencies. Afterward you can verify that it works by running and tracking an example DAG.
Setup
With the external dependencies required by Airflow (Postgresql and Redis) installed, install the Airflow Stacklet itself.
Supported versions for PostgreSQL and Redis can be found in the Airflow documentation.
Secret with Airflow credentials
Create a Secret with the necessary credentials, this entails database connection credentials as well as an admin account for Airflow itself.
Create a file called airflow-credentials.yaml:
---
apiVersion: v1
kind: Secret
metadata:
  name: simple-airflow-credentials
type: Opaque
stringData:
  adminUser.username: airflow
  adminUser.firstname: Airflow
  adminUser.lastname: Admin
  adminUser.email: airflow@airflow.com
  adminUser.password: airflow
  connections.secretKey: thisISaSECRET_1234
  connections.sqlalchemyDatabaseUri: postgresql+psycopg2://airflow:airflow@airflow-postgresql.default.svc.cluster.local/airflow
  # Only needed when using celery workers (instead of Kubernetes executors)
  connections.celeryResultBackend: db+postgresql://airflow:airflow@airflow-postgresql.default.svc.cluster.local/airflow
  connections.celeryBrokerUrl: redis://:redis@airflow-redis-master:6379/0And apply it:
kubectl apply -f airflow-credentials.yamlThe connections.secretKey is used for securely signing the session cookies and can be used for any other security related needs by extensions.
It should be a long random string of bytes.
connections.sqlalchemyDatabaseUri must contain the connection string to the SQL database storing the Airflow metadata.
connections.celeryResultBackend must contain the connection string to the SQL database storing the job metadata (the example above uses the same PostgreSQL database for both).
connections.celeryBrokerUrl must contain the connection string to the Redis instance used for queuing the jobs submitted to the airflow executor(s).
The adminUser fields are used to create an admin user.
| The admin user is disabled if you use a non-default authentication mechanism like LDAP. | 
Airflow
An Airflow cluster is made of up three components:
- 
webserver: this provides the main UI for user-interaction
- 
executors: the CeleryExecutor or KubernetesExecutor nodes over which the job workload is distributed by the scheduler
- 
scheduler: responsible for triggering jobs and persisting their metadata to the backend database
Create a file named airflow.yaml with the following contents:
---
apiVersion: airflow.stackable.tech/v1alpha1
kind: AirflowCluster
metadata:
  name: airflow
spec:
  image:
    productVersion: 2.10.5
  clusterConfig:
    loadExamples: true
    exposeConfig: false
    credentialsSecret: simple-airflow-credentials
  webservers:
    roleConfig:
      listenerClass: external-unstable
    roleGroups:
      default:
        replicas: 1
  celeryExecutors:
    roleGroups:
      default:
        replicas: 2
    config:
      resources:
        cpu:
          min: 400m
          max: 800m
        memory:
          limit: 2Gi
  schedulers:
    roleGroups:
      default:
        replicas: 1And apply it:
kubectl apply -f airflow.yaml
Where:
- 
metadata.namecontains the name of the Airflow cluster.
- 
the product version of the Docker image provided by Stackable must be set in spec.image.productVersion.
- 
spec.celeryExecutors: deploy executors managed by Airflow’s Celery engine. Alternatively you can usekuberenetesExectorsthat use Airflow’s Kubernetes engine for executor management. For more information see https://airflow.apache.org/docs/apache-airflow/stable/executor/index.html#executor-types).
- 
the spec.clusterConfig.loadExampleskey is optional and defaults tofalse. It is set totruehere as the example DAGs are used when verifying the installation.
- 
the spec.clusterConfig.exposeConfigkey is optional and defaults tofalse. It is set totrueonly as an aid to verify the configuration and should never be used as such in anything other than test or demo clusters.
- 
the previously created secret must be referenced in spec.clusterConfig.credentialsSecret.
| The version you need to specify for spec.image.productVersionis the desired version of Apache Airflow.
You can optionally specify thespec.image.stackableVersionto a certain release like23.11.0but it is recommended to leave it out and use the default provided by the operator.
Check our image registry for a list of available versions. Information on how to browse the registry can be found here.
It should generally be safe to simply use the latest version that is available. | 
This creates the actual Airflow cluster.
After a while, all the Pods in the StatefulSets should be ready:
kubectl get statefulsetThe output should show all pods ready, including the external dependencies:
NAME READY AGE airflow-postgresql 1/1 16m airflow-redis-master 1/1 16m airflow-redis-replicas 1/1 16m airflow-scheduler-default 1/1 11m airflow-webserver-default 1/1 11m airflow-celery-executor-default 2/2 11m
When the Airflow cluster has been created and the database is initialized, Airflow can be opened in the
browser: the webserver UI port defaults to 8080 can be forwarded to the local host:
kubectl port-forward svc/airflow-webserver 8080 2>&1 >/dev/null &
Verify that it works
The Webserver UI can now be opened in the browser with http://localhost:8080. Enter the admin credentials from the Kubernetes secret:
 
Since the examples were loaded in the cluster definition, they appear under the DAGs tabs:
 
Select one of these DAGs by clicking on the name in the left-hand column e.g. example_trigger_target_dag. Click on the arrow in the top right of the screen, select "Trigger DAG" and the DAG nodes are automatically highlighted as the job works through its phases.
 
Great! You have set up an Airflow cluster, connected to it and run your first DAG!
Alternative: Using the command line
If you prefer to interact directly with the API instead of using the web interface, you can use the following commands. The DAG is one of the example DAGs named example_trigger_target_dag. To trigger a DAG run via the API requires an initial extra step to ensure that the DAG is not in a paused state:
  curl -s --user airflow:airflow -H 'Content-Type:application/json' \
    -XPATCH http://localhost:8080/api/v1/dags/example_trigger_target_dag \
    -d '{"is_paused": false}'A DAG can then be triggered by providing the DAG name (in this case, example_trigger_target_dag).
The response identifies the DAG identifier, which can be parse out of the JSON like this:
  curl -s --user airflow:airflow -H 'Content-Type:application/json' \
    -XPOST http://localhost:8080/api/v1/dags/example_trigger_target_dag/dagRuns \
    -d '{}' | jq -r '.dag_run_id'If this identifier is stored in a variable such as dag_id (manually replaced in the command) you can run this command to access the status of the DAG run:
  curl -s --user airflow:airflow -H 'Content-Type:application/json' \
    -XGET http://localhost:8080/api/v1/dags/example_trigger_target_dag/dagRuns/"$dag_id" | jq -r '.state'What’s next
Look at the Usage guide to find out more about configuring your Airflow Stacklet and loading your own DAG files.