Problem with start date and scheduled date in Apache Airflow

Airflow schedules tasks at the end of the interval (See documentation reference)

Meaning that when you do:

start_date: datetime(2020, 12, 7, 8, 0,0)
schedule_interval: '0 8 * * *'

The first run will kick in at 2020-12-08 at 08:00+- (depends on resources)

This run’s execution_date will be: 2020-12-07 08:00

The next run will kick in at 2020-12-09 at 08:00

This run’s execution_date of 2020-12-08 08:00.

Since today is 2020-12-08 the next run didn’t kick in because it’s not the end of the interval yet.

Leave a Comment