python - Still unable to run all tasks in parallel with Airflow -
i have task set in this question.
based on ui looks dependencies well-defined:
i have tested individual tasks follows: airflow test capone_dash_preproc aaag5608078m2 2017-07-25
. works great, updating target database entries corresponding user. when try run full task, however, seems it's getting hung up. python3 dash_dag.py
results in following terminal output on web server, repeated endlessly. cpus quiet, doesn't seem there's calculation happening:
/usr/local/lib/python3.6/site-packages/flask/exthook.py:71: extdeprecationwarning: importing flask.ext.cache deprecated, use flask_cache instead. .format(x=modname), extdeprecationwarning [2017-07-25 16:48:23,266] [58627] {models.py:167} info - filling dagbag /users/aaronpolhamus/airflow/dags [2017-07-25 16:48:25 -0500] [58487] [info] handling signal: ttou [2017-07-25 16:49:13 -0500] [58557] [info] worker exiting (pid: 58557) [2017-07-25 16:49:44 -0500] [58487] [info] handling signal: ttin [2017-07-25 16:49:44 -0500] [58642] [info] booting worker pid: 58642 /usr/local/lib/python3.6/site-packages/flask/exthook.py:71: extdeprecationwarning: importing flask.ext.cache deprecated, use flask_cache instead. .format(x=modname), extdeprecationwarning [2017-07-25 16:49:44,607] [58642] {models.py:167} info - filling dagbag /users/aaronpolhamus/airflow/dags [2017-07-25 16:49:46 -0500] [58487] [info] handling signal: ttou [2017-07-25 16:50:21 -0500] [58568] [info] worker exiting (pid: 58568) [2017-07-25 16:50:51 -0500] [58487] [info] handling signal: ttin [2017-07-25 16:50:51 -0500] [58661] [info] booting worker pid: 58661 /usr/local/lib/python3.6/site-packages/flask/exthook.py:71: extdeprecationwarning: importing flask.ext.cache deprecated, use flask_cache instead. .format(x=modname), extdeprecationwarning [2017-07-25 16:50:52,324] [58661] {models.py:167} info - filling dagbag /users/aaronpolhamus/airflow/dags [2017-07-25 16:50:54 -0500] [58487] [info] handling signal: ttou [2017-07-25 16:51:20 -0500] [58596] [info] worker exiting (pid: 58596) [2017-07-25 16:51:50 -0500] [58487] [info] handling signal: ttin [2017-07-25 16:51:50 -0500] [58677] [info] booting worker pid: 58677 ...
my confusion arises fact individual tests run fine , populate database. entire run hangs , fails. obvious here?
airflow exists of multiple components. important being scheduler, webserver, , worker (in case of horizontal scaling). scheduler always* needs running have airflow execute tasks. webserver there provide nice ui on top of scheduler doing , rest api. components use backing database.
you showing output of webserver, while have expected see output of scheduler. screenshot posted , way try running dag assumption did not start it.
so start scheduler issuing airflow scheduler
on command line , dag kick off.
*backfills exception
Comments
Post a Comment