DataProc - a (near) complete guide
This post is the summation of months of debugging and optimizing ML and AI workflows on Google Cloud DataProc clusters using Cloud Composer / Apache Airflow for orchestration.
If you would prefer to jump directly to a chapter: