Cloud Cost Optimization Pipeline with PySpark, Dataproc & BigQuery