Improving PySpark Performance: Spark performance beyond the JVM - PyDataSG