Simon Du - How Over-Parameterization Slows Down Gradient Descent