Effect of the learning rate in the gradient descent algorithm