as number of hidden layers increase, model capacity increases