Measuring deep learning performance - an empirical study of performance distributions across architectures and tasks
Non-determinism in deep learning algorithm design and implementation leads to performance variation, meaning model performance is not a single value,...