The breakthrough came with the application of deep neural networks to acoustic modeling. The seminal work by Hinton et al. (2012) demonstrated that DNNs, used in place of GMMs to estimate the state probabilities of HMMs, could outperform GMM-based systems on a variety of speech recognition benchmarks, sometimes by a large margin. This work was pivotal in shifting speech recognition from shallow models to deep architectures.