Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 770 | 2019 |
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 235 | 2020 |
Personalized speech recognition on mobile devices I McGraw, R Prabhavalkar, R Alvarez, MG Arenas, K Rao, D Rybach, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 233 | 2016 |
Speaker verification using co-location information RA Guevara, O Hansson US Patent 9,792,914, 2017 | 225 | 2017 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 214 | 2019 |
Locally-connected and convolutional neural networks for small footprint speaker recognition Y Chen, IL Moreno, T Sainath, M Visontai, R Alvarez, C Parada | 131 | 2015 |
Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks R Prabhavalkar, R Alvarez, C Parada, P Nakkiran, TN Sainath 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 112 | 2015 |
Compressing deep neural networks using a rank-constrained topology. P Nakkiran, R Alvarez, R Prabhavalkar, C Parada INTERSPEECH, 1473-1477, 2015 | 98 | 2015 |
Text-dependent speaker identification D Roblek, M Sharifi, RA Guevara US Patent 9,542,948, 2017 | 94 | 2017 |
End-to-end streaming keyword spotting A Raziel, P Hyun-Jin arXiv preprint arXiv:1812.02802, 2018 | 84* | 2018 |
Optimizing speech recognition for the edge Y Shangguan, J Li, Q Liang, R Alvarez, I McGraw arXiv preprint arXiv:1909.12408, 2019 | 73 | 2019 |
On the efficient representation and execution of deep acoustic models R Alvarez, R Prabhavalkar, A Bakhtin arXiv preprint arXiv:1607.04683, 2016 | 70 | 2016 |
Speaker verification using co-location information RA Guevara, O Hansson US Patent 9,257,120, 2016 | 60 | 2016 |
A cascade architecture for keyword spotting on mobile devices A Gruenstein, R Alvarez, C Thornton, M Ghodrat arXiv preprint arXiv:1712.03603, 2017 | 41 | 2017 |
Systems and methods for performing actions in response to user gestures in captured images R Alvarez US Patent 9,953,216, 2018 | 35 | 2018 |
Rank-constrained neural networks RA Guevara, P Nakkiran US Patent 9,767,410, 2017 | 31 | 2017 |
Automatic gain control for speech recognition R Alvarez, P Nakkiran US Patent App. 14/727,741, 2016 | 24 | 2016 |
Speaker verification using co-location information RA Guevara, O Hansson US Patent 10,147,429, 2018 | 22 | 2018 |
Color image classification through fitting of implicit surfaces R Álvarez, E Millán, R Swain-Oropeza, A Aceves-López Advances in Artificial Intelligence–IBERAMIA 2004: 9th Ibero-American …, 2004 | 16 | 2004 |
On the quantization of recurrent neural networks J Li, R Alvarez arXiv preprint arXiv:2101.05453, 2021 | 15 | 2021 |