Follow
Antonio Orvieto
Antonio Orvieto
ELLIS Institute Tübingen, Max Planck Institute for Intelligent Systems
Verified email at tue.ellis.eu - Homepage
Title
Cited by
Cited by
Year
Learning explanations that are hard to vary
G Parascandolo, A Neitz, A Orvieto, L Gresele, B Schölkopf
International Conference on Learning Representations (2021), 2020
1442020
Resurrecting recurrent neural networks for long sequences
A Orvieto, SL Smith, A Gu, A Fernando, C Gulcehre, R Pascanu, S De
International Conference on Machine Learning, 26670-26698, 2023
922023
A continuous-time perspective for modeling acceleration in Riemannian optimization
F Alimisis, A Orvieto, G Bécigneul, A Lucchi
International Conference on Artificial Intelligence and Statistics, 1297-1307, 2020
582020
Momentum improves optimization on Riemannian manifolds
F Alimisis, A Orvieto, G Becigneul, A Lucchi
International conference on artificial intelligence and statistics, 1351-1359, 2021
46*2021
Faster single-loop algorithms for minimax optimization without strong concavity
J Yang, A Orvieto, A Lucchi, N He
International Conference on Artificial Intelligence and Statistics, 5485-5517, 2022
442022
Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse
L Noci, S Anagnostidis, L Biggio, A Orvieto, SP Singh, A Lucchi
Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022
332022
Continuous-time models for stochastic optimization algorithms
A Orvieto, A Lucchi
Advances in Neural Information Processing Systems 32 (2019), 2018
332018
Anticorrelated noise injection for improved generalization
A Orvieto, H Kersting, F Proske, F Bach, A Lucchi
International Conference on Machine Learning (ICML), 2022, 2022
322022
The role of memory in stochastic optimization
A Orvieto, J Kohler, A Lucchi
Uncertainty in Artificial Intelligence, 356-366, 2020
202020
Shadowing properties of optimization algorithms
A Orvieto, A Lucchi
Advances in Neural Information Processing Systems 32 (2019), 2019
202019
Dynamics of SGD with Stochastic Polyak Stepsizes: Truly Adaptive Variants and Convergence to Exact Solution
A Orvieto, S Lacoste-Julien, N Loizou
Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022
172022
An accelerated dfo algorithm for finite-sum convex functions
Y Chen, A Orvieto, A Lucchi
International Conference on Machine Learning (ICML), 2020, 2020
172020
Explicit regularization in overparametrized models via noise injection
A Orvieto, A Raj, H Kersting, F Bach
International Conference on Artificial Intelligence and Statistics, 7265-7287, 2023
162023
Vanishing Curvature in Randomly Initialized Deep ReLU Networks.
A Orvieto, J Kohler, D Pavllo, T Hofmann, A Lucchi
AISTATS, 7942-7975, 2022
13*2022
An SDE for Modeling SAM: Theory and Insights
E Monzio Compagnoni, L Biggio, A Orvieto, FN Proske, H Kersting, ...
arXiv e-prints, arXiv: 2301.08203, 2023
12*2023
Achieving a better stability-plasticity trade-off via auxiliary networks in continual learning
S Kim, L Noci, A Orvieto, T Hofmann
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
102023
Revisiting the Role of Euler Numerical Integration on Acceleration and Stability in Convex Optimization
P Zhang, A Orvieto, H Daneshmand, T Hofmann, R Smith
International Conference on Artificial Intelligence and Statistics (2021), 2021
92021
On the second-order convergence properties of random search methods
A Lucchi, A Orvieto, A Solomou
Advances in Neural Information Processing Systems 34, 25633-25645, 2021
82021
On the Theoretical Properties of Noise Correlation in Stochastic Optimization
A Lucchi, F Proske, A Orvieto, F Bach, H Kersting
Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022
72022
Randomized Signature Layers for Signal Extraction in Time Series Data
E Monzio Compagnoni, L Biggio, A Orvieto, T Hofmann, J Teichmann
arXiv e-prints, arXiv: 2201.00384, 2022
7*2022
The system can't perform the operation now. Try again later.
Articles 1–20