Yujeong Choi
Title
Cited by
Year
Prema: A predictive multi-task scheduling algorithm for preemptible neural processing units
Y Choi, M Rhu
2020 IEEE International Symposium on High Performance Computer Architecture …, 2020
Cited by: 121; Year: 2020
Lazy batching: An SLA-aware batching system for cloud machine learning inference
Y Choi, Y Kim, M Rhu
2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021
Cited by: 53; Year: 2021
NeuMMU: Architectural support for efficient address translations in neural processing units
B Hyun, Y Kwon, Y Choi, J Kim, M Rhu
Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020
Cited by: 31; Year: 2020
PARIS and ELSA: An Elastic Scheduling Algorithm for Reconfigurable Multi-GPU Inference Servers
Y Kim, Y Choi, M Rhu
Proceedings of the 59th ACM/IEEE Design Automation Conference, 2022
Cited by: 7; Year: 2022