Sehoon Kim
Cited by
Cited by
A survey of quantization methods for efficient neural network inference
A Gholami, S Kim, Z Dong, Z Yao, MW Mahoney, K Keutzer
arXiv preprint arXiv:2103.13630, 2021
I-BERT: Integer-only BERT quantization
S Kim, A Gholami, Z Yao, MW Mahoney, K Keutzer
International conference on machine learning, 5506-5518, 2021
AI and Memory Wall
A Gholami, Z Yao, S Kim, M Mahoney, K Keutzer
RiseLab Blog Post,, 2021
Applications and techniques for fast machine learning in science
AMC Deiana, N Tran, J Agar, M Blott, G Di Guglielmo, J Duarte, P Harris, ...
Frontiers in big Data 5, 2022
Hessian-aware pruning and optimal neural implant
S Yu, Z Yao, A Gholami, Z Dong, S Kim, MW Mahoney, K Keutzer
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022
Learned Token Pruning for Transformers
S Kim, S Shen, D Thorsley, A Gholami, W Kwon, J Hassoun, K Keutzer
arXiv preprint arXiv:2107.00910, 2021
WindTunnel: towards differentiable ML pipelines beyond a single model
GI Yu, S Amizadeh, S Kim, A Pagnoni, C Zhang, BG Chun, M Weimer, ...
Proceedings of the VLDB Endowment 15 (1), 11-20, 2021
Integer-Only Zero-Shot Quantization for Efficient Speech Recognition
S Kim, A Gholami, Z Yao, N Lee, P Wang, A Nrusimha, B Zhai, T Gao, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
S Kim, A Gholami, A Shaw, N Lee, K Mangalam, J Malik, MW Mahoney, ...
arXiv preprint arXiv:2206.00888, 2022
A Fast Post-Training Pruning Framework for Transformers
W Kwon, S Kim, MW Mahoney, J Hassoun, K Keutzer, A Gholami
arXiv preprint arXiv:2204.09656, 2022
Terra: Imperative-Symbolic Co-Execution of Imperative Deep Learning Programs
T Kim, E Jeong, GW Kim, Y Koo, S Kim, G Yu, BG Chun
Advances in Neural Information Processing Systems 34, 1468-1480, 2021
Memory-Efficient Hardware Performance Counters with Approximate-Counting Algorithms
J Xu, S Kim, B Nikolic, YS Shao
2021 IEEE International Symposium on Performance Analysis of Systems and …, 2021
The system can't perform the operation now. Try again later.
Articles 1–12