An architecture of Stampi: MPI library on a cluster of parallel computers T Imamura, Y Tsujita, H Koide, H Takemiya Recent Advances in Parallel Virtual Machine and Message Passing Interface …, 2000 | 137 | 2000 |

The 10,240‐member ensemble Kalman filtering with an intermediate AGCM T Miyoshi, K Kondo, T Imamura Geophysical Research Letters 41 (14), 5264-5271, 2014 | 110 | 2014 |

Development of a high performance eigensolver on the petascale next generation supercomputer system T Imamura, S Yamada, M Machida Progress in Nuclear Science and Technology 2, 643-650, 2011 | 70 | 2011 |

16.447 tflops and 159-billion-dimensional exact-diagonalization for trapped fermion-hubbard model on the earth simulator S Yamada, T Imamura, M Machida SC'05: Proceedings of the 2005 ACM/IEEE Conference on Supercomputing, 44-44, 2005 | 53 | 2005 |

Communication-overlap techniques for improved strong scaling of gyrokinetic Eulerian code beyond 100k cores on the K-computer Y Idomura, M Nakata, S Yamada, M Machida, T Imamura, T Watanabe, ... The International Journal of High Performance Computing Applications 28 (1 …, 2014 | 45 | 2014 |

High-performance computing for exact numerical approaches to quantum many-body problems on the earth simulator S Yamada, T Imamura, T Kano, M Machida Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, 47-es, 2006 | 36 | 2006 |

Parallel implementation of 3D FFT with volumetric decomposition schemes for efficient molecular dynamics simulations J Jung, C Kobayashi, T Imamura, Y Sugita Computer Physics Communications 200, 57-65, 2016 | 35 | 2016 |

Stampi-I/O: A flexible parallel-I/O library for heterogeneous computing environment Y Tsujita, T Imamura, H Takemiya, N Yamagishi European Parallel Virtual Machine/Message Passing Interface Users’ Group …, 2002 | 32 | 2002 |

A 1024-member ensemble data assimilation with 3.5-km mesh global weather simulations H Yashiro, K Terasaki, Y Kawai, S Kudo, T Miyoshi, T Imamura, K Minami, ... SC20: International Conference for High Performance Computing, Networking …, 2020 | 29 | 2020 |

Implementation of d-Spline-based incremental performance parameter estimation method with ppOpen-AT T Tanaka, R Otsuka, A Fujii, T Katagiri, T Imamura Scientific Programming 22 (4), 299-307, 2014 | 25 | 2014 |

Implementation and numerical techniques for one EFlop/s HPL-AI benchmark on Fugaku S Kudo, K Nitadori, T Ina, T Imamura 2020 IEEE/ACM 11th Workshop on Latest Advances in Scalable Algorithms for …, 2020 | 22 | 2020 |

MLPerf™ HPC: A holistic benchmark suite for scientific machine learning on HPC systems S Farrell, M Emani, J Balma, L Drescher, A Drozd, A Fink, G Fox, D Kanter, ... 2021 IEEE/ACM Workshop on Machine Learning in High Performance Computing …, 2021 | 21 | 2021 |

DGEMM using tensor cores, and its accurate and reproducible versions D Mukunoki, K Ozaki, T Ogita, T Imamura International Conference on High Performance Computing, 230-248, 2020 | 21 | 2020 |

HF-STEX and RASSCF calculations on nitrogen K-shell X-ray absorption of purine base and its derivative Y Mochizuki, H Koide, T Imamura, H Takemiya Journal of synchrotron radiation 8 (2), 1003-1005, 2001 | 19 | 2001 |

Prompt report on Exa-scale HPL-AI benchmark S Kudo, K Nitadori, T Ina, T Imamura 2020 IEEE International Conference on Cluster Computing (CLUSTER), 418-419, 2020 | 18 | 2020 |

Quantum synchronization effects in intrinsic Josephson junctions M Machida, T Kano, S Yamada, M Okumura, T Imamura, T Koyama Physica C: Superconductivity and its applications 468 (7-10), 689-694, 2008 | 18 | 2008 |

Grid computing supporting system on ITBL project K Higuchi, T Imamura, Y Suzuki, F Shimizu, M Machida, T Otani, ... High Performance Computing: 5th International Symposium, ISHPC 2003, Tokyo …, 2003 | 17 | 2003 |

Fast implementation of general matrix-vector multiplication (GEMV) on kepler GPUS D Mukunoki, T Imamura, D Takahashi 2015 23rd Euromicro International Conference on Parallel, Distributed, and …, 2015 | 15 | 2015 |

Performance evaluation of the Eigen Exa eigensolver on Oakleaf-FX: Tridiagonalization versus pentadiagonalization T Fukaya, T Imamura 2015 IEEE International Parallel and Distributed Processing Symposium …, 2015 | 12 | 2015 |

Performance analysis of the Householder-type parallel tall-skinny QR factorizations toward automatic algorithm selection T Fukaya, T Imamura, Y Yamamoto High Performance Computing for Computational Science--VECPAR 2014: 11th …, 2015 | 12 | 2015 |