Roundoff error analysis of the CholeskyQR2 algorithm Y Yamamoto, Y Nakatsukasa, Y Yanagisawa, T Fukaya Electron. Trans. Numer. Anal 44 (01), 306-326, 2015 | 62 | 2015 |
Shifted Cholesky QR for Computing the QR Factorization of Ill-Conditioned Matrices T Fukaya, R Kannan, Y Nakatsukasa, Y Yamamoto, Y Yanagisawa SIAM Journal on Scientific Computing 42 (1), A477-A503, 2020 | 52 | 2020 |
CholeskyQR2: a simple and communication-avoiding algorithm for computing a tall-skinny QR factorization on a large-scale parallel system T Fukaya, Y Nakatsukasa, Y Yanagisawa, Y Yamamoto 2014 5th Workshop on Latest Advances in Scalable Algorithms for Large-Scale …, 2014 | 52 | 2014 |
Accelerating the singular value decomposition of rectangular matrices with the CSX600 and the integrable SVD Y Yamamoto, T Fukaya, T Uneyama, M Takata, K Kimura, M Iwasaki, ... International Conference on Parallel Computing Technologies, 340-345, 2007 | 22 | 2007 |
Performance evaluation of the Eigen Exa eigensolver on Oakleaf-FX: tridiagonalization versus pentadiagonalization T Fukaya, T Imamura 2015 IEEE International Parallel and Distributed Processing Symposium …, 2015 | 14 | 2015 |
A dynamic programming approach to optimizing the blocking strategy for the Householder QR decomposition T Fukaya, Y Yamamoto, SL Zhang 2008 IEEE International Conference on Cluster Computing, 402-410, 2008 | 13 | 2008 |
Roundoff error analysis of the CholeskyQR2 algorithm in an oblique inner product Y Yamamoto, Y Nakatsukasa, Y Yanagisawa, T Fukaya JSIAM Letters 8, 5-8, 2016 | 12 | 2016 |
Performance analysis of the Householder-type parallel tall-skinny QR factorizations toward automatic algorithm selection T Fukaya, T Imamura, Y Yamamoto International Conference on High Performance Computing for Computational …, 2014 | 12 | 2014 |
Effect of Mixed Precision Computing on H-Matrix Vector Multiplication in BEM Analysis R Ooi, T Iwashita, T Fukaya, A Ida, R Yokota Proceedings of the International Conference on High Performance Computing in …, 2020 | 11 | 2020 |
Performance analysis of the Chebyshev basis conjugate gradient method on the K computer Y Kumagai, A Fujii, T Tanaka, Y Hirota, T Fukaya, T Imamura, R Suda International Conference on Parallel Processing and Applied Mathematics, 74-85, 2015 | 10 | 2015 |
Acceleration of Hessenberg reduction for nonsymmetric eigenvalue problems in a hybrid CPU-GPU computing environment J Muramatsu, T Fukaya, SL Zhang, K Kimura, Y Yamamoto International Journal of Networking and Computing 1 (2), 132-143, 2011 | 10 | 2011 |
Differential qd algorithm for totally nonnegative band matrices: convergence properties and error analysis Y Yamamoto, T Fukaya JSIAM Letters 1, 56-59, 2009 | 10 | 2009 |
Time-space tiling with tile-level parallelism for the 3D FDTD method T Fukaya, T Iwashita Proceedings of the International Conference on High Performance Computing in …, 2018 | 9 | 2018 |
Differential qd algorithm for totally nonnegative Hessenberg matrices: introduction of origin shifts and relationship with the discrete hungry Lotka-Volterra system Y Yamamoto, T Fukaya JSIAM Letters 2, 69-72, 2010 | 9 | 2010 |
EigenExa: high performance dense eigensolver, present and future T Imamura, Y Hirota, T Fukaya, S Yamada, M Machida 8th International Workshop on Parallel Matrix Algorithms and Applications …, 2014 | 8 | 2014 |
Hierarchical block multi-color ordering: A new parallel ordering method for vectorization and parallelization of the sparse triangular solver in the ICCG method T Iwashita, S Li, T Fukaya CCF Transactions on High Performance Computing 2 (2), 84-97, 2020 | 6 | 2020 |
A case study on modeling the performance of dense matrix computation: Tridiagonalization in the EigenExa eigensolver on the K computer T Fukaya, T Imamura, Y Yamamoto 2018 IEEE International Parallel and Distributed Processing Symposium …, 2018 | 6 | 2018 |
Accelerating the SpMV kernel on standard CPUs by exploiting the partially diagonal structures T Fukaya, K Ishida, A Miura, T Iwashita, H Nakashima arXiv preprint arXiv:2105.04937, 2021 | 5 | 2021 |
EigenKernel K Tanaka, H Imachi, T Fukumoto, A Kuwata, Y Harada, T Fukaya, ... Japan Journal of Industrial and Applied Mathematics 36 (2), 719-742, 2019 | 5 | 2019 |
On constructing cost models for online automatic tuning using ATMathCoreLib: Case studies through the SVD computation on a multicore processor S Nagashima, T Fukaya, Y Yamamoto 2016 IEEE 10th International Symposium on Embedded Multicore/Many-core …, 2016 | 5 | 2016 |