Publications
[2025]
L. Carpentieri, A. De Caro, M. Salimi Beni, K. Fan and B. Cosenza, “Phase-Based Frequency Scaling for Energy-Efficient Heterogeneous Computing”, in IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPS), 2024. [Acc rate = 24%] [PDF | DOI | BibTeX]
M. Salimi Beni, R. Laso, B. Cosenza, S. Benkner, and S. Hunold, “Exploring NCCL tuning strategies for distributed deep learning”, in IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPS-W), 2024. [PDF | Slides | DOI | BibTeX]
M. Salimi Beni, R. Laso, B. Cosenza, S. Benkner, and S. Hunold, “Optimizing Distributed Deep Learning Training by Tuning NCCL,” in Proc. Austrian-Slovenian HPC Meeting (ASHPC25), Abstract, 2025. [PDF | DOI | BibTeX]
I. Vardas, R. Laso Rodriguez, and M. Salimi Beni, “ncclsee: A Lightweight Profiling Tool for NCCL,” in Proc. Austrian-Slovenian HPC Meeting (ASHPC25), Abstract, 2025. [PDF | DOI | BibTeX]
[2024]
M. Salimi Beni, B. Cosenza, and S. Hunold, “MPI Collective Algorithm Selection in the Presence of Process Arrival Patterns”, in IEEE International Conference on Cluster Computing (CLUSTER), 2024. [Acc rate = 26%] [PDF | Slides | DOI | BibTeX]
M. Salimi Beni, S. Hunold, and B. Cosenza, “Analysis and prediction of performance variability in large-scale computing systems,” The Journal of Supercomputing, 2024. [PDF | DOI | BibTeX]
[2023]
M. Salimi Beni, S. Hunold, and B. Cosenza, “Algorithm Selection of MPI Collectives Considering System Utilization,” in Euro-Par 2023: Parallel Processing Workshops, Springer, 2023. [PDF | Slides | BibTeX]
M. Salimi Beni, L. Crisci, and B. Cosenza, “EMPI: Enhanced Message Passing Interface in Modern C++,” in IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid), IEEE, 2023. [Acc rate = 20%] [PDF | DOI | Slides | BibTeX]
[2022]
[BEST PAPER AWARD] M. Salimi Beni and B. Cosenza, “An analysis of long-tailed network latency distribution and background traffic on dragonfly+,” in The 14th BenchCouncil International Symposium On Benchmarking, Measuring And Optimizing (Bench 2022), LNCS, 2022. [PDF | DOI | VIDEO | Slides | BibTeX]
L. Crisci, M. Salimi Beni, B. Cosenza, N. Scipione, D. Gadioli, E. Vitali, G. Palermo, and A. Beccari, “Towards a Portable Drug Discovery Pipeline with SYCL 2020,” in International Workshop on OpenCL, 2022. [PDF | DOI | BibTeX]
M. Salimi Beni and B. Cosenza, “An analysis of performance variability on dragonfly+ topology,” in IEEE International Conference on Cluster Computing (CLUSTER), IEEE, 2022. [PDF | DOI | POSTER | BibTeX]
[2021]
A. H. Sojoodi, M. Salimi Beni, and F. Khunjush, “Ignite-GPU: A GPU-enabled in-memory computing architecture on clusters,” The Journal of Supercomputing, 2021. [PDF | DOI | BibTeX]
[2020]