Publications

You can find the articles on Google Scholar, too.

YearTitle & AuthorsVenueLinks
2025Amirhossein Sojoodi, Amir Farazdaghi, Hamed Sharifian, Ryan E. Grant, Ahmad Afsahi
“Collaborative Bandwidth-Efficient Intra-Node Allreduce”
AsHES Workshop📄 DOI · 📥 PDF
📚 BIB · 📊 Slides
2024Yıltan Hassan Temucin, Whit Schonbein, Scott Levy, Amirhossein Sojoodi, Ryan E Grant, Ahmad Afsahi
“Design and Implementation of MPI-Native GPU-Initiated MPI Partitioned Communication”
ExaMPI Workshop📄 DOI · 📥 PDF
📚 BIB
2024Hamed Sharifian, Amirhossein Sojoodi, Ahmad Afsahi
“A Topology- and Load-Aware Design for Neighborhood Allgather”
IEEE CLUSTER📄 DOI · 📥 PDF
📚 BIB
2024Amirhossein Sojoodi, Yiltan Hassan Temucin, Ahmad Afsahi
“Enhancing Intra-Node GPU-to-GPU Performance in MPI + UCX through Multi-Path Communication”
🏆 Best Paper Award
ExHET Workshop📄 DOI · 📥 PDF
📚 BIB · 📊 Slides
2022Pedram Alizadeh, Amirhossein Sojoodi, Yiltan Hassan Temucin, Ahmad Afsahi
“Efficient Process Arrival Pattern Aware Collective Communication for Deep Learning”
EuroMPI📄 DOI · 📥 PDF
📚 BIB
2022Philipp A. Witte, Russell J. Hewett, Kumar Saurabh, Amirhossein Sojoodi, Ranveer Chandra
“SciAI4Industry - Solving PDEs for industry-scale problems with deep learning”
arXiv📄 DOI
2021Yiltan Hassan Temucin, Amirhossein Sojoodi, Pedram Alizadeh, Benjamin W Kitor, Ahmad Afsahi
“Accelerating Deep Learning using Interconnect-Aware UCX Communication for MPI Collectives”
IEEE Micro📄 DOI · 📥 PDF
📚 BIB
2021Yiltan Hassan Temucin, Amirhossein Sojoodi, Pedram Alizadeh, Ahmad Afsahi
“Efficient Multi-Path NVLink / PCIe-Aware UCX based Collective Communication for Deep Learning”
IEEE HOTI📄 DOI · 📥 PDF
📚 BIB
2020Majid Salimi Beni, Amirhossein Sojoodi, Farshad Khunjush
“A GPU-Enabled Extension for Apache Ignite to Facilitate Running Genetic Algorithms”
CADS Symposium📄 DOI · 📥 PDF
📚 BIB
2020Amirhossein Sojoodi, Majid Salimi Beni, Farshad Khunjush
“Ignite-GPU: a GPU-enabled in-memory computing architecture on clusters”
Journal of Supercomputing📄 DOI · 📥 PDF
📚 BIB