Holger Fröning, Prof. Dr.
Holger Fröning is a full professor and leads the Computing Systems Group at the Institute of Computer Engineering at Heidelberg University. His research interests focus on embedded machine learning and high-performance computing, and include hardware and software architectures, programmability, co-design, data movement optimizations, and associated power and energy aspects. From 2011 to 2018, he was an associate professor at the same university. Since 2023, he has been managing director of the Institute of Computer Engineering. From 2019 to 2022, he was Dean of Studies for Computer Science at Heidelberg University. In 2016, he was with NVIDIA Research (Santa Clara, CA, US) as a visiting scientist, sponsored by Bill Dally. In early 2015, he was a visiting professor at the Graz University of Technology (Austria), sponsored by Gernot Kubin. From 2008 to 2011, he reported to Jose Duato at the Technical University of Valencia (Spain). He received his PhD and MSc degrees in 2007 and 2001, respectively, from the University of Mannheim, Germany.
In 2023, he was named visiting professor at Xi’an University of Technology. In 2021, he was awarded a visiting scientist position at the Chinese Academy of Sciences. In 2014, he received the prestigious Google Faculty Research Award. Five of his publications have received a best paper award (IPDPS, ICPP, among others), and parts of his research results have been commercialized.
He co-organizes the Workshop on Embedded Machine Learning (WEML) and the Workshop on IoT, Edge, and Mobile for Embedded Machine Learning (ITEM) on a regular basis. He was local co-chair for IEEE CLUSTER 2022, chaired tracks for EuroPar 2015 and the International Supercomputer Conference 2017, and recently served as program committee member for IPDPS2024/21, CCGRID2023/20, SC2017, ICPP2023/22/21, FPL2023/22/21, and EuroPar 2024. He was a reviewer for ECAI2023 and ECML2023. He frequently reviews for established journals, such as IEEE Micro, TPDS, and JPDC. His recent sponsors include DFG, FWF, FFG, Carl-Zeiss Foundation, NVIDIA, SAP, and XILINX.
Research interests
- HW for artificial intelligence
- Embedded machine learning - resource-efficient neural networks and beyond
- Computer architecture and interconnection networks
- High-performance computing, machine learning, data analytics
Recent news (2-year horizon)
- 09/2024: Invited talk at IEEE SOCC, Dresden, Germany, on ‘Towards Enhanced Resource Efficiency in Large Language Models’
- 08/2024: Invited talk at Huawei Research, Shenzhen, on ‘On Accelerating Deep and Bayesian Neural Architectures’
- 08/2024: Invited talk at National Supercomputing Center in Shenzhen, on ‘Towards Enhanced Resource Efficiency in Large Language Models’
- 06/2024: Invited talk at National Supercomputing Center in Shenzhen, on ‘On Accelerating Deep and Bayesian Neural Architectures’, sponsored by Haohuan Fu
- 05/2024: Invited talk at Chinese Academy of Sciences, Beijing, on ‘Charting the Course: Future Directions on the Intersection of Hardware Systems and Artificial Intelligence’
- 05/2024: Invited talk at Tsinghua University, Beijing, on ‘On Accelerating Deep and Bayesian Neural Architectures’, sponsored by Yu Wang
- 04/2024: Invited to Dagstuhl Seminar “Hardware Support for Cloud Database Systems in the Post-Moore’s Law Era (24162)”, co-organized by David F. Bacon, Carsten Binnig, David A. Patterson, and Margo Seltzer
- 02/2024: Invited talk at Rotary Club Heidelberg-Schloss on ‘Tiefe Neuronale Netzwerke aus Sicht von Wissenschaft und Technik’
- 01/2024: Invited talk at hessian.ai/TU Darmstadt - “On Accelerating Deep and Bayesian Neural Architectures”
- 11/2023: Invited talk at STRUCTURES - “On Accelerating Deep and Bayesian Neural Architectures”
- 05/2023: Guest lecture at Xi’an University of Technology, China, on ‘Charting the Course: Future Directions on the Intersection of Hardware Systems and Artificial Intelligence’
General information
- Short CV: pdf
- Bio: please use text above
- Photo: jpg - copyright “Universität Heidelberg, Kommunikation und Marketing”
Awards
- 2023: Visiting Professor at Xi’an University of Technology, China
- 2023: Best paper award, 2nd Practical-DL Workshop, AAAI Conference on Artificial Intelligence
- 2021: Visiting Scientist at the Chinese Academy of Sciences (CAS), funded by President’s International Fellowship Initiative (PIFI 2020)
- 2020: Best paper finalist, Int. Conf. on Machine Learning, Optimization, and Data Science (LOD)
- 2017: Best paper finalist, International Supercomputer Conference (ISC)
- 2017: Best paper award, 31st International Parallel and Distributed Processing Symposium
- 2014: Google Faculty Research Award
- 2010: Best paper award, 9th International Conference on Networks
- 2009: Best paper award, 5th International Workshop on Applied Reconfigurable Computing
- 2008: Best paper award, 37th International Conference on Parallel Processing (ICPP)
Recent Service (4-year horizon)
Reviewer/Technical Program Committee (TPC)
- 2025: TPC member, IEEE International Parallel & Distributed Processing Symposium (IPDPS)
- 2024: TPC member, International Conference on Computer Design (ICCD)
- 2024: TPC member, European Conference on Machine Learning (ECML)
- 2024: TPC member, International Conference on Field-Programmable Logic and Applications (FPL)
- 2024: Reviewer, ACM Transactions on Embedded Computing Systems (TECS)
- 2024: Reviewer, Journal of Parallel and Distributed Computing (JPDC)
- 2024: TPC member, IEEE International Parallel & Distributed Processing Symposium (IPDPS)
- 2024: Reviewer, International Conference on Pattern Recognition (ICPR)
- 2024: TPC member, International Conference on Supercomputing (ICS)
- 2024: TPC member, International European Conference on Parallel and Distributed Computing (Euro-PAR)
- 2023: TPC member, International Conference on Computer Design (ICCD)
- 2023: Reviewer, European Conference on Artificial Intelligence (ECAI)
- 2023: Reviewer, European Conference on Machine Learning (ECML)
- 2023: Reviewer, Journal of Parallel and Distributed Computing (JPDC)
- 2023: TPC member, IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID)
- 2023: TPC member, International Conference on Parallel Processing (ICPP)
- 2023: TPC member, International Conference on Field-Programmable Logic and Applications (FPL)
- 2022: TPC member, International Conference on Parallel Processing (ICPP)
- 2022: TPC member, International Conference on Field-Programmable Logic and Applications (FPL)
- 2022: Reviewer, Transactions on Parallel and Distributed Systems (TPDS)
- 2021: TPC member, IEEE International Parallel & Distributed Processing Symposium (IPDPS)
- 2021: TPC member, International Conference on Parallel Processing (ICPP)
- 2021: TPC member, International Conference on Field-Programmable Logic and Applications (FPL)
- 2020: TPC member, IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID)
- 2020: Reviewer, IEEE MICRO
Local Co-Chair
- 2022: IEEE Cluster Conference
TPC track chair
- 2021: Special session chair, International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART2021)
- 2017: International Supercomputer Conference (ISC)
- 2015: International European Conference on Parallel and Distributed Computing (Euro-PAR)
Organizer
- Since 2018: Workshop Series on Embedded Machine Learning (WEML)
- Since 2020: Workshop Series on IoT, Edge, and Mobile for Embedded Machine Learning (ITEM)
- 2024, 2023: Workshop on FPGA/xPU Accelerators for Future HPC and Datacenter (F4HD)
Recent Teaching (4-year horizon)
Summer term 2024
- On leave (sabbatical) - please check teaching subsite for replacements
Winter term 2023/24
- Lecturer - graduate course “High Performance and Distributed Computing (2+2)” [LSF] [Moodle]
- Lecturer - graduate course “GPU Computing - Architecture and Programming (2+2)” [LSF] [Moodle]
- Organizer - seminar “Machine Learning Accelerators” [LSF]
- Organizer - undergraduate course “Einführung in das Textsatzsystem LaTeX” [LSF]
Summer term 2023
- Lecturer - graduate course “Embedded Machine Learning (2+2)” [LSF] [Moodle]
- Organizer - seminar “Robust Machine Learning” [LSF]
Winter term 2022/2023
- Lecturer - graduate course “Introduction to High Performance Computing (2+2)” [LSF] [Moodle]
- Organizer - seminar “Resilient Machine Learning” [LSF]
Summer term 2022
- Organizer - seminar “Probabilistic Programming Languages” [LSF]
Winter term 2021/2022
- Lecturer - graduate course “GPU Computing (2+2)” [LSF]
- Organizer - seminar “Quantum Computing” [LSF]
- Co-Lecturer - undergraduate course “Einführung in die Technische Informatik” [LSF]
Summer term 2021
- Lecturer - graduate course “Advanced Parallel Computing (2+2)” [LSF] [github]
- Organizer - seminar “Considered Harmful” [LSF]
- Organizer - undergraduate course “Einführung in das Textsatzsystem LaTeX” [LSF]
- Co-Lecturer - lecture series “Data Science and Health” [LSF]
Winter term 2020/2021
- Lecturer - graduate course “GPU Computing (2+2)” [LSF]
- Lecturer - graduate course “Introduction to High-Performance Computing (2+2)” [LSF]
- Organizer - seminar “Specialized Machine-Learning Frameworks” [LSF]
- Organizer - undergraduate course “Einführung in das Textsatzsystem LaTeX” [LSF]
- Co-Lecturer - lecture series “Data Science and Health” [LSF]
Summer term 2020
- Organizer - seminar “Indoor Localization, Mapping and Short-Range Communication” [LSF]
- Co-Lecturer - lecture series “Data Science and Health” [LSF]
Publications
2024
- Less Memory Means smaller GPUs: Backpropagation with Compressed Activations. CoRR, abs/2409.11902, 2024
@article{barley2024, author = {Barley, Daniel and Fr{{\"o}}ning, Holger}, title = {Less Memory Means smaller GPUs: Backpropagation with Compressed Activations}, year = {2024}, volume = {abs/2409.11902}, journal = {CoRR}, url = {https://arxiv.org/abs/2409.11902}, }
- Resource-Efficient Neural Networks for Embedded Systems. Journal of Machine Learning Research, 25(50), 1–51, 2024
@article{JMLR:v25:18-566, author = {Roth, Wolfgang and Schindler, G{{\"u}}nther and Klein, Bernhard and Peharz, Robert and Tschiatschek, Sebastian and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Ghahramani, Zoubin}, title = {Resource-Efficient Neural Networks for Embedded Systems}, journal = {Journal of Machine Learning Research}, year = {2024}, volume = {25}, number = {50}, pages = {1--51}, url = {http://jmlr.org/papers/v25/18-566.html}, }
- Walking Noise: On Layer-Specific Robustness of Neural Architectures against Noisy Computations and Associated Characteristic Learning Dynamics. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), 2024
@inproceedings{borras2024, title = {Walking Noise: On Layer-Specific Robustness of Neural Architectures against Noisy Computations and Associated Characteristic Learning Dynamics}, author = {Borras, Hendrik and Klein, Bernhard and Fr{\"{o}}ning, Holger}, booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases}, year = {2024}, series = {ECML-PKDD}, url = {https://doi.org/10.1007/978-3-031-70359-1_3}, }
- Function Space Diversity for Uncertainty Prediction via Repulsive Last-Layer Ensembles. ICML 2024 Workshop on Structured Probabilistic Inference & Generative Modeling, 2024
@inproceedings{steger2024function, title = {Function Space Diversity for Uncertainty Prediction via Repulsive Last-Layer Ensembles}, author = {Steger, Sophie and Knoll, Christian and Klein, Bernhard and Fr{\"o}ning, Holger and Pernkopf, Franz}, booktitle = {ICML 2024 Workshop on Structured Probabilistic Inference {\&} Generative Modeling}, year = {2024}, url = {https://openreview.net/forum?id=FbMN9HjgHI}, }
- Probabilistic Photonic Computing with Chaotic Light. CoRR, abs/2401.17915, 2024
@article{brckerhoffplckelmann2024probabilistic, title = {Probabilistic Photonic Computing with Chaotic Light}, author = {Brückerhoff-Plückelmann, Frank and Borras, Hendrik and Klein, Bernhard and Varri, Akhil and Becker, Marlon and Dijkstra, Jelle and Brückerhoff, Martin and Wright, C. David and Salinga, Martin and Bhaskaran, Harish and Risse, Benjamin and Fr{\"o}ning, Holger and Pernice, Wolfram}, year = {2024}, volume = {abs/2401.17915}, journal = {CoRR}, url = {https://arxiv.org/abs/2401.17915}, }
- DeepHYDRA: A Hybrid Deep Learning and DBSCAN-Based Approach to Time-Series Anomaly Detection in Dynamically-Configured Systems. 38th ACM International Conference on Supercomputing (ICS), 272–285, Association for Computing Machinery, 2024
@inproceedings{10.1145/3650200.3656637, author = {Stehle, Franz Kevin and Vandelli, Wainer and Zahn, Felix and Avolio, Giuseppe and Fr\"{o}ning, Holger}, title = {DeepHYDRA: A Hybrid Deep Learning and DBSCAN-Based Approach to Time-Series Anomaly Detection in Dynamically-Configured Systems}, year = {2024}, isbn = {9798400706103}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, doi = {10.1145/3650200.3656637}, booktitle = {38th ACM International Conference on Supercomputing}, pages = {272–285}, numpages = {14}, location = {Kyoto, Japan}, series = {ICS}, }
- GraphScale: Scalable Processing on FPGAs for HBM and Large Graphs. ACM Trans. Reconfigurable Technol. Syst., 17(2), Association for Computing Machinery, 2024
@article{10.1145/3616497, author = {Dann, Jonas and Ritter, Daniel and Fr\"{o}ning, Holger}, title = {GraphScale: Scalable Processing on FPGAs for HBM and Large Graphs}, year = {2024}, issue_date = {June 2024}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, volume = {17}, number = {2}, issn = {1936-7406}, url = {https://doi.org/10.1145/3616497}, doi = {10.1145/3616497}, journal = {ACM Trans. Reconfigurable Technol. Syst.}, month = mar, articleno = {22}, numpages = {23}, keywords = {FPGA, Graph processing, HBM}, }
- Random telegraph noise characteristic of nonvolatile resistive random access memories based on optical interference principle. Japanese Journal of Applied Physics, 63(3), 031003, IOP Publishing, 2024
@article{Qin_2024, doi = {10.35848/1347-4065/ad26d1}, url = {https://dx.doi.org/10.35848/1347-4065/ad26d1}, year = {2024}, month = mar, publisher = {IOP Publishing}, volume = {63}, number = {3}, pages = {031003}, author = {Qin, Sichen and Zhang, Guiquan and Zhang, Jia-Wei and Zhao, Yu and Song, Chen and Emonds, Yannick and Fröning, Holger}, title = {Random telegraph noise characteristic of nonvolatile resistive random access memories based on optical interference principle}, journal = {Japanese Journal of Applied Physics}, }
- GraphMatch: Subgraph Query Processing on FPGAs. CoRR, abs/2402.17559, 2024
@article{dann2024graphmatch, title = {GraphMatch: Subgraph Query Processing on FPGAs}, author = {Dann, Jonas and Götz, Tobias and Ritter, Daniel and Giceva, Jana and Fröning, Holger}, year = {2024}, volume = {abs/2402.17559}, journal = {CoRR}, url = {https://arxiv.org/abs/2402.17559}, doi = {10.48550/ARXIV.2402.17559}, }
- Implications of Noise in Resistive Memory on Deep Neural Networks for Image Classification. CoRR, abs/2401.05820, 2024
@article{DBLP:journals/corr/abs-2401-05820, author = {Emonds, Yannick and Xi, Kai and Fröning, Holger}, title = {Implications of Noise in Resistive Memory on Deep Neural Networks for Image Classification}, journal = {CoRR}, volume = {abs/2401.05820}, year = {2024}, url = {https://arxiv.org/abs/2401.05820}, doi = {10.48550/ARXIV.2401.05820}, eprinttype = {arXiv}, eprint = {2401.05820}, timestamp = {Thu, 25 Jan 2024 00:00:00 +0100}, }
2023
- Characterization of data compression across CPU platforms and accelerators. Concurr. Comput. Pract. Exp., 35(20), 2023
@article{DBLP:journals/concurrency/PrombergerSF23, author = {Promberger, Laura and Schwemmer, Rainer and Fr{\"{o}}ning, Holger}, title = {Characterization of data compression across {CPU} platforms and accelerators}, journal = {Concurr. Comput. Pract. Exp.}, volume = {35}, number = {20}, year = {2023}, url = {https://doi.org/10.1002/cpe.6465}, doi = {10.1002/CPE.6465}, timestamp = {Thu, 14 Sep 2023 01:00:00 +0200}, }
- Non-relational Databases on FPGAs: Survey, Design Decisions, Challenges. ACM Comput. Surv., 55(11), 225:1–225:37, 2023
@article{DBLP:journals/csur/DannRF23, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, title = {Non-relational Databases on FPGAs: Survey, Design Decisions, Challenges}, journal = {{ACM} Comput. Surv.}, volume = {55}, number = {11}, pages = {225:1--225:37}, year = {2023}, url = {https://doi.org/10.1145/3568990}, doi = {10.1145/3568990}, timestamp = {Fri, 02 Jun 2023 01:00:00 +0200}, }
- CUDAsap: Statically-Determined Execution Statistics as Alternative to Execution-Based Profiling. 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID), 119–130, IEEE, 2023
@inproceedings{DBLP:conf/ccgrid/EmondsBF23, author = {Emonds, Yannick and Braun, Lorenz and Fr{\"{o}}ning, Holger}, editor = {Simmhan, Yogesh and Altintas, Ilkay and Varbanescu, Ana Lucia and Balaji, Pavan and Prasad, Abhinandan S. and Carnevale, Lorenzo}, title = {CUDAsap: Statically-Determined Execution Statistics as Alternative to Execution-Based Profiling}, booktitle = {23rd {IEEE/ACM} International Symposium on Cluster, Cloud and Internet Computing}, address = {Bangalore, India}, series = {CCGRID}, pages = {119--130}, publisher = {{IEEE}}, year = {2023}, url = {https://doi.org/10.1109/CCGrid57682.2023.00021}, doi = {10.1109/CCGRID57682.2023.00021}, timestamp = {Fri, 21 Jul 2023 22:25:52 +0200}, }
- Implementation Techniques for SPMD Kernels on CPUs. International Workshop on OpenCL, IWOCL 2023, Cambridge, United Kingdom, April 18-20, 2023, 1:1–1:12, ACM, 2023
@inproceedings{DBLP:conf/iwocl/0003AHFH23, author = {Meyer, Joachim and Alpay, Aksel and Hack, Sebastian and Fr{\"{o}}ning, Holger and Heuveline, Vincent}, title = {Implementation Techniques for {SPMD} Kernels on CPUs}, booktitle = {International Workshop on OpenCL, {IWOCL} 2023, Cambridge, United Kingdom, April 18-20, 2023}, pages = {1:1--1:12}, publisher = {{ACM}}, year = {2023}, url = {https://doi.org/10.1145/3585341.3585342}, doi = {10.1145/3585341.3585342}, timestamp = {Sat, 29 Apr 2023 01:00:00 +0200}, }
- Reducing Memory Requirements for the IPU using Butterfly Factorizations. SC ’23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, SC-W 2023, Denver, CO, USA, November 12-17, 2023, 1255–1263, ACM, 2023
@inproceedings{DBLP:conf/sc/ShekoftehAF23, author = {Shekofteh, S. Kazem and Alles, Christian and Fr{\"{o}}ning, Holger}, title = {Reducing Memory Requirements for the {IPU} using Butterfly Factorizations}, booktitle = {{SC} '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, {SC-W} 2023, Denver, CO, USA, November 12-17, 2023}, pages = {1255--1263}, publisher = {{ACM}}, year = {2023}, url = {https://doi.org/10.1145/3624062.3624196}, doi = {10.1145/3624062.3624196}, timestamp = {Tue, 28 Nov 2023 00:00:00 +0100}, }
- On the Non-Associativity of Analog Computations. CoRR, abs/2309.14292, 2023
@article{DBLP:journals/corr/abs-2309-14292, author = {Kuhn, Lisa and Klein, Bernhard and Fr{\"{o}}ning, Holger}, title = {On the Non-Associativity of Analog Computations}, journal = {CoRR}, volume = {abs/2309.14292}, year = {2023}, url = {https://arxiv.org/abs/2309.14292}, doi = {10.48550/ARXIV.2309.14292}, eprinttype = {arXiv}, eprint = {2309.14292}, timestamp = {Wed, 27 Sep 2023 01:00:00 +0200}, }
- On Performance Analysis of Graphcore IPUs: Analyzing Squared and Skewed Matrix Multiplication. CoRR, abs/2310.00256, 2023
@article{DBLP:journals/corr/abs-2310-00256, author = {Shekofteh, S. Kazem and Alles, Christian and Kochend{\"{o}}rfer, Nils and Fr{\"{o}}ning, Holger}, title = {On Performance Analysis of Graphcore IPUs: Analyzing Squared and Skewed Matrix Multiplication}, journal = {CoRR}, volume = {abs/2310.00256}, year = {2023}, url = {https://arxiv.org/abs/2310.00256}, doi = {10.48550/ARXIV.2310.00256}, eprinttype = {arXiv}, eprint = {2310.00256}, timestamp = {Wed, 18 Oct 2023 01:00:00 +0200}, }
- Compressing the Backward Pass of Large-Scale Neural Architectures by Structured Activation Pruning. CoRR, abs/2311.16883, 2023
@article{DBLP:journals/corr/abs-2311-16883, author = {Barley, Daniel and Fr{\"{o}}ning, Holger}, title = {Compressing the Backward Pass of Large-Scale Neural Architectures by Structured Activation Pruning}, journal = {CoRR}, volume = {abs/2311.16883}, year = {2023}, url = {https://arxiv.org/abs/2311.16883}, doi = {10.48550/ARXIV.2311.16883}, eprinttype = {arXiv}, eprint = {2311.16883}, timestamp = {Mon, 04 Dec 2023 00:00:00 +0100}, }
2022
- Joint Program and Layout Transformations to Enable Convolutional Operators on Specialized Hardware Based on Constraint Programming. ACM Trans. Archit. Code Optim., 19(1), 7:1–7:26, 2022
@article{DBLP:journals/taco/RieberAF22, author = {Rieber, Dennis and Acosta, Axel and Fr{\"{o}}ning, Holger}, title = {Joint Program and Layout Transformations to Enable Convolutional Operators on Specialized Hardware Based on Constraint Programming}, journal = {{ACM} Trans. Archit. Code Optim.}, volume = {19}, number = {1}, pages = {7:1--7:26}, year = {2022}, url = {https://doi.org/10.1145/3487922}, doi = {10.1145/3487922}, timestamp = {Mon, 28 Aug 2023 01:00:00 +0200}, }
- PipeJSON: Parsing JSON at Line Speed on FPGAs. International Conference on Management of Data, DaMoN 2022, Philadelphia, PA, USA, 13 June 2022, 3:1–3:7, ACM, 2022
@inproceedings{DBLP:conf/damon/DannW0FF22, author = {Dann, Jonas and Wagner, Royden and Ritter, Daniel and Faerber, Christian and Fr{\"{o}}ning, Holger}, editor = {Blanas, Spyros and May, Norman}, title = {PipeJSON: Parsing {JSON} at Line Speed on FPGAs}, booktitle = {International Conference on Management of Data, DaMoN 2022, Philadelphia, PA, USA, 13 June 2022}, pages = {3:1--3:7}, publisher = {{ACM}}, year = {2022}, url = {https://doi.org/10.1145/3533737.3535094}, doi = {10.1145/3533737.3535094}, timestamp = {Wed, 15 Jun 2022 13:47:16 +0200}, }
- GraphScale: Scalable Bandwidth-Efficient Graph Processing on FPGAs. 32nd International Conference on Field-Programmable Logic and Applications, FPL 2022, Belfast, United Kingdom, August 29 - Sept. 2, 2022, 24–32, IEEE, 2022
@inproceedings{DBLP:conf/fpl/Dann0F22, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, title = {GraphScale: Scalable Bandwidth-Efficient Graph Processing on FPGAs}, booktitle = {32nd International Conference on Field-Programmable Logic and Applications, {FPL} 2022, Belfast, United Kingdom, August 29 - Sept. 2, 2022}, pages = {24--32}, publisher = {{IEEE}}, year = {2022}, url = {https://doi.org/10.1109/FPL57034.2022.00016}, doi = {10.1109/FPL57034.2022.00016}, timestamp = {Mon, 20 Feb 2023 17:38:16 +0100}, }
- Compiler-aided nd-range parallel-for implementations on CPU in hipSYCL. IWOCL’22: International Workshop on OpenCL, Bristol, United Kingdom, May 10 - 12, 2022, 28:1–28:3, ACM, 2022
@inproceedings{DBLP:conf/iwocl/0003AFH22, author = {Meyer, Joachim and Alpay, Aksel and Fr{\"{o}}ning, Holger and Heuveline, Vincent}, title = {Compiler-aided nd-range parallel-for implementations on {CPU} in hipSYCL}, booktitle = {IWOCL'22: International Workshop on OpenCL, Bristol, United Kingdom, May 10 - 12, 2022}, pages = {28:1--28:3}, publisher = {{ACM}}, year = {2022}, url = {https://doi.org/10.1145/3529538.3530216}, doi = {10.1145/3529538.3530216}, timestamp = {Mon, 26 Jun 2023 01:00:00 +0200}, }
- HW-Aware Initialization of DNN Auto-Tuning to Improve Exploration Time and Robustness. CoRR, abs/2205.15568, 2022
@article{DBLP:journals/corr/abs-2205-15568, author = {Rieber, Dennis and Reiber, Moritz and Bringmann, Oliver and Fr{\"{o}}ning, Holger}, title = {HW-Aware Initialization of {DNN} Auto-Tuning to Improve Exploration Time and Robustness}, journal = {CoRR}, volume = {abs/2205.15568}, year = {2022}, url = {https://arxiv.org/abs/2205.15568}, doi = {10.48550/ARXIV.2205.15568}, eprinttype = {arXiv}, eprint = {2205.15568}, timestamp = {Wed, 01 Jun 2022 01:00:00 +0200}, }
- Towards Hardware-Specific Automatic Compression of Neural Networks. CoRR, abs/2212.07818, 2022
@article{DBLP:journals/corr/abs-2212-07818, author = {Krieger, Torben and Klein, Bernhard and Fr{\"{o}}ning, Holger}, title = {Towards Hardware-Specific Automatic Compression of Neural Networks}, journal = {CoRR}, volume = {abs/2212.07818}, year = {2022}, url = {https://arxiv.org/abs/2212.07818}, doi = {10.48550/ARXIV.2212.07818}, eprinttype = {arXiv}, eprint = {2212.07818}, timestamp = {Mon, 02 Jan 2023 00:00:00 +0100}, }
2021
- A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of GPU Kernels. ACM Trans. Archit. Code Optim., 18(1), 7:1–7:25, 2021
@article{DBLP:journals/taco/BraunNSHF21, author = {Braun, Lorenz and Nikas, Sotirios and Song, Chen and Heuveline, Vincent and Fr{\"{o}}ning, Holger}, title = {A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of {GPU} Kernels}, journal = {{ACM} Trans. Archit. Code Optim.}, volume = {18}, number = {1}, pages = {7:1--7:25}, year = {2021}, url = {https://doi.org/10.1145/3431731}, doi = {10.1145/3431731}, timestamp = {Sat, 30 Sep 2023 01:00:00 +0200}, }
- Exploring Memory Access Patterns for Graph Processing Accelerators. Datenbanksysteme für Business, Technologie und Web (BTW 2021), 19. Fachtagung des GI-Fachbereichs ,,Datenbanken und Informationssysteme" (DBIS), 13.-17. September 2021, Dresden, Germany, Proceedings (LNI), P-311, 101–122, Gesellschaft für Informatik, Bonn, 2021
@inproceedings{DBLP:conf/btw/Dann0F21, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, editor = {Sattler, Kai{-}Uwe and Herschel, Melanie and Lehner, Wolfgang}, title = {Exploring Memory Access Patterns for Graph Processing Accelerators}, booktitle = {Datenbanksysteme f{\"{u}}r Business, Technologie und Web {(BTW} 2021), 19. Fachtagung des GI-Fachbereichs ,,Datenbanken und Informationssysteme" (DBIS), 13.-17. September 2021, Dresden, Germany, Proceedings}, series = {{LNI}}, volume = {{P-311}}, pages = {101--122}, publisher = {Gesellschaft f{\"{u}}r Informatik, Bonn}, year = {2021}, url = {https://doi.org/10.18420/btw2021-05}, doi = {10.18420/BTW2021-05}, timestamp = {Tue, 04 Jul 2023 17:43:09 +0200}, }
- Towards Addressing Noise and Static Variations of Analog Computations Using Efficient Retraining. Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of ECML PKDD 2021, Proceedings Part I (Communications in Computer and Information Science), 1524, 409–420, Springer, 2021
@inproceedings{DBLP:conf/pkdd/KleinKWESSF21, author = {Klein, Bernhard and Kuhn, Lisa and Weis, Johannes and Emmel, Arne and Stradmann, Yannik and Schemmel, Johannes and Fr{\"{o}}ning, Holger}, editor = {Kamp, Michael and Koprinska, Irena and Bibal, Adrien and Bouadi, Tassadit and Fr{\'{e}}nay, Beno{\^{\i}}t and Gal{\'{a}}rraga, Luis and Oramas, Jos{\'{e}} and Adilova, Linara and Krishnamurthy, Yamuna and Kang, Bo and Largeron, Christine and Lijffijt, Jefrey and Viard, Tiphaine and Welke, Pascal and Ruocco, Massimiliano and Aune, Erlend and Gallicchio, Claudio and Schiele, Gregor and Pernkopf, Franz and Blott, Michaela and Fr{\"{o}}ning, Holger and Schindler, G{\"{u}}nther and Guidotti, Riccardo and Monreale, Anna and Rinzivillo, Salvatore and Biecek, Przemyslaw and Ntoutsi, Eirini and Pechenizkiy, Mykola and Rosenhahn, Bodo and Buckley, Christopher L. and Cialfi, Daniela and Lanillos, Pablo and Ramstead, Maxwell and Verbelen, Tim and Ferreira, Pedro M. and Andresini, Giuseppina and Malerba, Donato and Medeiros, Ib{\'{e}}ria and Fournier{-}Viger, Philippe and Nawaz, M. Saqib and Ventura, Sebasti{\'{a}}n and Sun, Meng and Zhou, Min and Bitetta, Valerio and Bordino, Ilaria and Ferretti, Andrea and Gullo, Francesco and Ponti, Giovanni and Severini, Lorenzo and Ribeiro, Rita P. and Gama, Jo{\~{a}}o and Gavald{\`{a}}, Ricard and Cooper, Lee A. D. 
and Ghazaleh, Naghmeh and Richiardi, Jonas and Roqueiro, Damian and Miranda, Diego Saldana and Sechidis, Konstantinos and Gra{\c{c}}a, Guilherme}, title = {Towards Addressing Noise and Static Variations of Analog Computations Using Efficient Retraining}, booktitle = {Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of {ECML} {PKDD} 2021, Proceedings Part {I}}, series = {Communications in Computer and Information Science}, volume = {1524}, pages = {409--420}, publisher = {Springer}, year = {2021}, url = {https://doi.org/10.1007/978-3-030-93736-2_32}, doi = {10.1007/978-3-030-93736-2\_32}, }
- Demystifying memory access patterns of FPGA-based graph processing accelerators. GRADES-NDA ’21: 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), Virtual Event, China, 20 June 2021, 3:1–3:10, ACM, 2021
@inproceedings{DBLP:conf/sigmod/Dann0F21, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, editor = {Kalavri, Vasiliki and Yakovets, Nikolay}, title = {Demystifying memory access patterns of FPGA-based graph processing accelerators}, booktitle = {{GRADES-NDA} '21: 4th {ACM} {SIGMOD} Joint International Workshop on Graph Data Management Experiences {\&} Systems {(GRADES)} and Network Data Analytics (NDA), Virtual Event, China, 20 June 2021}, pages = {3:1--3:10}, publisher = {{ACM}}, year = {2021}, url = {https://doi.org/10.1145/3461837.3464512}, doi = {10.1145/3461837.3464512}, timestamp = {Wed, 14 Jul 2021 16:01:02 +0200}, }
- Understanding Cache Boundness of ML Operators on ARM Processors. CoRR, abs/2102.00932, 2021
@article{DBLP:journals/corr/abs-2102-00932, author = {Klein, Bernhard and Gratl, Christoph and M{\"{u}}cke, Manfred and Fr{\"{o}}ning, Holger}, title = {Understanding Cache Boundness of {ML} Operators on {ARM} Processors}, journal = {CoRR}, volume = {abs/2102.00932}, year = {2021}, url = {https://arxiv.org/abs/2102.00932}, eprinttype = {arXiv}, eprint = {2102.00932}, timestamp = {Thu, 14 Oct 2021 01:00:00 +0200}, }
- The Programming of Deep Learning Accelerators as a Constraint Satisfaction Problem. CoRR, abs/2104.04731, 2021
@article{DBLP:journals/corr/abs-2104-04731, author = {Rieber, Dennis and Acosta, Axel and Fr{\"{o}}ning, Holger}, title = {The Programming of Deep Learning Accelerators as a Constraint Satisfaction Problem}, journal = {CoRR}, volume = {abs/2104.04731}, year = {2021}, url = {https://arxiv.org/abs/2104.04731}, eprinttype = {arXiv}, eprint = {2104.04731}, timestamp = {Tue, 22 Feb 2022 00:00:00 +0100}, }
- Scheduling of Graph Queries: Controlling Intra- and Inter-query Parallelism for a High System Throughput. CoRR, abs/2110.10797, 2021
@article{DBLP:journals/corr/abs-2110-10797, author = {Hauck, Matthias and Oukid, Ismail and Fr{\"{o}}ning, Holger}, title = {Scheduling of Graph Queries: Controlling Intra- and Inter-query Parallelism for a High System Throughput}, journal = {CoRR}, volume = {abs/2110.10797}, year = {2021}, url = {https://arxiv.org/abs/2110.10797}, eprinttype = {arXiv}, eprint = {2110.10797}, timestamp = {Thu, 28 Oct 2021 01:00:00 +0200}, }
2020
- cCUDA: Effective Co-Scheduling of Concurrent Kernels on GPUs. IEEE Trans. Parallel Distributed Syst., 31(4), 766–778, 2020
@article{DBLP:journals/tpds/ShekoftehNNFY20, author = {Shekofteh, S. Kazem and Noori, Hamid and Naghibzadeh, Mahmoud and Fr{\"{o}}ning, Holger and Yazdi, Hadi Sadoghi}, title = {cCUDA: Effective Co-Scheduling of Concurrent Kernels on GPUs}, journal = {{IEEE} Trans. Parallel Distributed Syst.}, volume = {31}, number = {4}, pages = {766--778}, year = {2020}, url = {https://doi.org/10.1109/TPDS.2019.2944602}, doi = {10.1109/TPDS.2019.2944602}, timestamp = {Fri, 02 Oct 2020 01:00:00 +0200}, }
- Towards Real-Time Single-Channel Singing-Voice Separation with Pruned Multi-Scaled Densenets. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020, 806–810, IEEE, 2020
@inproceedings{DBLP:conf/icassp/HuberSSRPF20, author = {Huber, Markus and Schindler, G{\"{u}}nther and Sch{\"{o}}rkhuber, Christian and Roth, Wolfgang and Pernkopf, Franz and Fr{\"{o}}ning, Holger}, title = {Towards Real-Time Single-Channel Singing-Voice Separation with Pruned Multi-Scaled Densenets}, booktitle = {2020 {IEEE} International Conference on Acoustics, Speech and Signal Processing, {ICASSP} 2020, Barcelona, Spain, May 4-8, 2020}, pages = {806--810}, publisher = {{IEEE}}, year = {2020}, url = {https://doi.org/10.1109/ICASSP40776.2020.9053542}, doi = {10.1109/ICASSP40776.2020.9053542}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- On Network Locality in MPI-Based HPC Applications. ICPP 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020, 57:1–57:10, ACM, 2020
@inproceedings{DBLP:conf/icpp/ZahnF20, author = {Zahn, Felix and Fr{\"{o}}ning, Holger}, editor = {Amaral, Jos{\'{e}} Nelson and John, Lizy Kurian and Shen, Xipeng}, title = {On Network Locality in MPI-Based {HPC} Applications}, booktitle = {{ICPP} 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020}, pages = {57:1--57:10}, publisher = {{ACM}}, year = {2020}, url = {https://doi.org/10.1145/3404397.3404436}, doi = {10.1145/3404397.3404436}, timestamp = {Wed, 12 Aug 2020 17:44:07 +0200}, }
- Automated Partitioning of Data-Parallel Kernels using Polyhedral Compilation. ICPP Workshops ’20: Workshops, Edmonton, AB, Canada, August 17-20, 2020, 13:1–13:10, ACM, 2020
@inproceedings{DBLP:conf/icppw/MatzDF20, author = {Matz, Alexander and Doerfert, Johannes and Fr{\"{o}}ning, Holger}, editor = {Silla, Federico and Abdelrahman, Tarek S.}, title = {Automated Partitioning of Data-Parallel Kernels using Polyhedral Compilation}, booktitle = {{ICPP} Workshops '20: Workshops, Edmonton, AB, Canada, August 17-20, 2020}, pages = {13:1--13:10}, publisher = {{ACM}}, year = {2020}, url = {https://doi.org/10.1145/3409390.3409403}, doi = {10.1145/3409390.3409403}, timestamp = {Mon, 03 Jan 2022 00:00:00 +0100}, }
- Assessing the Overhead of Offloading Compression Tasks. ICPP Workshops ’20: Workshops, Edmonton, AB, Canada, August 17-20, 2020, 15:1–15:10, ACM, 2020
@inproceedings{DBLP:conf/icppw/PrombergerSF20, author = {Promberger, Laura and Schwemmer, Rainer and Fr{\"{o}}ning, Holger}, editor = {Silla, Federico and Abdelrahman, Tarek S.}, title = {Assessing the Overhead of Offloading Compression Tasks}, booktitle = {{ICPP} Workshops '20: Workshops, Edmonton, AB, Canada, August 17-20, 2020}, pages = {15:1--15:10}, publisher = {{ACM}}, year = {2020}, url = {https://doi.org/10.1145/3409390.3409405}, doi = {10.1145/3409390.3409405}, timestamp = {Wed, 15 Dec 2021 00:00:00 +0100}, }
- On Resource-Efficient Bayesian Network Classifiers and Deep Neural Networks. 25th International Conference on Pattern Recognition, ICPR 2020, Virtual Event / Milan, Italy, January 10-15, 2021, 10297–10304, IEEE, 2020
@inproceedings{DBLP:conf/icpr/RothPSF20, author = {Roth, Wolfgang and Pernkopf, Franz and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger}, title = {On Resource-Efficient Bayesian Network Classifiers and Deep Neural Networks}, booktitle = {25th International Conference on Pattern Recognition, {ICPR} 2020, Virtual Event / Milan, Italy, January 10-15, 2021}, pages = {10297--10304}, publisher = {{IEEE}}, year = {2020}, url = {https://doi.org/10.1109/ICPR48806.2021.9413156}, doi = {10.1109/ICPR48806.2021.9413156}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- Parameterized Structured Pruning for Deep Neural Networks. Machine Learning, Optimization, and Data Science - 6th International Conference, LOD 2020, Siena, Italy, July 19-23, 2020, Revised Selected Papers, Part II (Lecture Notes in Computer Science), 12566, 16–27, Springer, 2020
@inproceedings{DBLP:conf/mod/SchindlerRPF20, author = {Schindler, G{\"{u}}nther and Roth, Wolfgang and Pernkopf, Franz and Fr{\"{o}}ning, Holger}, editor = {Nicosia, Giuseppe and Ojha, Varun and Malfa, Emanuele La and Jansen, Giorgio and Sciacca, Vincenzo and Pardalos, Panos M. and Giuffrida, Giovanni and Umeton, Renato}, title = {Parameterized Structured Pruning for Deep Neural Networks}, booktitle = {Machine Learning, Optimization, and Data Science - 6th International Conference, {LOD} 2020, Siena, Italy, July 19-23, 2020, Revised Selected Papers, Part {II}}, series = {Lecture Notes in Computer Science}, volume = {12566}, pages = {16--27}, publisher = {Springer}, year = {2020}, url = {https://doi.org/10.1007/978-3-030-64580-9\_3}, doi = {10.1007/978-3-030-64580-9\_3}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- Search Space Complexity of Iteration Domain Based Instruction Embedding for Deep Learning Accelerators. IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, ITEM 2020, Co-located with ECML/PKDD 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers (Communications in Computer and Information Science), 1325, 213–228, Springer, 2020
@inproceedings{DBLP:conf/pkdd/RieberF20, author = {Rieber, Dennis and Fr{\"{o}}ning, Holger}, editor = {Gama, Jo{\~{a}}o and Pashami, Sepideh and Bifet, Albert and {Sayed Mouchaweh}, Moamar and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Schiele, Gregor and Blott, Michaela}, title = {Search Space Complexity of Iteration Domain Based Instruction Embedding for Deep Learning Accelerators}, booktitle = {IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, {ITEM} 2020, Co-located with {ECML/PKDD} 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers}, series = {Communications in Computer and Information Science}, volume = {1325}, pages = {213--228}, publisher = {Springer}, year = {2020}, url = {https://doi.org/10.1007/978-3-030-66770-2\_16}, doi = {10.1007/978-3-030-66770-2\_16}, timestamp = {Wed, 07 Apr 2021 01:00:00 +0200}, }
- On the Difficulty of Designing Processor Arrays for Deep Neural Networks. IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, ITEM 2020, Co-located with ECML/PKDD 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers (Communications in Computer and Information Science), 1325, 229–240, Springer, 2020
@inproceedings{DBLP:conf/pkdd/StehleSF20, author = {Stehle, Kevin and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger}, editor = {Gama, Jo{\~{a}}o and Pashami, Sepideh and Bifet, Albert and {Sayed Mouchaweh}, Moamar and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Schiele, Gregor and Blott, Michaela}, title = {On the Difficulty of Designing Processor Arrays for Deep Neural Networks}, booktitle = {IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, {ITEM} 2020, Co-located with {ECML/PKDD} 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers}, series = {Communications in Computer and Information Science}, volume = {1325}, pages = {229--240}, publisher = {Springer}, year = {2020}, url = {https://doi.org/10.1007/978-3-030-66770-2\_17}, doi = {10.1007/978-3-030-66770-2\_17}, timestamp = {Mon, 15 Feb 2021 00:00:00 +0100}, }
- Resource-Efficient Neural Networks for Embedded Systems. CoRR, abs/2001.03048, 2020
@article{DBLP:journals/corr/abs-2001-03048, author = {Roth, Wolfgang and Schindler, G{\"{u}}nther and Z{\"{o}}hrer, Matthias and Pfeifenberger, Lukas and Peharz, Robert and Tschiatschek, Sebastian and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Ghahramani, Zoubin}, title = {Resource-Efficient Neural Networks for Embedded Systems}, journal = {CoRR}, volume = {abs/2001.03048}, year = {2020}, url = {http://arxiv.org/abs/2001.03048}, eprinttype = {arXiv}, eprint = {2001.03048}, timestamp = {Mon, 13 Jan 2020 00:00:00 +0100}, }
- A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of GPU Kernels. CoRR, abs/2001.07104, 2020
@article{DBLP:journals/corr/abs-2001-07104, author = {Braun, Lorenz and Nikas, Sotirios and Song, Chen and Heuveline, Vincent and Fr{\"{o}}ning, Holger}, title = {A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of {GPU} Kernels}, journal = {CoRR}, volume = {abs/2001.07104}, year = {2020}, url = {https://arxiv.org/abs/2001.07104}, eprinttype = {arXiv}, eprint = {2001.07104}, timestamp = {Sat, 23 Jan 2021 00:00:00 +0100}, }
- Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement. CoRR, abs/2007.11477, 2020
@article{DBLP:journals/corr/abs-2007-11477, author = {Pfeifenberger, Lukas and Z{\"{o}}hrer, Matthias and Schindler, G{\"{u}}nther and Roth, Wolfgang and Fr{\"{o}}ning, Holger and Pernkopf, Franz}, title = {Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement}, journal = {CoRR}, volume = {abs/2007.11477}, year = {2020}, url = {https://arxiv.org/abs/2007.11477}, eprinttype = {arXiv}, eprint = {2007.11477}, timestamp = {Wed, 29 Jul 2020 01:00:00 +0200}, }
- Exploring Memory Access Patterns for Graph Processing Accelerators. CoRR, abs/2010.13619, 2020
@article{DBLP:journals/corr/abs-2010-13619, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, title = {Exploring Memory Access Patterns for Graph Processing Accelerators}, journal = {CoRR}, volume = {abs/2010.13619}, year = {2020}, url = {https://arxiv.org/abs/2010.13619}, eprinttype = {arXiv}, eprint = {2010.13619}, timestamp = {Mon, 02 Nov 2020 00:00:00 +0100}, }
2019
- Constructing virtual 5-dimensional tori out of lower-dimensional network cards. Concurr. Comput. Pract. Exp., 31(2), 2019
@article{DBLP:journals/concurrency/AndujarVSADF19, author = {Andujar, Francisco J. and Villar, Juan A. and S{\'{a}}nchez, Jos{\'{e}} L. and Alfaro, Francisco J. and Duato, Jos{\'{e}} and Fr{\"{o}}ning, Holger}, title = {Constructing virtual 5-dimensional tori out of lower-dimensional network cards}, journal = {Concurr. Comput. Pract. Exp.}, volume = {31}, number = {2}, year = {2019}, url = {https://doi.org/10.1002/cpe.4361}, doi = {10.1002/CPE.4361}, timestamp = {Mon, 02 Mar 2020 00:00:00 +0100}, }
- On link width scaling for energy-proportional direct interconnection networks. Concurr. Comput. Pract. Exp., 31(2), 2019
@article{DBLP:journals/concurrency/ZahnLF19, author = {Zahn, Felix and Lammel, Steffen and Fr{\"{o}}ning, Holger}, title = {On link width scaling for energy-proportional direct interconnection networks}, journal = {Concurr. Comput. Pract. Exp.}, volume = {31}, number = {2}, year = {2019}, url = {https://doi.org/10.1002/cpe.4439}, doi = {10.1002/CPE.4439}, timestamp = {Mon, 02 Mar 2020 00:00:00 +0100}, }
- Metric Selection for GPU Kernel Classification. ACM Trans. Archit. Code Optim., 15(4), 68:1–68:27, 2019
@article{DBLP:journals/taco/ShekoftehNNYF19, author = {Shekofteh, S. Kazem and Noori, Hamid and Naghibzadeh, Mahmoud and Yazdi, Hadi Sadoghi and Fr{\"{o}}ning, Holger}, title = {Metric Selection for {GPU} Kernel Classification}, journal = {{ACM} Trans. Archit. Code Optim.}, volume = {15}, number = {4}, pages = {68:1--68:27}, year = {2019}, url = {https://doi.org/10.1145/3295690}, doi = {10.1145/3295690}, timestamp = {Sat, 08 Jan 2022 00:00:00 +0100}, }
- Quantifying the NUMA Behavior of Partitioned GPGPU Applications. 12th Workshop on General Purpose Processing Using GPUs, GPGPU@ASPLOS 2019, Providence, RI, USA, April 13, 2019, 53–62, ACM, 2019
@inproceedings{DBLP:conf/asplos/MatzF19, author = {Matz, Alexander and Fr{\"{o}}ning, Holger}, editor = {Jog, Adwait and Kayiran, Onur}, title = {Quantifying the {NUMA} Behavior of Partitioned {GPGPU} Applications}, booktitle = {12th Workshop on General Purpose Processing Using GPUs, GPGPU@ASPLOS 2019, Providence, RI, USA, April 13, 2019}, pages = {53--62}, publisher = {{ACM}}, year = {2019}, url = {https://doi.org/10.1145/3300053.3319420}, doi = {10.1145/3300053.3319420}, timestamp = {Tue, 16 Apr 2019 17:25:22 +0200}, }
- Effects of Congestion Management on Energy Saving Techniques in Interconnection Networks. The 5th International Workshop on High-Performance Interconnection Networks in the ExaScale and Big-Data Era, HiPINEB@HPCA 2019, 17 February 2019, Washington, DC, USA, 9–16, IEEE Computer Society, 2019
@inproceedings{DBLP:conf/hpca/ZahnYEGF19, author = {Zahn, Felix and Y{\'{e}}benes, Pedro and Escudero{-}Sahuquillo, Jes{\'{u}}s and Garc{\'{\i}}a, Pedro Javier and Fr{\"{o}}ning, Holger}, title = {Effects of Congestion Management on Energy Saving Techniques in Interconnection Networks}, booktitle = {The 5th International Workshop on High-Performance Interconnection Networks in the ExaScale and Big-Data Era, HiPINEB@HPCA 2019, 17 February 2019, Washington, DC, {USA}}, pages = {9--16}, publisher = {{IEEE} Computer Society}, year = {2019}, url = {https://doi.org/10.1109/HiPINEB.2019.00009}, doi = {10.1109/HIPINEB.2019.00009}, timestamp = {Mon, 03 Jan 2022 00:00:00 +0100}, }
- Software-Based Buffering of Associative Operations on Random Memory Addresses. 2019 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2019, Rio de Janeiro, Brazil, May 20-24, 2019, 943–952, IEEE, 2019
@inproceedings{DBLP:conf/ipps/HauckPF19, author = {Hauck, Matthias and Paradies, Marcus and Fr{\"{o}}ning, Holger}, title = {Software-Based Buffering of Associative Operations on Random Memory Addresses}, booktitle = {2019 {IEEE} International Parallel and Distributed Processing Symposium, {IPDPS} 2019, Rio de Janeiro, Brazil, May 20-24, 2019}, pages = {943--952}, publisher = {{IEEE}}, year = {2019}, url = {https://doi.org/10.1109/IPDPS.2019.00102}, doi = {10.1109/IPDPS.2019.00102}, timestamp = {Wed, 16 Oct 2019 14:14:51 +0200}, }
- Training Discrete-Valued Neural Networks with Sign Activations Using Weight Distributions. Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2019, Würzburg, Germany, September 16-20, 2019, Proceedings, Part II (Lecture Notes in Computer Science), 11907, 382–398, Springer, 2019
@inproceedings{DBLP:conf/pkdd/RothSFP19, author = {Roth, Wolfgang and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger and Pernkopf, Franz}, editor = {Brefeld, Ulf and Fromont, {\'{E}}lisa and Hotho, Andreas and Knobbe, Arno J. and Maathuis, Marloes H. and Robardet, C{\'{e}}line}, title = {Training Discrete-Valued Neural Networks with Sign Activations Using Weight Distributions}, booktitle = {Machine Learning and Knowledge Discovery in Databases - European Conference, {ECML} {PKDD} 2019, W{\"{u}}rzburg, Germany, September 16-20, 2019, Proceedings, Part {II}}, series = {Lecture Notes in Computer Science}, volume = {11907}, pages = {382--398}, publisher = {Springer}, year = {2019}, url = {https://doi.org/10.1007/978-3-030-46147-8\_23}, doi = {10.1007/978-3-030-46147-8\_23}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- CUDA Flux: A Lightweight Instruction Profiler for CUDA Applications. 2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, PMBS@SC 2019, Denver, CO, USA, November 18, 2019, 73–81, IEEE, 2019
@inproceedings{DBLP:conf/sc/BraunF19, author = {Braun, Lorenz and Fr{\"{o}}ning, Holger}, title = {{CUDA} Flux: {A} Lightweight Instruction Profiler for {CUDA} Applications}, booktitle = {2019 {IEEE/ACM} Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, PMBS@SC 2019, Denver, CO, USA, November 18, 2019}, pages = {73--81}, publisher = {{IEEE}}, year = {2019}, url = {https://doi.org/10.1109/PMBS49563.2019.00014}, doi = {10.1109/PMBS49563.2019.00014}, timestamp = {Wed, 22 Apr 2020 16:43:07 +0200}, }
2018
- Heterogeneous and unconventional cluster architectures and applications. Concurr. Comput. Pract. Exp., 30(17), 2018
@article{DBLP:journals/concurrency/FroningS18, author = {Fr{\"{o}}ning, Holger and Silla, Federico}, title = {Heterogeneous and unconventional cluster architectures and applications}, journal = {Concurr. Comput. Pract. Exp.}, volume = {30}, number = {17}, year = {2018}, url = {https://doi.org/10.1002/cpe.4661}, doi = {10.1002/CPE.4661}, timestamp = {Mon, 02 Mar 2020 00:00:00 +0100}, }
- Buffer Provisioning for Large-Scale Data-Acquisition Systems. 12th ACM International Conference on Distributed and Event-based Systems, DEBS 2018, Hamilton, New Zealand, June 25-29, 2018, 100–111, ACM, 2018
@inproceedings{DBLP:conf/debs/SantosVGF18, author = {Santos, Alejandro and Vandelli, Wainer and Garc{\'{\i}}a, Pedro Javier and Fr{\"{o}}ning, Holger}, editor = {Hinze, Annika and Eyers, David M. and Hirzel, Martin and Weidlich, Matthias and Bhowmik, Sukanya}, title = {Buffer Provisioning for Large-Scale Data-Acquisition Systems}, booktitle = {12th {ACM} International Conference on Distributed and Event-based Systems, {DEBS} 2018, Hamilton, New Zealand, June 25-29, 2018}, pages = {100--111}, publisher = {{ACM}}, year = {2018}, url = {https://doi.org/10.1145/3210284.3210288}, doi = {10.1145/3210284.3210288}, timestamp = {Fri, 26 May 2023 07:40:34 +0200}, }
- Evaluating Energy-Saving Strategies on Torus, K-Ary N-Tree, and Dragonfly. 4th IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2018, Vienna, Austria, February 24, 2018, 16–23, IEEE Computer Society, 2018
@inproceedings{DBLP:conf/hpca/ZahnSF18, author = {Zahn, Felix and Schoffer, Armin and Fr{\"{o}}ning, Holger}, title = {Evaluating Energy-Saving Strategies on Torus, K-Ary N-Tree, and Dragonfly}, booktitle = {4th {IEEE} International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2018, Vienna, Austria, February 24, 2018}, pages = {16--23}, publisher = {{IEEE} Computer Society}, year = {2018}, url = {https://doi.org/10.1109/HiPINEB.2018.00011}, doi = {10.1109/HIPINEB.2018.00011}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Resource Efficient Deep Eigenvector Beamforming. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2018, Calgary, AB, Canada, April 15-20, 2018, 3354–3358, IEEE, 2018
@inproceedings{DBLP:conf/icassp/ZohrerPSFP18, author = {Z{\"{o}}hrer, Matthias and Pfeifenberger, Lukas and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger and Pernkopf, Franz}, title = {Resource Efficient Deep Eigenvector Beamforming}, booktitle = {2018 {IEEE} International Conference on Acoustics, Speech and Signal Processing, {ICASSP} 2018, Calgary, AB, Canada, April 15-20, 2018}, pages = {3354--3358}, publisher = {{IEEE}}, year = {2018}, url = {https://doi.org/10.1109/ICASSP.2018.8462503}, doi = {10.1109/ICASSP.2018.8462503}, timestamp = {Wed, 16 Oct 2019 14:14:52 +0200}, }
- Towards Efficient Forward Propagation on Resource-Constrained Systems. Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2018, Dublin, Ireland, September 10-14, 2018, Proceedings, Part I (Lecture Notes in Computer Science), 11051, 426–442, Springer, 2018
@inproceedings{DBLP:conf/pkdd/SchindlerZPF18, author = {Schindler, G{\"{u}}nther and Z{\"{o}}hrer, Matthias and Pernkopf, Franz and Fr{\"{o}}ning, Holger}, editor = {Berlingerio, Michele and Bonchi, Francesco and G{\"{a}}rtner, Thomas and Hurley, Neil and Ifrim, Georgiana}, title = {Towards Efficient Forward Propagation on Resource-Constrained Systems}, booktitle = {Machine Learning and Knowledge Discovery in Databases - European Conference, {ECML} {PKDD} 2018, Dublin, Ireland, September 10-14, 2018, Proceedings, Part {I}}, series = {Lecture Notes in Computer Science}, volume = {11051}, pages = {426--442}, publisher = {Springer}, year = {2018}, url = {https://doi.org/10.1007/978-3-030-10925-7\_26}, doi = {10.1007/978-3-030-10925-7\_26}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- Efficient and Robust Machine Learning for Real-World Systems. CoRR, abs/1812.02240, 2018
@article{DBLP:journals/corr/abs-1812-02240, author = {Pernkopf, Franz and Roth, Wolfgang and Z{\"{o}}hrer, Matthias and Pfeifenberger, Lukas and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger and Tschiatschek, Sebastian and Peharz, Robert and Mattina, Matthew and Ghahramani, Zoubin}, title = {Efficient and Robust Machine Learning for Real-World Systems}, journal = {CoRR}, volume = {abs/1812.02240}, year = {2018}, url = {http://arxiv.org/abs/1812.02240}, eprinttype = {arXiv}, eprint = {1812.02240}, timestamp = {Tue, 01 Jan 2019 00:00:00 +0100}, }
2017
- InfiniBand Verbs on GPU: a case study of controlling an InfiniBand network device from the GPU. Int. J. High Perform. Comput. Appl., 31(4), 274–284, 2017
@article{DBLP:journals/ijhpca/OdenF17, author = {Oden, Lena and Fr{\"{o}}ning, Holger}, title = {InfiniBand Verbs on {GPU:} a case study of controlling an InfiniBand network device from the {GPU}}, journal = {Int. J. High Perform. Comput. Appl.}, volume = {31}, number = {4}, pages = {274--284}, year = {2017}, url = {https://doi.org/10.1177/1094342015588142}, doi = {10.1177/1094342015588142}, timestamp = {Thu, 12 Mar 2020 00:00:00 +0100}, }
- Linking Application Description with Efficient SIMD Code Generation for Low-Precision Signed-Integer GEMM. Euro-Par 2017: Parallel Processing Workshops - Euro-Par 2017 International Workshops, Santiago de Compostela, Spain, August 28-29, 2017, Revised Selected Papers (Lecture Notes in Computer Science), 10659, 688–699, Springer, 2017
@inproceedings{DBLP:conf/europar/SchindlerMF17, author = {Schindler, G{\"{u}}nther and M{\"{u}}cke, Manfred and Fr{\"{o}}ning, Holger}, editor = {Heras, Dora Blanco and Boug{\'{e}}, Luc and Mencagli, Gabriele and Jeannot, Emmanuel and Sakellariou, Rizos and Badia, Rosa M. and Barbosa, Jorge G. and Ricci, Laura and Scott, Stephen L. and Lankes, Stefan and Weidendorfer, Josef}, title = {Linking Application Description with Efficient {SIMD} Code Generation for Low-Precision Signed-Integer {GEMM}}, booktitle = {Euro-Par 2017: Parallel Processing Workshops - Euro-Par 2017 International Workshops, Santiago de Compostela, Spain, August 28-29, 2017, Revised Selected Papers}, series = {Lecture Notes in Computer Science}, volume = {10659}, pages = {688--699}, publisher = {Springer}, year = {2017}, url = {https://doi.org/10.1007/978-3-319-75178-8\_55}, doi = {10.1007/978-3-319-75178-8\_55}, timestamp = {Thu, 14 Oct 2021 10:28:38 +0200}, }
- Can Modern Graph Processing Engines Run Concurrent Queries Efficiently? Fifth International Workshop on Graph Data-management Experiences & Systems, GRADES@SIGMOD/PODS 2017, Chicago, IL, USA, May 14 - 19, 2017, 5:1–5:6, ACM, 2017
@inproceedings{DBLP:conf/grades/HauckPF17, author = {Hauck, Matthias and Paradies, Marcus and Fr{\"{o}}ning, Holger}, editor = {Boncz, Peter A. and Larriba{-}Pey, Josep Llu{\'{\i}}s}, title = {Can Modern Graph Processing Engines Run Concurrent Queries Efficiently?}, booktitle = {Fifth International Workshop on Graph Data-management Experiences {\&} Systems, GRADES@SIGMOD/PODS 2017, Chicago, IL, USA, May 14 - 19, 2017}, pages = {5:1--5:6}, publisher = {{ACM}}, year = {2017}, url = {https://doi.org/10.1145/3078447.3078452}, doi = {10.1145/3078447.3078452}, timestamp = {Thu, 10 Dec 2020 13:35:15 +0100}, }
- A Case Study on Implementing Virtual 5D Torus Networks Using Network Components of Lower Dimensionality. 3rd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2017, Austin, TX, USA, February 5, 2017, 9–16, IEEE Computer Society, 2017
@inproceedings{DBLP:conf/hpca/AndujarV0ADF17, author = {Andujar, Francisco J. and Villar, Juan A. and S{\'{a}}nchez, Jos{\'{e}} L. and Alfaro, Francisco J. and Duato, Jos{\'{e}} and Fr{\"{o}}ning, Holger}, editor = {Escudero{-}Sahuquillo, Jes{\'{u}}s and Garc{\'{\i}}a, Pedro Javier}, title = {A Case Study on Implementing Virtual 5D Torus Networks Using Network Components of Lower Dimensionality}, booktitle = {3rd {IEEE} International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2017, Austin, TX, USA, February 5, 2017}, pages = {9--16}, publisher = {{IEEE} Computer Society}, year = {2017}, url = {https://doi.org/10.1109/HiPINEB.2017.7}, doi = {10.1109/HIPINEB.2017.7}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Early Experiences with Saving Energy in Direct Interconnection Networks. 3rd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2017, Austin, TX, USA, February 5, 2017, 33–40, IEEE Computer Society, 2017
@inproceedings{DBLP:conf/hpca/ZahnLF17, author = {Zahn, Felix and Lammel, Steffen and Fr{\"{o}}ning, Holger}, editor = {Escudero{-}Sahuquillo, Jes{\'{u}}s and Garc{\'{\i}}a, Pedro Javier}, title = {Early Experiences with Saving Energy in Direct Interconnection Networks}, booktitle = {3rd {IEEE} International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2017, Austin, TX, USA, February 5, 2017}, pages = {33--40}, publisher = {{IEEE} Computer Society}, year = {2017}, url = {https://doi.org/10.1109/HiPINEB.2017.10}, doi = {10.1109/HIPINEB.2017.10}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Modeling and Validating Time, Buffering, and Utilization of a Large-Scale, Real-Time Data Acquisition System. 2017 International Conference on High Performance Computing & Simulation, HPCS 2017, Genoa, Italy, July 17-21, 2017, 519–525, IEEE, 2017
@inproceedings{DBLP:conf/ieeehpcs/SantosGVF17, author = {Santos, Alejandro and Garc{\'{\i}}a, Pedro Javier and Vandelli, Wainer and Fr{\"{o}}ning, Holger}, title = {Modeling and Validating Time, Buffering, and Utilization of a Large-Scale, Real-Time Data Acquisition System}, booktitle = {2017 International Conference on High Performance Computing {\&} Simulation, {HPCS} 2017, Genoa, Italy, July 17-21, 2017}, pages = {519--525}, publisher = {{IEEE}}, year = {2017}, url = {https://doi.org/10.1109/HPCS.2017.83}, doi = {10.1109/HPCS.2017.83}, timestamp = {Wed, 16 Oct 2019 14:14:54 +0200}, }
- Relaxations for High-Performance Message Passing on Massively Parallel SIMT Processors. 2017 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2017, Orlando, FL, USA, May 29 - June 2, 2017, 855–865, IEEE Computer Society, 2017
@inproceedings{DBLP:conf/ipps/KlenkFED17, author = {Klenk, Benjamin and Fr{\"{o}}ning, Holger and Eberle, Hans and Dennison, Larry}, title = {Relaxations for High-Performance Message Passing on Massively Parallel {SIMT} Processors}, booktitle = {2017 {IEEE} International Parallel and Distributed Processing Symposium, {IPDPS} 2017, Orlando, FL, USA, May 29 - June 2, 2017}, pages = {855--865}, publisher = {{IEEE} Computer Society}, year = {2017}, url = {https://doi.org/10.1109/IPDPS.2017.94}, doi = {10.1109/IPDPS.2017.94}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- An Overview of MPI Characteristics of Exascale Proxy Applications. High Performance Computing - 32nd International Conference, ISC High Performance 2017, Frankfurt, Germany, June 18-22, 2017, Proceedings (Lecture Notes in Computer Science), 10266, 217–236, Springer, 2017
@inproceedings{DBLP:conf/supercomputer/KlenkF17, author = {Klenk, Benjamin and Fr{\"{o}}ning, Holger}, editor = {Kunkel, Julian M. and Yokota, Rio and Balaji, Pavan and Keyes, David E.}, title = {An Overview of {MPI} Characteristics of Exascale Proxy Applications}, booktitle = {High Performance Computing - 32nd International Conference, {ISC} High Performance 2017, Frankfurt, Germany, June 18-22, 2017, Proceedings}, series = {Lecture Notes in Computer Science}, volume = {10266}, pages = {217--236}, publisher = {Springer}, year = {2017}, url = {https://doi.org/10.1007/978-3-319-58667-0\_12}, doi = {10.1007/978-3-319-58667-0\_12}, timestamp = {Tue, 14 May 2019 10:00:40 +0200}, }
2016
- Heterogeneous cluster architectures and applications. Concurr. Comput. Pract. Exp., 28(8), 2319–2321, 2016
@article{DBLP:journals/concurrency/SillaF16, author = {Silla, Federico and Fr{\"{o}}ning, Holger}, title = {Heterogeneous cluster architectures and applications}, journal = {Concurr. Comput. Pract. Exp.}, volume = {28}, number = {8}, pages = {2319--2321}, year = {2016}, url = {https://doi.org/10.1002/cpe.3762}, doi = {10.1002/CPE.3762}, timestamp = {Mon, 02 Mar 2020 00:00:00 +0100}, }
- Analyzing GPU-controlled communication with dynamic parallelism in terms of performance and energy. Parallel Comput., 57, 125–134, 2016
@article{DBLP:journals/pc/OdenKF16, author = {Oden, Lena and Klenk, Benjamin and Fr{\"{o}}ning, Holger}, title = {Analyzing GPU-controlled communication with dynamic parallelism in terms of performance and energy}, journal = {Parallel Comput.}, volume = {57}, pages = {125--134}, year = {2016}, url = {https://doi.org/10.1016/j.parco.2016.02.005}, doi = {10.1016/J.PARCO.2016.02.005}, timestamp = {Sat, 22 Feb 2020 00:00:00 +0100}, }
- Optimizing the data-collection time of a large-scale data-acquisition system through a simulation framework. J. Supercomput., 72(12), 4546–4572, 2016
@article{DBLP:journals/tjs/ColomboFGV16, author = {Colombo, Tommaso and Fr{\"{o}}ning, Holger and Garc{\'{\i}}a, Pedro Javier and Vandelli, Wainer}, title = {Optimizing the data-collection time of a large-scale data-acquisition system through a simulation framework}, journal = {J. Supercomput.}, volume = {72}, number = {12}, pages = {4546--4572}, year = {2016}, url = {https://doi.org/10.1007/s11227-016-1764-1}, doi = {10.1007/S11227-016-1764-1}, timestamp = {Fri, 22 May 2020 01:00:00 +0200}, }
- Analyzing the Energy (Dis-) Proportionality of Scalable Interconnection Networks. 2nd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era HiPINEB@HPCA 2016, Barcelona, Spain, March 12, 2016, 25–32, IEEE Computer Society, 2016
@inproceedings{DBLP:conf/hpca/ZahnYLGF16, author = {Zahn, Felix and Y{\'{e}}benes, Pedro and Lammel, Steffen and Garc{\'{\i}}a, Pedro Javier and Fr{\"{o}}ning, Holger}, editor = {Escudero{-}Sahuquillo, Jes{\'{u}}s and Garc{\'{\i}}a, Pedro Javier}, title = {Analyzing the Energy (Dis-) Proportionality of Scalable Interconnection Networks}, booktitle = {2nd {IEEE} International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era HiPINEB@HPCA 2016, Barcelona, Spain, March 12, 2016}, pages = {25--32}, publisher = {{IEEE} Computer Society}, year = {2016}, url = {https://doi.org/10.1109/HIPINEB.2016.13}, doi = {10.1109/HIPINEB.2016.13}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Optimizing communication for a 2D-partitioned scalable BFS. IEEE High Performance Extreme Computing Conference, HPEC 2016, Waltham, MA, USA, September 13-15, 2016, 1–7, IEEE, 2016
@inproceedings{DBLP:conf/hpec/YoungRHF16, author = {Young, Jeffrey S. and Romera, Julian and Hauck, Matthias and Fr{\"{o}}ning, Holger}, title = {Optimizing communication for a 2D-partitioned scalable {BFS}}, booktitle = {{IEEE} High Performance Extreme Computing Conference, {HPEC} 2016, Waltham, MA, USA, September 13-15, 2016}, pages = {1--7}, publisher = {{IEEE}}, year = {2016}, url = {https://doi.org/10.1109/HPEC.2016.7761596}, doi = {10.1109/HPEC.2016.7761596}, timestamp = {Sun, 12 Nov 2023 00:00:00 +0100}, }
- Exploring Time and Energy for Complex Accesses to a Hybrid Memory Cube. Second International Symposium on Memory Systems, MEMSYS 2016, Alexandria, VA, USA, October 3-6, 2016, 142–150, ACM, 2016
@inproceedings{DBLP:conf/memsys/SchmidtFB16, author = {Schmidt, Juri and Fr{\"{o}}ning, Holger and Br{\"{u}}ning, Ulrich}, editor = {Jacob, Bruce L.}, title = {Exploring Time and Energy for Complex Accesses to a Hybrid Memory Cube}, booktitle = {Second International Symposium on Memory Systems, {MEMSYS} 2016, Alexandria, VA, USA, October 3-6, 2016}, pages = {142--150}, publisher = {{ACM}}, year = {2016}, url = {https://doi.org/10.1145/2989081.2989099}, doi = {10.1145/2989081.2989099}, timestamp = {Fri, 13 Nov 2020 09:24:44 +0100}, }
- SONAR: Automated Communication Characterization for HPC Applications. High Performance Computing - ISC High Performance 2016 International Workshops, ExaComm, E-MuCoCoS, HPC-IODC, IXPUG, IWOPH, P^3MA, VHPC, WOPSSS, Frankfurt, Germany, June 19-23, 2016, Revised Selected Papers (Lecture Notes in Computer Science), 9945, 98–114, 2016
@inproceedings{DBLP:conf/supercomputer/LammelZF16, author = {Lammel, Steffen and Zahn, Felix and Fr{\"{o}}ning, Holger}, editor = {Taufer, Michela and Mohr, Bernd and Kunkel, Julian M.}, title = {{SONAR:} Automated Communication Characterization for {HPC} Applications}, booktitle = {High Performance Computing - {ISC} High Performance 2016 International Workshops, ExaComm, E-MuCoCoS, HPC-IODC, IXPUG, IWOPH, P{\^{}}3MA, VHPC, WOPSSS, Frankfurt, Germany, June 19-23, 2016, Revised Selected Papers}, series = {Lecture Notes in Computer Science}, volume = {9945}, pages = {98--114}, year = {2016}, url = {https://doi.org/10.1007/978-3-319-46079-6\_8}, doi = {10.1007/978-3-319-46079-6\_8}, timestamp = {Wed, 25 Sep 2019 18:17:53 +0200}, }
2015
- On the design of a new dynamic credit-based end-to-end flow control mechanism for HPC clusters. Parallel Comput., 46, 32–59, 2015
@article{DBLP:journals/pc/PradesSFND15, author = {Prades, Javier and Silla, Federico and Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian and Duato, Jos{\'{e}}}, title = {On the design of a new dynamic credit-based end-to-end flow control mechanism for {HPC} clusters}, journal = {Parallel Comput.}, volume = {46}, pages = {32--59}, year = {2015}, url = {https://doi.org/10.1016/j.parco.2015.03.006}, doi = {10.1016/J.PARCO.2015.03.006}, timestamp = {Sun, 02 Oct 2022 01:00:00 +0200}, }
- Modeling a Large Data-Acquisition Network in a Simulation Framework. 2015 IEEE International Conference on Cluster Computing, CLUSTER 2015, Chicago, IL, USA, September 8-11, 2015, 809–816, IEEE Computer Society, 2015
@inproceedings{DBLP:conf/cluster/ColomboFGV15, author = {Colombo, Tommaso and Fr{\"{o}}ning, Holger and Garc{\'{\i}}a, Pedro Javier and Vandelli, Wainer}, title = {Modeling a Large Data-Acquisition Network in a Simulation Framework}, booktitle = {2015 {IEEE} International Conference on Cluster Computing, {CLUSTER} 2015, Chicago, IL, USA, September 8-11, 2015}, pages = {809--816}, publisher = {{IEEE} Computer Society}, year = {2015}, url = {https://doi.org/10.1109/CLUSTER.2015.137}, doi = {10.1109/CLUSTER.2015.137}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
- Highspeed Graph Processing Exploiting Main-Memory Column Stores. Euro-Par 2015: Parallel Processing Workshops - Euro-Par 2015 International Workshops, Vienna, Austria, August 24-25, 2015, Revised Selected Papers (Lecture Notes in Computer Science), 9523, 503–514, Springer, 2015
@inproceedings{DBLP:conf/europar/HauckPFLR15, author = {Hauck, Matthias and Paradies, Marcus and Fr{\"{o}}ning, Holger and Lehner, Wolfgang and Rauhe, Hannes}, editor = {Hunold, Sascha and Costan, Alexandru and Gim{\'{e}}nez, Domingo and Iosup, Alexandru and Ricci, Laura and Requena, Mar{\'{\i}}a Engracia G{\'{o}}mez and Scarano, Vittorio and Varbanescu, Ana Lucia and Scott, Stephen L. and Lankes, Stefan and Weidendorfer, Josef and Alexander, Michael}, title = {Highspeed Graph Processing Exploiting Main-Memory Column Stores}, booktitle = {Euro-Par 2015: Parallel Processing Workshops - Euro-Par 2015 International Workshops, Vienna, Austria, August 24-25, 2015, Revised Selected Papers}, series = {Lecture Notes in Computer Science}, volume = {9523}, pages = {503--514}, publisher = {Springer}, year = {2015}, url = {https://doi.org/10.1007/978-3-319-27308-2\_41}, doi = {10.1007/978-3-319-27308-2\_41}, timestamp = {Tue, 14 May 2019 10:00:46 +0200}, }
- Analyzing communication models for distributed thread-collaborative processors in terms of energy and time. 2015 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2015, Philadelphia, PA, USA, March 29-31, 2015, 318–327, IEEE Computer Society, 2015
@inproceedings{DBLP:conf/ispass/KlenkOF15, author = {Klenk, Benjamin and Oden, Lena and Fr{\"{o}}ning, Holger}, title = {Analyzing communication models for distributed thread-collaborative processors in terms of energy and time}, booktitle = {2015 {IEEE} International Symposium on Performance Analysis of Systems and Software, {ISPASS} 2015, Philadelphia, PA, USA, March 29-31, 2015}, pages = {318--327}, publisher = {{IEEE} Computer Society}, year = {2015}, url = {https://doi.org/10.1109/ISPASS.2015.7095817}, doi = {10.1109/ISPASS.2015.7095817}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
2014
- Special issue on unconventional cluster architectures and applications. Clust. Comput., 17(2), 291, 2014
@article{DBLP:journals/cluster/SillaF14, author = {Silla, Federico and Fr{\"{o}}ning, Holger}, title = {Special issue on unconventional cluster architectures and applications}, journal = {Clust. Comput.}, volume = {17}, number = {2}, pages = {291}, year = {2014}, url = {https://doi.org/10.1007/s10586-013-0291-6}, doi = {10.1007/S10586-013-0291-6}, timestamp = {Tue, 29 Sep 2020 01:00:00 +0200}, }
- Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs. 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2014, Chicago, IL, USA, May 26-29, 2014, 483–492, IEEE Computer Society, 2014
@inproceedings{DBLP:conf/ccgrid/OdenKF14, author = {Oden, Lena and Klenk, Benjamin and Fr{\"{o}}ning, Holger}, title = {Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs}, booktitle = {14th {IEEE/ACM} International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2014, Chicago, IL, USA, May 26-29, 2014}, pages = {483--492}, publisher = {{IEEE} Computer Society}, year = {2014}, url = {https://doi.org/10.1109/CCGrid.2014.21}, doi = {10.1109/CCGRID.2014.21}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Analyzing Put/Get APIs for Thread-Collaborative Processors. 43rd International Conference on Parallel Processing Workshops, ICPPW 2014, Minneapolis, MN, USA, September 9-12, 2014, 411–418, IEEE Computer Society, 2014
@inproceedings{DBLP:conf/icppw/KlenkOF14, author = {Klenk, Benjamin and Oden, Lena and Fr{\"{o}}ning, Holger}, title = {Analyzing Put/Get APIs for Thread-Collaborative Processors}, booktitle = {43rd International Conference on Parallel Processing Workshops, {ICPPW} 2014, Minneapolis, MN, USA, September 9-12, 2014}, pages = {411--418}, publisher = {{IEEE} Computer Society}, year = {2014}, url = {https://doi.org/10.1109/ICPPW.2014.61}, doi = {10.1109/ICPPW.2014.61}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Infiniband-Verbs on GPU: A Case Study of Controlling an Infiniband Network Device from the GPU. 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, Phoenix, AZ, USA, May 19-23, 2014, 976–983, IEEE Computer Society, 2014
@inproceedings{DBLP:conf/ipps/OdenFP14, author = {Oden, Lena and Fr{\"{o}}ning, Holger and Pfreundt, Franz{-}Josef}, title = {Infiniband-Verbs on {GPU:} {A} Case Study of Controlling an Infiniband Network Device from the {GPU}}, booktitle = {2014 {IEEE} International Parallel {\&} Distributed Processing Symposium Workshops, Phoenix, AZ, USA, May 19-23, 2014}, pages = {976--983}, publisher = {{IEEE} Computer Society}, year = {2014}, url = {https://doi.org/10.1109/IPDPSW.2014.111}, doi = {10.1109/IPDPSW.2014.111}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Energy-efficient stencil computations on distributed GPUs using dynamic parallelism and GPU-controlled communication. 2nd International Workshop on Energy Efficient Supercomputing, E2SC ’14, New Orleans, Louisiana, USA, November 16-21, 2014, 31–40, IEEE Computer Society, 2014
@inproceedings{DBLP:conf/sc/OdenKF14, author = {Oden, Lena and Klenk, Benjamin and Fr{\"{o}}ning, Holger}, editor = {Cameron, Kirk W. and Hoisie, Adolfy and Kerbyson, Darren J. and Lowenthal, David K. and Nikolopoulos, Dimitrios S. and Yalamanchili, Sudha and Marquez, Andres}, title = {Energy-efficient stencil computations on distributed GPUs using dynamic parallelism and GPU-controlled communication}, booktitle = {2nd International Workshop on Energy Efficient Supercomputing, {E2SC} '14, New Orleans, Louisiana, USA, November 16-21, 2014}, pages = {31--40}, publisher = {{IEEE} Computer Society}, year = {2014}, url = {https://doi.org/10.1109/E2SC.2014.14}, doi = {10.1109/E2SC.2014.14}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
2013
- On Achieving High Message Rates. 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2013, Delft, Netherlands, May 13-16, 2013, 498–505, IEEE Computer Society, 2013
@inproceedings{DBLP:conf/ccgrid/FroningNLLB13, author = {Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian and Litz, Heiner and Leber, Christian and Br{\"{u}}ning, Ulrich}, title = {On Achieving High Message Rates}, booktitle = {13th {IEEE/ACM} International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2013, Delft, Netherlands, May 13-16, 2013}, pages = {498--505}, publisher = {{IEEE} Computer Society}, year = {2013}, url = {https://doi.org/10.1109/CCGrid.2013.43}, doi = {10.1109/CCGRID.2013.43}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- GGAS: Global GPU address spaces for efficient communication in heterogeneous clusters. 2013 IEEE International Conference on Cluster Computing, CLUSTER 2013, Indianapolis, IN, USA, September 23-27, 2013, 1–8, IEEE Computer Society, 2013
@inproceedings{DBLP:conf/cluster/OdenF13, author = {Oden, Lena and Fr{\"{o}}ning, Holger}, title = {{GGAS:} Global {GPU} address spaces for efficient communication in heterogeneous clusters}, booktitle = {2013 {IEEE} International Conference on Cluster Computing, {CLUSTER} 2013, Indianapolis, IN, USA, September 23-27, 2013}, pages = {1--8}, publisher = {{IEEE} Computer Society}, year = {2013}, url = {https://doi.org/10.1109/CLUSTER.2013.6702638}, doi = {10.1109/CLUSTER.2013.6702638}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
- Oncilla: A GAS runtime for efficient resource allocation and data movement in accelerated clusters. 2013 IEEE International Conference on Cluster Computing, CLUSTER 2013, Indianapolis, IN, USA, September 23-27, 2013, 1–8, IEEE Computer Society, 2013
@inproceedings{DBLP:conf/cluster/YoungSYMSF13, author = {Young, Jeffrey S. and Shon, Se Hoon and Yalamanchili, Sudhakar and Merritt, Alex and Schwan, Karsten and Fr{\"{o}}ning, Holger}, title = {Oncilla: {A} {GAS} runtime for efficient resource allocation and data movement in accelerated clusters}, booktitle = {2013 {IEEE} International Conference on Cluster Computing, {CLUSTER} 2013, Indianapolis, IN, USA, September 23-27, 2013}, pages = {1--8}, publisher = {{IEEE} Computer Society}, year = {2013}, url = {https://doi.org/10.1109/CLUSTER.2013.6702679}, doi = {10.1109/CLUSTER.2013.6702679}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
- Data Movement Options in Accelerated Clusters. Euro-Par 2013: Parallel Processing Workshops - BigDataCloud, DIHC, FedICI, HeteroPar, HiBB, LSDVE, MHPC, OMHI, PADABS, PROPER, Resilience, ROME, and UCHPC 2013, Aachen, Germany, August 26-27, 2013. Revised Selected Papers (Lecture Notes in Computer Science), 8374, 418–422, Springer, 2013
@inproceedings{DBLP:conf/europar/Froning13, author = {Fr{\"{o}}ning, Holger}, editor = {an Mey, Dieter and Alexander, Michael and Bientinesi, Paolo and Cannataro, Mario and Clauss, Carsten and Costan, Alexandru and Kecskemeti, Gabor and Morin, Christine and Ricci, Laura and Sahuquillo, Julio and Schulz, Martin and Scarano, Vittorio and Scott, Stephen L. and Weidendorfer, Josef}, title = {Data Movement Options in Accelerated Clusters}, booktitle = {Euro-Par 2013: Parallel Processing Workshops - BigDataCloud, DIHC, FedICI, HeteroPar, HiBB, LSDVE, MHPC, OMHI, PADABS, PROPER, Resilience, ROME, and {UCHPC} 2013, Aachen, Germany, August 26-27, 2013. Revised Selected Papers}, series = {Lecture Notes in Computer Science}, volume = {8374}, pages = {418--422}, publisher = {Springer}, year = {2013}, url = {https://doi.org/10.1007/978-3-642-54420-0\_41}, doi = {10.1007/978-3-642-54420-0\_41}, timestamp = {Wed, 19 Feb 2020 14:52:57 +0100}, }
2012
- A new degree of freedom for memory allocation in clusters. Clust. Comput., 15(2), 101–123, 2012
@article{DBLP:journals/cluster/MontanerSFD12, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, title = {A new degree of freedom for memory allocation in clusters}, journal = {Clust. Comput.}, volume = {15}, number = {2}, pages = {101--123}, year = {2012}, url = {https://doi.org/10.1007/s10586-010-0150-7}, doi = {10.1007/S10586-010-0150-7}, timestamp = {Sun, 02 Oct 2022 01:00:00 +0200}, }
- A New End-to-End Flow-Control Mechanism for High Performance Computing Clusters. 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012, Beijing, China, September 24-28, 2012, 320–328, IEEE Computer Society, 2012
@inproceedings{DBLP:conf/cluster/PradesSDFN12, author = {Prades, Javier and Silla, Federico and Duato, Jos{\'{e}} and Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian}, title = {A New End-to-End Flow-Control Mechanism for High Performance Computing Clusters}, booktitle = {2012 {IEEE} International Conference on Cluster Computing, {CLUSTER} 2012, Beijing, China, September 24-28, 2012}, pages = {320--328}, publisher = {{IEEE} Computer Society}, year = {2012}, url = {https://doi.org/10.1109/CLUSTER.2012.15}, doi = {10.1109/CLUSTER.2012.15}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
2011
- MEMSCALE: in-cluster-memory databases. 20th ACM Conference on Information and Knowledge Management, CIKM 2011, Glasgow, United Kingdom, October 24-28, 2011, 2569–2572, ACM, 2011
@inproceedings{DBLP:conf/cikm/MontanerSFD11, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, editor = {Macdonald, Craig and Ounis, Iadh and Ruthven, Ian}, title = {{MEMSCALE:} in-cluster-memory databases}, booktitle = {20th {ACM} Conference on Information and Knowledge Management, {CIKM} 2011, Glasgow, United Kingdom, October 24-28, 2011}, pages = {2569--2572}, publisher = {{ACM}}, year = {2011}, url = {https://doi.org/10.1145/2063576.2064022}, doi = {10.1145/2063576.2064022}, timestamp = {Sun, 02 Oct 2022 01:00:00 +0200}, }
- Highly scalable barriers for future high-performance computing clusters. 18th International Conference on High Performance Computing, HiPC 2011, Bengaluru, India, December 18-21, 2011, 1–10, IEEE Computer Society, 2011
@inproceedings{DBLP:conf/hipc/FroningGMSD11, author = {Fr{\"{o}}ning, Holger and Giese, Alexander and Montaner, H{\'{e}}ctor and Silla, Federico and Duato, Jos{\'{e}}}, title = {Highly scalable barriers for future high-performance computing clusters}, booktitle = {18th International Conference on High Performance Computing, HiPC 2011, Bengaluru, India, December 18-21, 2011}, pages = {1--10}, publisher = {{IEEE} Computer Society}, year = {2011}, url = {https://doi.org/10.1109/HiPC.2011.6152729}, doi = {10.1109/HIPC.2011.6152729}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Unleash Your Memory-Constrained Applications: A 32-Node Non-coherent Distributed-Memory Prototype Cluster. 13th IEEE International Conference on High Performance Computing & Communication, HPCC 2011, Banff, Alberta, Canada, September 2-4, 2011, 9–16, IEEE, 2011
@inproceedings{DBLP:conf/hpcc/MontanerSFD11, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, editor = {Thulasiraman, Parimala and Yang, Laurence Tianruo and Pan, Qiwen and Liu, Xingang and Chen, Yaw{-}Chung and Huang, Yo{-}Ping and Chang, Lin{-}Huang and Hung, Che{-}Lun and Lee, Che{-}Rung and Shi, Justin Y. and Zhang, Ying}, title = {Unleash Your Memory-Constrained Applications: {A} 32-Node Non-coherent Distributed-Memory Prototype Cluster}, booktitle = {13th {IEEE} International Conference on High Performance Computing {\&} Communication, {HPCC} 2011, Banff, Alberta, Canada, September 2-4, 2011}, pages = {9--16}, publisher = {{IEEE}}, year = {2011}, url = {https://doi.org/10.1109/HPCC.2011.12}, doi = {10.1109/HPCC.2011.12}, timestamp = {Sun, 02 Oct 2022 01:00:00 +0200}, }
- MEMSCALE™: A Scalable Environment for Databases. 13th IEEE International Conference on High Performance Computing & Communication, HPCC 2011, Banff, Alberta, Canada, September 2-4, 2011, 339–346, IEEE, 2011
@inproceedings{DBLP:conf/hpcc/MontanerSFD11a, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, editor = {Thulasiraman, Parimala and Yang, Laurence Tianruo and Pan, Qiwen and Liu, Xingang and Chen, Yaw{-}Chung and Huang, Yo{-}Ping and Chang, Lin{-}Huang and Hung, Che{-}Lun and Lee, Che{-}Rung and Shi, Justin Y. and Zhang, Ying}, title = {MEMSCALE\({}^{\mbox{TM}}\): {A} Scalable Environment for Databases}, booktitle = {13th {IEEE} International Conference on High Performance Computing {\&} Communication, {HPCC} 2011, Banff, Alberta, Canada, September 2-4, 2011}, pages = {339--346}, publisher = {{IEEE}}, year = {2011}, url = {https://doi.org/10.1109/HPCC.2011.51}, doi = {10.1109/HPCC.2011.51}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Network Interfaces. Encyclopedia of Parallel Computing, 1292–1298, Springer, 2011
@incollection{DBLP:reference/parallel/Froning11, author = {Fr{\"{o}}ning, Holger}, editor = {Padua, David A.}, title = {Network Interfaces}, booktitle = {Encyclopedia of Parallel Computing}, pages = {1292--1298}, publisher = {Springer}, year = {2011}, url = {https://doi.org/10.1007/978-0-387-09766-4\_319}, doi = {10.1007/978-0-387-09766-4\_319}, timestamp = {Wed, 12 Jul 2017 01:00:00 +0200}, }
2010
- Getting Rid of Coherency Overhead for Memory-Hungry Applications. IEEE International Conference on Cluster Computing, Heraklion, Crete, Greece, 20-24 September, 2010, 48–57, IEEE Computer Society, 2010
@inproceedings{DBLP:conf/cluster/MontanerSFD10, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, title = {Getting Rid of Coherency Overhead for Memory-Hungry Applications}, booktitle = {{IEEE} International Conference on Cluster Computing, Heraklion, Crete, Greece, 20-24 September, 2010}, pages = {48--57}, publisher = {{IEEE} Computer Society}, year = {2010}, url = {https://doi.org/10.1109/CLUSTER.2010.14}, doi = {10.1109/CLUSTER.2010.14}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
- Efficient hardware support for the Partitioned Global Address Space. 24th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Workshop Proceedings, 1–6, IEEE, 2010
@inproceedings{DBLP:conf/ipps/FroningL10, author = {Fr{\"{o}}ning, Holger and Litz, Heiner}, title = {Efficient hardware support for the Partitioned Global Address Space}, booktitle = {24th {IEEE} International Symposium on Parallel and Distributed Processing, {IPDPS} 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Workshop Proceedings}, pages = {1--6}, publisher = {{IEEE}}, year = {2010}, url = {https://doi.org/10.1109/IPDPSW.2010.5470851}, doi = {10.1109/IPDPSW.2010.5470851}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
2009
- A HyperTransport 3 Physical Layer Interface for FPGAs. Reconfigurable Computing: Architectures, Tools and Applications, 5th International Workshop, ARC 2009, Karlsruhe, Germany, March 16-18, 2009. Proceedings (Lecture Notes in Computer Science), 5453, 4–14, Springer, 2009
@inproceedings{DBLP:conf/arc/LitzFB09, author = {Litz, Heiner and Fr{\"{o}}ning, Holger and Br{\"{u}}ning, Ulrich}, editor = {Becker, J{\"{u}}rgen and Woods, Roger F. and Athanas, Peter M. and Morgan, Fearghal}, title = {A HyperTransport 3 Physical Layer Interface for FPGAs}, booktitle = {Reconfigurable Computing: Architectures, Tools and Applications, 5th International Workshop, {ARC} 2009, Karlsruhe, Germany, March 16-18, 2009. Proceedings}, series = {Lecture Notes in Computer Science}, volume = {5453}, pages = {4--14}, publisher = {Springer}, year = {2009}, url = {https://doi.org/10.1007/978-3-642-00641-8\_4}, doi = {10.1007/978-3-642-00641-8\_4}, timestamp = {Fri, 19 Jul 2019 13:02:47 +0200}, }
- An FPGA based verification platform for HyperTransport 3.x. 19th International Conference on Field Programmable Logic and Applications, FPL 2009, August 31 - September 2, 2009, Prague, Czech Republic, 631–634, IEEE, 2009
@inproceedings{DBLP:conf/fpl/LitzFTB09, author = {Litz, Heiner and Fr{\"{o}}ning, Holger and Th{\"{u}}rmer, Maximilian and Br{\"{u}}ning, Ulrich}, editor = {Danek, Martin and Kadlec, Jiri and Nelson, Brent E.}, title = {An {FPGA} based verification platform for HyperTransport 3.x}, booktitle = {19th International Conference on Field Programmable Logic and Applications, {FPL} 2009, August 31 - September 2, 2009, Prague, Czech Republic}, pages = {631--634}, publisher = {{IEEE}}, year = {2009}, url = {https://doi.org/10.1109/FPL.2009.5272393}, doi = {10.1109/FPL.2009.5272393}, timestamp = {Wed, 16 Oct 2019 14:14:53 +0200}, }
- Efficient Virtualization of High-Performance Network Interfaces. The Eighth International Conference on Networks, ICN 2009, 1-6 March 2009, Gosier, Guadeloupe, France, 434–439, IEEE Computer Society, 2009
@inproceedings{DBLP:conf/icn/FroningLB09, author = {Fr{\"{o}}ning, Holger and Litz, Heiner and Br{\"{u}}ning, Ulrich}, editor = {Bestak, Robert and George, Laurent and Zaborovsky, Vladimir S. and Dini, Cosmin}, title = {Efficient Virtualization of High-Performance Network Interfaces}, booktitle = {The Eighth International Conference on Networks, {ICN} 2009, 1-6 March 2009, Gosier, Guadeloupe, France}, pages = {434--439}, publisher = {{IEEE} Computer Society}, year = {2009}, url = {https://doi.org/10.1109/ICN.2009.23}, doi = {10.1109/ICN.2009.23}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- An FPGA-Based Custom High Performance Interconnection Network. ReConFig’09: 2009 International Conference on Reconfigurable Computing and FPGAs, Cancun, Quintana Roo, Mexico, 9-11 December 2009, Proceedings, 113–118, IEEE Computer Society, 2009
@inproceedings{DBLP:conf/reconfig/NussleGFB09, author = {N{\"{u}}ssle, Mondrian and Geib, Benjamin and Fr{\"{o}}ning, Holger and Br{\"{u}}ning, Ulrich}, editor = {Prasanna, Viktor K. and Torres, Lionel and Cumplido, Ren{\'{e}}}, title = {An FPGA-Based Custom High Performance Interconnection Network}, booktitle = {ReConFig'09: 2009 International Conference on Reconfigurable Computing and FPGAs, Cancun, Quintana Roo, Mexico, 9-11 December 2009, Proceedings}, pages = {113--118}, publisher = {{IEEE} Computer Society}, year = {2009}, url = {https://doi.org/10.1109/ReConFig.2009.23}, doi = {10.1109/RECONFIG.2009.23}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
2008
- VELO: A Novel Communication Engine for Ultra-Low Latency Message Transfers. 2008 International Conference on Parallel Processing, ICPP 2008, September 8-12, 2008, Portland, Oregon, USA, 238–245, IEEE Computer Society, 2008
@inproceedings{DBLP:conf/icpp/LitzFNB08, author = {Litz, Heiner and Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian and Br{\"{u}}ning, Ulrich}, title = {{VELO:} {A} Novel Communication Engine for Ultra-Low Latency Message Transfers}, booktitle = {2008 International Conference on Parallel Processing, {ICPP} 2008, September 8-12, 2008, Portland, Oregon, {USA}}, pages = {238--245}, publisher = {{IEEE} Computer Society}, year = {2008}, url = {https://doi.org/10.1109/ICPP.2008.85}, doi = {10.1109/ICPP.2008.85}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
2007
- Architectural improvements of interconnection network interfaces, 2007
@phdthesis{DBLP:phd/de/Froning2007, author = {Fr{\"{o}}ning, Holger}, title = {Architectural improvements of interconnection network interfaces}, school = {University of Mannheim, Germany}, year = {2007}, url = {https://nbn-resolving.org/urn:nbn:de:bsz:180-madoc-14307}, urn = {urn:nbn:de:bsz:180-madoc-14307}, timestamp = {Sat, 17 Jul 2021 01:00:00 +0200}, }
2005
- Performance Evaluation of the ATOLL Interconnect. IASTED International Conference on Parallel and Distributed Computing and Networks, part of the 23rd Multi-Conference on Applied Informatics, Innsbruck, Austria, February 15-17, 2005, 129–134, IASTED/ACTA Press, 2005
@inproceedings{DBLP:conf/pdcn/FroningNSHB05, author = {Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian and Slogsnat, David and Haspel, Patrick R. and Br{\"{u}}ning, Ulrich}, editor = {Fahringer, Thomas and Hamza, M. H.}, title = {Performance Evaluation of the {ATOLL} Interconnect}, booktitle = {{IASTED} International Conference on Parallel and Distributed Computing and Networks, part of the 23rd Multi-Conference on Applied Informatics, Innsbruck, Austria, February 15-17, 2005}, pages = {129--134}, publisher = {{IASTED/ACTA} Press}, year = {2005}, timestamp = {Sat, 04 Aug 2018 01:00:00 +0200}, }
- Swordfish: A Simulator for High-Performance Networks. International Conference on Parallel and Distributed Computing Systems, PDCS 2005, November 14-16, 2005, Phoenix, AZ, USA, 530–535, IASTED/ACTA Press, 2005
@inproceedings{DBLP:conf/pdcs/NussleFB05, author = {N{\"{u}}ssle, Mondrian and Fr{\"{o}}ning, Holger and Br{\"{u}}ning, Ulrich}, editor = {Zheng, S. Q.}, title = {Swordfish: {A} Simulator for High-Performance Networks}, booktitle = {International Conference on Parallel and Distributed Computing Systems, {PDCS} 2005, November 14-16, 2005, Phoenix, AZ, {USA}}, pages = {530--535}, publisher = {{IASTED/ACTA} Press}, year = {2005}, timestamp = {Wed, 09 Nov 2022 13:58:44 +0100}, }
2002
- ATOLL: Performance and Cost Optimization of a SAN Interconnect. International Conference on Parallel and Distributed Computing Systems, PDCS 2002, November 4-6, 2002, Cambridge, USA, 496–501, IASTED/ACTA Press, 2002
@inproceedings{DBLP:conf/pdcs/BruningFSR02, author = {Br{\"{u}}ning, Ulrich and Fr{\"{o}}ning, Holger and Schulz, Patrick R. and Rzymianowicz, Lars}, editor = {Akl, Selim G. and Gonzalez, Teofilo F.}, title = {{ATOLL:} Performance and Cost Optimization of a {SAN} Interconnect}, booktitle = {International Conference on Parallel and Distributed Computing Systems, {PDCS} 2002, November 4-6, 2002, Cambridge, {USA}}, pages = {496--501}, publisher = {{IASTED/ACTA} Press}, year = {2002}, timestamp = {Sat, 04 Aug 2018 01:00:00 +0200}, }
2024
- Less Memory Means smaller GPUs: Backpropagation with Compressed Activations. CoRR, abs/2409.11902, 2024
@article{barley2024, author = {Barley, Daniel and Fr{{\"o}}ning, Holger}, title = {Less Memory Means smaller GPUs: Backpropagation with Compressed Activations}, year = {2024}, volume = {abs/2409.11902}, journal = {CoRR}, url = {https://arxiv.org/abs/2409.11902}, }
- Resource-Efficient Neural Networks for Embedded Systems. Journal of Machine Learning Research, 25(50), 1–51, 2024
@article{JMLR:v25:18-566, author = {Roth, Wolfgang and Schindler, G{{\"u}}nther and Klein, Bernhard and Peharz, Robert and Tschiatschek, Sebastian and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Ghahramani, Zoubin}, title = {Resource-Efficient Neural Networks for Embedded Systems}, journal = {Journal of Machine Learning Research}, year = {2024}, volume = {25}, number = {50}, pages = {1--51}, url = {http://jmlr.org/papers/v25/18-566.html}, }
- Walking Noise: On Layer-Specific Robustness of Neural Architectures against Noisy Computations and Associated Characteristic Learning Dynamics. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), 2024
@inproceedings{borras2024, title = {Walking Noise: On Layer-Specific Robustness of Neural Architectures against Noisy Computations and Associated Characteristic Learning Dynamics}, author = {Borras, Hendrik and Klein, Bernhard and Fr{\"{o}}ning, Holger}, booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases}, year = {2024}, series = {ECML-PKDD}, url = {https://doi.org/10.1007/978-3-031-70359-1_3}, }
- Function Space Diversity for Uncertainty Prediction via Repulsive Last-Layer Ensembles. ICML 2024 Workshop on Structured Probabilistic Inference & Generative Modeling, 2024
@inproceedings{steger2024function, title = {Function Space Diversity for Uncertainty Prediction via Repulsive Last-Layer Ensembles}, author = {Steger, Sophie and Knoll, Christian and Klein, Bernhard and Fr{\"o}ning, Holger and Pernkopf, Franz}, booktitle = {ICML 2024 Workshop on Structured Probabilistic Inference {\&} Generative Modeling}, year = {2024}, url = {https://openreview.net/forum?id=FbMN9HjgHI}, }
- Probabilistic Photonic Computing with Chaotic Light. CoRR, abs/2401.17915, 2024
@article{brckerhoffplckelmann2024probabilistic, title = {Probabilistic Photonic Computing with Chaotic Light}, author = {Brückerhoff-Plückelmann, Frank and Borras, Hendrik and Klein, Bernhard and Varri, Akhil and Becker, Marlon and Dijkstra, Jelle and Brückerhoff, Martin and Wright, C. David and Salinga, Martin and Bhaskaran, Harish and Risse, Benjamin and Fr{\"o}ning, Holger and Pernice, Wolfram}, year = {2024}, volume = {abs/2401.17915}, journal = {CoRR}, url = {https://arxiv.org/abs/2401.17915}, }
- DeepHYDRA: A Hybrid Deep Learning and DBSCAN-Based Approach to Time-Series Anomaly Detection in Dynamically-Configured Systems. 38th ACM International Conference on Supercomputing (ICS), 272–285, Association for Computing Machinery, 2024
@inproceedings{10.1145/3650200.3656637, author = {Stehle, Franz Kevin and Vandelli, Wainer and Zahn, Felix and Avolio, Giuseppe and Fr\"{o}ning, Holger}, title = {DeepHYDRA: A Hybrid Deep Learning and DBSCAN-Based Approach to Time-Series Anomaly Detection in Dynamically-Configured Systems}, year = {2024}, isbn = {9798400706103}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, doi = {10.1145/3650200.3656637}, booktitle = {38th ACM International Conference on Supercomputing}, pages = {272–285}, numpages = {14}, location = {Kyoto, Japan}, series = {ICS}, }
- GraphScale: Scalable Processing on FPGAs for HBM and Large Graphs. ACM Trans. Reconfigurable Technol. Syst., 17(2), Association for Computing Machinery, 2024
@article{10.1145/3616497, author = {Dann, Jonas and Ritter, Daniel and Fr\"{o}ning, Holger}, title = {GraphScale: Scalable Processing on FPGAs for HBM and Large Graphs}, year = {2024}, issue_date = {June 2024}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, volume = {17}, number = {2}, issn = {1936-7406}, url = {https://doi.org/10.1145/3616497}, doi = {10.1145/3616497}, journal = {ACM Trans. Reconfigurable Technol. Syst.}, month = mar, articleno = {22}, numpages = {23}, keywords = {FPGA, Graph processing, HBM}, }
- Random telegraph noise characteristic of nonvolatile resistive random access memories based on optical interference principle. Japanese Journal of Applied Physics, 63(3), 031003, IOP Publishing, 2024
@article{Qin_2024, doi = {10.35848/1347-4065/ad26d1}, url = {https://dx.doi.org/10.35848/1347-4065/ad26d1}, year = {2024}, month = mar, publisher = {IOP Publishing}, volume = {63}, number = {3}, pages = {031003}, author = {Qin, Sichen and Zhang, Guiquan and Zhang, Jia-Wei and Zhao, Yu and Song, Chen and Emonds, Yannick and Fröning, Holger}, title = {Random telegraph noise characteristic of nonvolatile resistive random access memories based on optical interference principle}, journal = {Japanese Journal of Applied Physics}, }
- GraphMatch: Subgraph Query Processing on FPGAs. CoRR, abs/2402.17559, 2024
@article{dann2024graphmatch, title = {GraphMatch: Subgraph Query Processing on FPGAs}, author = {Dann, Jonas and Götz, Tobias and Ritter, Daniel and Giceva, Jana and Fröning, Holger}, year = {2024}, volume = {abs/2402.17559}, journal = {CoRR}, url = {https://arxiv.org/abs/2402.17559}, doi = {10.48550/ARXIV.2402.17559}, }
- Implications of Noise in Resistive Memory on Deep Neural Networks for Image Classification. CoRR, abs/2401.05820, 2024
@article{DBLP:journals/corr/abs-2401-05820, author = {Emonds, Yannick and Xi, Kai and Fröning, Holger}, title = {Implications of Noise in Resistive Memory on Deep Neural Networks for Image Classification}, journal = {CoRR}, volume = {abs/2401.05820}, year = {2024}, url = {https://arxiv.org/abs/2401.05820}, doi = {10.48550/ARXIV.2401.05820}, eprinttype = {arXiv}, eprint = {2401.05820}, timestamp = {Thu, 25 Jan 2024 00:00:00 +0100}, }
2023
- Characterization of data compression across CPU platforms and accelerators. Concurr. Comput. Pract. Exp., 35(20), 2023
@article{DBLP:journals/concurrency/PrombergerSF23, author = {Promberger, Laura and Schwemmer, Rainer and Fr{\"{o}}ning, Holger}, title = {Characterization of data compression across {CPU} platforms and accelerators}, journal = {Concurr. Comput. Pract. Exp.}, volume = {35}, number = {20}, year = {2023}, url = {https://doi.org/10.1002/cpe.6465}, doi = {10.1002/CPE.6465}, timestamp = {Thu, 14 Sep 2023 01:00:00 +0200}, }
- Non-relational Databases on FPGAs: Survey, Design Decisions, Challenges. ACM Comput. Surv., 55(11), 225:1–225:37, 2023
@article{DBLP:journals/csur/DannRF23, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, title = {Non-relational Databases on FPGAs: Survey, Design Decisions, Challenges}, journal = {{ACM} Comput. Surv.}, volume = {55}, number = {11}, pages = {225:1--225:37}, year = {2023}, url = {https://doi.org/10.1145/3568990}, doi = {10.1145/3568990}, timestamp = {Fri, 02 Jun 2023 01:00:00 +0200}, }
- CUDAsap: Statically-Determined Execution Statistics as Alternative to Execution-Based Profiling. 23rd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID), 119–130, IEEE, 2023
@inproceedings{DBLP:conf/ccgrid/EmondsBF23, author = {Emonds, Yannick and Braun, Lorenz and Fr{\"{o}}ning, Holger}, editor = {Simmhan, Yogesh and Altintas, Ilkay and Varbanescu, Ana Lucia and Balaji, Pavan and Prasad, Abhinandan S. and Carnevale, Lorenzo}, title = {CUDAsap: Statically-Determined Execution Statistics as Alternative to Execution-Based Profiling}, booktitle = {23rd {IEEE/ACM} International Symposium on Cluster, Cloud and Internet Computing}, address = {Bangalore, India}, series = {CCGRID}, pages = {119--130}, publisher = {{IEEE}}, year = {2023}, url = {https://doi.org/10.1109/CCGrid57682.2023.00021}, doi = {10.1109/CCGRID57682.2023.00021}, timestamp = {Fri, 21 Jul 2023 22:25:52 +0200}, }
- Implementation Techniques for SPMD Kernels on CPUs. International Workshop on OpenCL, IWOCL 2023, Cambridge, United Kingdom, April 18-20, 2023, 1:1–1:12, ACM, 2023
@inproceedings{DBLP:conf/iwocl/0003AHFH23, author = {Meyer, Joachim and Alpay, Aksel and Hack, Sebastian and Fr{\"{o}}ning, Holger and Heuveline, Vincent}, title = {Implementation Techniques for {SPMD} Kernels on CPUs}, booktitle = {International Workshop on OpenCL, {IWOCL} 2023, Cambridge, United Kingdom, April 18-20, 2023}, pages = {1:1--1:12}, publisher = {{ACM}}, year = {2023}, url = {https://doi.org/10.1145/3585341.3585342}, doi = {10.1145/3585341.3585342}, timestamp = {Sat, 29 Apr 2023 01:00:00 +0200}, }
- Reducing Memory Requirements for the IPU using Butterfly Factorizations. SC ’23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, SC-W 2023, Denver, CO, USA, November 12-17, 2023, 1255–1263, ACM, 2023
@inproceedings{DBLP:conf/sc/ShekoftehAF23, author = {Shekofteh, S. Kazem and Alles, Christian and Fr{\"{o}}ning, Holger}, title = {Reducing Memory Requirements for the {IPU} using Butterfly Factorizations}, booktitle = {{SC} '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, {SC-W} 2023, Denver, CO, USA, November 12-17, 2023}, pages = {1255--1263}, publisher = {{ACM}}, year = {2023}, url = {https://doi.org/10.1145/3624062.3624196}, doi = {10.1145/3624062.3624196}, timestamp = {Tue, 28 Nov 2023 00:00:00 +0100}, }
- Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of ECML PKDD 2022, Grenoble, France, September 19-23, 2022, Proceedings, Part I. Communications in Computer and Information Science, 1752, Springer, 2023
@proceedings{DBLP:conf/pkdd/2022-w1, editor = {Koprinska, Irena and Mignone, Paolo and Guidotti, Riccardo and Jaroszewicz, Szymon and Fr{\"{o}}ning, Holger and Gullo, Francesco and Ferreira, Pedro M. and Roqueiro, Damian and Ceddia, Gaia and Nowaczyk, Slawomir and Gama, Jo{\~{a}}o and Ribeiro, Rita P. and Gavald{\`{a}}, Ricard and Masciari, Elio and Ras, Zbigniew W. and Ritacco, Ettore and Naretto, Francesca and Theissler, Andreas and Biecek, Przemyslaw and Verbeke, Wouter and Schiele, Gregor and Pernkopf, Franz and Blott, Michaela and Bordino, Ilaria and Danesi, Ivan Luciano and Ponti, Giovanni and Severini, Lorenzo and Appice, Annalisa and Andresini, Giuseppina and Medeiros, Ib{\'{e}}ria and Gra{\c{c}}a, Guilherme and Cooper, Lee A. D. and Ghazaleh, Naghmeh and Richiardi, Jonas and Miranda, Diego Saldana and Sechidis, Konstantinos and Canakoglu, Arif and Pid{\`{o}}, Sara and Pinoli, Pietro and Bifet, Albert and Pashami, Sepideh}, title = {Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of {ECML} {PKDD} 2022, Grenoble, France, September 19-23, 2022, Proceedings, Part {I}}, series = {Communications in Computer and Information Science}, volume = {1752}, publisher = {Springer}, year = {2023}, url = {https://doi.org/10.1007/978-3-031-23618-1}, doi = {10.1007/978-3-031-23618-1}, isbn = {978-3-031-23617-4}, timestamp = {Mon, 26 Jun 2023 01:00:00 +0200}, }
- Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of ECML PKDD 2022, Grenoble, France, September 19-23, 2022, Proceedings, Part II. Communications in Computer and Information Science, 1753, Springer, 2023
@proceedings{DBLP:conf/pkdd/2022-w2, editor = {Koprinska, Irena and Mignone, Paolo and Guidotti, Riccardo and Jaroszewicz, Szymon and Fr{\"{o}}ning, Holger and Gullo, Francesco and Ferreira, Pedro M. and Roqueiro, Damian and Ceddia, Gaia and Nowaczyk, Slawomir and Gama, Jo{\~{a}}o and Ribeiro, Rita P. and Gavald{\`{a}}, Ricard and Masciari, Elio and Ras, Zbigniew W. and Ritacco, Ettore and Naretto, Francesca and Theissler, Andreas and Biecek, Przemyslaw and Verbeke, Wouter and Schiele, Gregor and Pernkopf, Franz and Blott, Michaela and Bordino, Ilaria and Danesi, Ivan Luciano and Ponti, Giovanni and Severini, Lorenzo and Appice, Annalisa and Andresini, Giuseppina and Medeiros, Ib{\'{e}}ria and Gra{\c{c}}a, Guilherme and Cooper, Lee A. D. and Ghazaleh, Naghmeh and Richiardi, Jonas and Miranda, Diego Saldana and Sechidis, Konstantinos and Canakoglu, Arif and Pid{\`{o}}, Sara and Pinoli, Pietro and Bifet, Albert and Pashami, Sepideh}, title = {Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of {ECML} {PKDD} 2022, Grenoble, France, September 19-23, 2022, Proceedings, Part {II}}, series = {Communications in Computer and Information Science}, volume = {1753}, publisher = {Springer}, year = {2023}, url = {https://doi.org/10.1007/978-3-031-23633-4}, doi = {10.1007/978-3-031-23633-4}, isbn = {978-3-031-23632-7}, timestamp = {Fri, 17 Feb 2023 00:00:00 +0100}, }
- On the Non-Associativity of Analog Computations. CoRR, abs/2309.14292, 2023
@article{DBLP:journals/corr/abs-2309-14292, author = {Kuhn, Lisa and Klein, Bernhard and Fr{\"{o}}ning, Holger}, title = {On the Non-Associativity of Analog Computations}, journal = {CoRR}, volume = {abs/2309.14292}, year = {2023}, url = {https://arxiv.org/abs/2309.14292}, doi = {10.48550/ARXIV.2309.14292}, eprinttype = {arXiv}, eprint = {2309.14292}, timestamp = {Wed, 27 Sep 2023 01:00:00 +0200}, }
- On Performance Analysis of Graphcore IPUs: Analyzing Squared and Skewed Matrix Multiplication. CoRR, abs/2310.00256, 2023
@article{DBLP:journals/corr/abs-2310-00256, author = {Shekofteh, S. Kazem and Alles, Christian and Kochend{\"{o}}rfer, Nils and Fr{\"{o}}ning, Holger}, title = {On Performance Analysis of Graphcore IPUs: Analyzing Squared and Skewed Matrix Multiplication}, journal = {CoRR}, volume = {abs/2310.00256}, year = {2023}, url = {https://arxiv.org/abs/2310.00256}, doi = {10.48550/ARXIV.2310.00256}, eprinttype = {arXiv}, eprint = {2310.00256}, timestamp = {Wed, 18 Oct 2023 01:00:00 +0200}, }
- Compressing the Backward Pass of Large-Scale Neural Architectures by Structured Activation Pruning. CoRR, abs/2311.16883, 2023
@article{DBLP:journals/corr/abs-2311-16883, author = {Barley, Daniel and Fr{\"{o}}ning, Holger}, title = {Compressing the Backward Pass of Large-Scale Neural Architectures by Structured Activation Pruning}, journal = {CoRR}, volume = {abs/2311.16883}, year = {2023}, url = {https://arxiv.org/abs/2311.16883}, doi = {10.48550/ARXIV.2311.16883}, eprinttype = {arXiv}, eprint = {2311.16883}, timestamp = {Mon, 04 Dec 2023 00:00:00 +0100}, }
2022
- QONNX: Representing Arbitrary-Precision Quantized Neural Networks. CoRR, abs/2206.07527, 2022
@article{DBLP:journals/corr/abs-2206-07527, author = {Pappalardo, Alessandro and Umuroglu, Yaman and Blott, Michaela and Mitrevski, Jovan and Hawks, Benjamin and Tran, Nhan and Loncar, Vladimir and Summers, Sioni and Borras, Hendrik and Muhizi, Jules and Trahms, Matthew and Hsu, Shih{-}Chieh and Hauck, Scott and Duarte, Javier M.}, title = {{QONNX:} Representing Arbitrary-Precision Quantized Neural Networks}, journal = {CoRR}, volume = {abs/2206.07527}, year = {2022}, url = {https://arxiv.org/abs/2206.07527}, doi = {10.48550/ARXIV.2206.07527}, eprinttype = {arXiv}, eprint = {2206.07527}, timestamp = {Tue, 21 Jun 2022 17:35:15 +0200}, }
- Open-source FPGA-ML codesign for the MLPerf Tiny Benchmark. CoRR, abs/2206.11791, 2022
@article{DBLP:journals/corr/abs-2206-11791, author = {Borras, Hendrik and Guglielmo, Giuseppe Di and Duarte, Javier M. and Ghielmetti, Nicol{\`{o}} and Hawks, Benjamin and Hauck, Scott and Hsu, Shih{-}Chieh and Kastner, Ryan and Liang, Jason and Meza, Andres and Muhizi, Jules and Nguyen, Tai and Roy, Rushil and Tran, Nhan and Umuroglu, Yaman and Weng, Olivia and Yokuda, Aidan and Blott, Michaela}, title = {Open-source {FPGA-ML} codesign for the MLPerf Tiny Benchmark}, journal = {CoRR}, volume = {abs/2206.11791}, year = {2022}, url = {https://arxiv.org/abs/2206.11791}, doi = {10.48550/ARXIV.2206.11791}, eprinttype = {arXiv}, eprint = {2206.11791}, timestamp = {Wed, 29 Jun 2022 11:10:54 +0200}, }
- Joint Program and Layout Transformations to Enable Convolutional Operators on Specialized Hardware Based on Constraint Programming. ACM Trans. Archit. Code Optim., 19(1), 7:1–7:26, 2022
@article{DBLP:journals/taco/RieberAF22, author = {Rieber, Dennis and Acosta, Axel and Fr{\"{o}}ning, Holger}, title = {Joint Program and Layout Transformations to Enable Convolutional Operators on Specialized Hardware Based on Constraint Programming}, journal = {{ACM} Trans. Archit. Code Optim.}, volume = {19}, number = {1}, pages = {7:1--7:26}, year = {2022}, url = {https://doi.org/10.1145/3487922}, doi = {10.1145/3487922}, timestamp = {Mon, 28 Aug 2023 01:00:00 +0200}, }
- PipeJSON: Parsing JSON at Line Speed on FPGAs. International Conference on Management of Data, DaMoN 2022, Philadelphia, PA, USA, 13 June 2022, 3:1–3:7, ACM, 2022
@inproceedings{DBLP:conf/damon/DannW0FF22, author = {Dann, Jonas and Wagner, Royden and Ritter, Daniel and Faerber, Christian and Fr{\"{o}}ning, Holger}, editor = {Blanas, Spyros and May, Norman}, title = {PipeJSON: Parsing {JSON} at Line Speed on FPGAs}, booktitle = {International Conference on Management of Data, DaMoN 2022, Philadelphia, PA, USA, 13 June 2022}, pages = {3:1--3:7}, publisher = {{ACM}}, year = {2022}, url = {https://doi.org/10.1145/3533737.3535094}, doi = {10.1145/3533737.3535094}, timestamp = {Wed, 15 Jun 2022 13:47:16 +0200}, }
- GraphScale: Scalable Bandwidth-Efficient Graph Processing on FPGAs. 32nd International Conference on Field-Programmable Logic and Applications, FPL 2022, Belfast, United Kingdom, August 29 - Sept. 2, 2022, 24–32, IEEE, 2022
@inproceedings{DBLP:conf/fpl/Dann0F22, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, title = {GraphScale: Scalable Bandwidth-Efficient Graph Processing on FPGAs}, booktitle = {32nd International Conference on Field-Programmable Logic and Applications, {FPL} 2022, Belfast, United Kingdom, August 29 - Sept. 2, 2022}, pages = {24--32}, publisher = {{IEEE}}, year = {2022}, url = {https://doi.org/10.1109/FPL57034.2022.00016}, doi = {10.1109/FPL57034.2022.00016}, timestamp = {Mon, 20 Feb 2023 17:38:16 +0100}, }
- Compiler-aided nd-range parallel-for implementations on CPU in hipSYCL. IWOCL’22: International Workshop on OpenCL, Bristol, United Kingdom, May 10 - 12, 2022, 28:1–28:3, ACM, 2022
@inproceedings{DBLP:conf/iwocl/0003AFH22, author = {Meyer, Joachim and Alpay, Aksel and Fr{\"{o}}ning, Holger and Heuveline, Vincent}, title = {Compiler-aided nd-range parallel-for implementations on {CPU} in hipSYCL}, booktitle = {IWOCL'22: International Workshop on OpenCL, Bristol, United Kingdom, May 10 - 12, 2022}, pages = {28:1--28:3}, publisher = {{ACM}}, year = {2022}, url = {https://doi.org/10.1145/3529538.3530216}, doi = {10.1145/3529538.3530216}, timestamp = {Mon, 26 Jun 2023 01:00:00 +0200}, }
- HW-Aware Initialization of DNN Auto-Tuning to Improve Exploration Time and Robustness. CoRR, abs/2205.15568, 2022
@article{DBLP:journals/corr/abs-2205-15568, author = {Rieber, Dennis and Reiber, Moritz and Bringmann, Oliver and Fr{\"{o}}ning, Holger}, title = {HW-Aware Initialization of {DNN} Auto-Tuning to Improve Exploration Time and Robustness}, journal = {CoRR}, volume = {abs/2205.15568}, year = {2022}, url = {https://arxiv.org/abs/2205.15568}, doi = {10.48550/ARXIV.2205.15568}, eprinttype = {arXiv}, eprint = {2205.15568}, timestamp = {Wed, 01 Jun 2022 01:00:00 +0200}, }
- Towards Hardware-Specific Automatic Compression of Neural Networks. CoRR, abs/2212.07818, 2022
@article{DBLP:journals/corr/abs-2212-07818, author = {Krieger, Torben and Klein, Bernhard and Fr{\"{o}}ning, Holger}, title = {Towards Hardware-Specific Automatic Compression of Neural Networks}, journal = {CoRR}, volume = {abs/2212.07818}, year = {2022}, url = {https://arxiv.org/abs/2212.07818}, doi = {10.48550/ARXIV.2212.07818}, eprinttype = {arXiv}, eprint = {2212.07818}, timestamp = {Mon, 02 Jan 2023 00:00:00 +0100}, }
2021
- A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of GPU Kernels. ACM Trans. Archit. Code Optim., 18(1), 7:1–7:25, 2021
@article{DBLP:journals/taco/BraunNSHF21, author = {Braun, Lorenz and Nikas, Sotirios and Song, Chen and Heuveline, Vincent and Fr{\"{o}}ning, Holger}, title = {A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of {GPU} Kernels}, journal = {{ACM} Trans. Archit. Code Optim.}, volume = {18}, number = {1}, pages = {7:1--7:25}, year = {2021}, url = {https://doi.org/10.1145/3431731}, doi = {10.1145/3431731}, timestamp = {Sat, 30 Sep 2023 01:00:00 +0200}, }
- Exploring Memory Access Patterns for Graph Processing Accelerators. Datenbanksysteme für Business, Technologie und Web (BTW 2021), 19. Fachtagung des GI-Fachbereichs ,,Datenbanken und Informationssysteme" (DBIS), 13.-17. September 2021, Dresden, Germany, Proceedings (LNI), P-311, 101–122, Gesellschaft für Informatik, Bonn, 2021
@inproceedings{DBLP:conf/btw/Dann0F21, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, editor = {Sattler, Kai{-}Uwe and Herschel, Melanie and Lehner, Wolfgang}, title = {Exploring Memory Access Patterns for Graph Processing Accelerators}, booktitle = {Datenbanksysteme f{\"{u}}r Business, Technologie und Web {(BTW} 2021), 19. Fachtagung des GI-Fachbereichs ,,Datenbanken und Informationssysteme" (DBIS), 13.-17. September 2021, Dresden, Germany, Proceedings}, series = {{LNI}}, volume = {{P-311}}, pages = {101--122}, publisher = {Gesellschaft f{\"{u}}r Informatik, Bonn}, year = {2021}, url = {https://doi.org/10.18420/btw2021-05}, doi = {10.18420/BTW2021-05}, timestamp = {Tue, 04 Jul 2023 17:43:09 +0200}, }
- Towards Addressing Noise and Static Variations of Analog Computations Using Efficient Retraining. Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of ECML PKDD 2021, Proceedings Part I (Communications in Computer and Information Science), 1524, 409–420, Springer, 2021
@inproceedings{DBLP:conf/pkdd/KleinKWESSF21, author = {Klein, Bernhard and Kuhn, Lisa and Weis, Johannes and Emmel, Arne and Stradmann, Yannik and Schemmel, Johannes and Fr{\"{o}}ning, Holger}, editor = {Kamp, Michael and Koprinska, Irena and Bibal, Adrien and Bouadi, Tassadit and Fr{\'{e}}nay, Beno{\^{\i}}t and Gal{\'{a}}rraga, Luis and Oramas, Jos{\'{e}} and Adilova, Linara and Krishnamurthy, Yamuna and Kang, Bo and Largeron, Christine and Lijffijt, Jefrey and Viard, Tiphaine and Welke, Pascal and Ruocco, Massimiliano and Aune, Erlend and Gallicchio, Claudio and Schiele, Gregor and Pernkopf, Franz and Blott, Michaela and Fr{\"{o}}ning, Holger and Schindler, G{\"{u}}nther and Guidotti, Riccardo and Monreale, Anna and Rinzivillo, Salvatore and Biecek, Przemyslaw and Ntoutsi, Eirini and Pechenizkiy, Mykola and Rosenhahn, Bodo and Buckley, Christopher L. and Cialfi, Daniela and Lanillos, Pablo and Ramstead, Maxwell and Verbelen, Tim and Ferreira, Pedro M. and Andresini, Giuseppina and Malerba, Donato and Medeiros, Ib{\'{e}}ria and Fournier{-}Viger, Philippe and Nawaz, M. Saqib and Ventura, Sebasti{\'{a}}n and Sun, Meng and Zhou, Min and Bitetta, Valerio and Bordino, Ilaria and Ferretti, Andrea and Gullo, Francesco and Ponti, Giovanni and Severini, Lorenzo and Ribeiro, Rita P. and Gama, Jo{\~{a}}o and Gavald{\`{a}}, Ricard and Cooper, Lee A. D. 
and Ghazaleh, Naghmeh and Richiardi, Jonas and Roqueiro, Damian and Miranda, Diego Saldana and Sechidis, Konstantinos and Gra{\c{c}}a, Guilherme}, title = {Towards Addressing Noise and Static Variations of Analog Computations Using Efficient Retraining}, booktitle = {Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of {ECML} {PKDD} 2021, Proceedings Part {I}}, series = {Communications in Computer and Information Science}, volume = {1524}, pages = {409--420}, publisher = {Springer}, year = {2021}, url = {https://doi.org/10.1007/978-3-030-93736-2_32}, doi = {10.1007/978-3-030-93736-2\_32}, }
- Demystifying memory access patterns of FPGA-based graph processing accelerators. GRADES-NDA ’21: 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), Virtual Event, China, 20 June 2021, 3:1–3:10, ACM, 2021
@inproceedings{DBLP:conf/sigmod/Dann0F21, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, editor = {Kalavri, Vasiliki and Yakovets, Nikolay}, title = {Demystifying memory access patterns of FPGA-based graph processing accelerators}, booktitle = {{GRADES-NDA} '21: 4th {ACM} {SIGMOD} Joint International Workshop on Graph Data Management Experiences {\&} Systems {(GRADES)} and Network Data Analytics (NDA), Virtual Event, China, 20 June 2021}, pages = {3:1--3:10}, publisher = {{ACM}}, year = {2021}, url = {https://doi.org/10.1145/3461837.3464512}, doi = {10.1145/3461837.3464512}, timestamp = {Wed, 14 Jul 2021 16:01:02 +0200}, }
- Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of ECML PKDD 2021, Virtual Event, September 13-17, 2021, Proceedings, Part I. Communications in Computer and Information Science, 1524, Springer, 2021
@proceedings{DBLP:conf/pkdd/2021-w1, editor = {Kamp, Michael and Koprinska, Irena and Bibal, Adrien and Bouadi, Tassadit and Fr{\'{e}}nay, Beno{\^{\i}}t and Gal{\'{a}}rraga, Luis and Oramas, Jos{\'{e}} and Adilova, Linara and Krishnamurthy, Yamuna and Kang, Bo and Largeron, Christine and Lijffijt, Jefrey and Viard, Tiphaine and Welke, Pascal and Ruocco, Massimiliano and Aune, Erlend and Gallicchio, Claudio and Schiele, Gregor and Pernkopf, Franz and Blott, Michaela and Fr{\"{o}}ning, Holger and Schindler, G{\"{u}}nther and Guidotti, Riccardo and Monreale, Anna and Rinzivillo, Salvatore and Biecek, Przemyslaw and Ntoutsi, Eirini and Pechenizkiy, Mykola and Rosenhahn, Bodo and Buckley, Christopher L. and Cialfi, Daniela and Lanillos, Pablo and Ramstead, Maxwell and Verbelen, Tim and Ferreira, Pedro M. and Andresini, Giuseppina and Malerba, Donato and Medeiros, Ib{\'{e}}ria and Fournier{-}Viger, Philippe and Nawaz, M. Saqib and Ventura, Sebasti{\'{a}}n and Sun, Meng and Zhou, Min and Bitetta, Valerio and Bordino, Ilaria and Ferretti, Andrea and Gullo, Francesco and Ponti, Giovanni and Severini, Lorenzo and Ribeiro, Rita P. and Gama, Jo{\~{a}}o and Gavald{\`{a}}, Ricard and Cooper, Lee A. D. and Ghazaleh, Naghmeh and Richiardi, Jonas and Roqueiro, Damian and Miranda, Diego Saldana and Sechidis, Konstantinos and Gra{\c{c}}a, Guilherme}, title = {Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of {ECML} {PKDD} 2021, Virtual Event, September 13-17, 2021, Proceedings, Part {I}}, series = {Communications in Computer and Information Science}, volume = {1524}, publisher = {Springer}, year = {2021}, url = {https://doi.org/10.1007/978-3-030-93736-2}, doi = {10.1007/978-3-030-93736-2}, isbn = {978-3-030-93735-5}, timestamp = {Sat, 12 Mar 2022 00:00:00 +0100}, }
- Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of ECML PKDD 2021, Virtual Event, September 13-17, 2021, Proceedings, Part II. Communications in Computer and Information Science, 1525, Springer, 2021
@proceedings{DBLP:conf/pkdd/2021-w2, editor = {Kamp, Michael and Koprinska, Irena and Bibal, Adrien and Bouadi, Tassadit and Fr{\'{e}}nay, Beno{\^{\i}}t and Gal{\'{a}}rraga, Luis and Oramas, Jos{\'{e}} and Adilova, Linara and Krishnamurthy, Yamuna and Kang, Bo and Largeron, Christine and Lijffijt, Jefrey and Viard, Tiphaine and Welke, Pascal and Ruocco, Massimiliano and Aune, Erlend and Gallicchio, Claudio and Schiele, Gregor and Pernkopf, Franz and Blott, Michaela and Fr{\"{o}}ning, Holger and Schindler, G{\"{u}}nther and Guidotti, Riccardo and Monreale, Anna and Rinzivillo, Salvatore and Biecek, Przemyslaw and Ntoutsi, Eirini and Pechenizkiy, Mykola and Rosenhahn, Bodo and Buckley, Christopher L. and Cialfi, Daniela and Lanillos, Pablo and Ramstead, Maxwell and Verbelen, Tim and Ferreira, Pedro M. and Andresini, Giuseppina and Malerba, Donato and Medeiros, Ib{\'{e}}ria and Fournier{-}Viger, Philippe and Nawaz, M. Saqib and Ventura, Sebasti{\'{a}}n and Sun, Meng and Zhou, Min and Bitetta, Valerio and Bordino, Ilaria and Ferretti, Andrea and Gullo, Francesco and Ponti, Giovanni and Severini, Lorenzo and Ribeiro, Rita P. and Gama, Jo{\~{a}}o and Gavald{\`{a}}, Ricard and Cooper, Lee A. D. and Ghazaleh, Naghmeh and Richiardi, Jonas and Roqueiro, Damian and Miranda, Diego Saldana and Sechidis, Konstantinos and Gra{\c{c}}a, Guilherme}, title = {Machine Learning and Principles and Practice of Knowledge Discovery in Databases - International Workshops of {ECML} {PKDD} 2021, Virtual Event, September 13-17, 2021, Proceedings, Part {II}}, series = {Communications in Computer and Information Science}, volume = {1525}, publisher = {Springer}, year = {2021}, url = {https://doi.org/10.1007/978-3-030-93733-1}, doi = {10.1007/978-3-030-93733-1}, isbn = {978-3-030-93732-4}, timestamp = {Sat, 12 Mar 2022 00:00:00 +0100}, }
- Understanding Cache Boundness of ML Operators on ARM Processors. CoRR, abs/2102.00932, 2021
@article{DBLP:journals/corr/abs-2102-00932, author = {Klein, Bernhard and Gratl, Christoph and M{\"{u}}cke, Manfred and Fr{\"{o}}ning, Holger}, title = {Understanding Cache Boundness of {ML} Operators on {ARM} Processors}, journal = {CoRR}, volume = {abs/2102.00932}, year = {2021}, url = {https://arxiv.org/abs/2102.00932}, eprinttype = {arXiv}, eprint = {2102.00932}, timestamp = {Thu, 14 Oct 2021 01:00:00 +0200}, }
- The Programming of Deep Learning Accelerators as a Constraint Satisfaction Problem. CoRR, abs/2104.04731, 2021
@article{DBLP:journals/corr/abs-2104-04731, author = {Rieber, Dennis and Acosta, Axel and Fr{\"{o}}ning, Holger}, title = {The Programming of Deep Learning Accelerators as a Constraint Satisfaction Problem}, journal = {CoRR}, volume = {abs/2104.04731}, year = {2021}, url = {https://arxiv.org/abs/2104.04731}, eprinttype = {arXiv}, eprint = {2104.04731}, timestamp = {Tue, 22 Feb 2022 00:00:00 +0100}, }
- Scheduling of Graph Queries: Controlling Intra- and Inter-query Parallelism for a High System Throughput. CoRR, abs/2110.10797, 2021
@article{DBLP:journals/corr/abs-2110-10797, author = {Hauck, Matthias and Oukid, Ismail and Fr{\"{o}}ning, Holger}, title = {Scheduling of Graph Queries: Controlling Intra- and Inter-query Parallelism for a High System Throughput}, journal = {CoRR}, volume = {abs/2110.10797}, year = {2021}, url = {https://arxiv.org/abs/2110.10797}, eprinttype = {arXiv}, eprint = {2110.10797}, timestamp = {Thu, 28 Oct 2021 01:00:00 +0200}, }
2020
- cCUDA: Effective Co-Scheduling of Concurrent Kernels on GPUs. IEEE Trans. Parallel Distributed Syst., 31(4), 766–778, 2020
@article{DBLP:journals/tpds/ShekoftehNNFY20, author = {Shekofteh, S. Kazem and Noori, Hamid and Naghibzadeh, Mahmoud and Fr{\"{o}}ning, Holger and Yazdi, Hadi Sadoghi}, title = {cCUDA: Effective Co-Scheduling of Concurrent Kernels on GPUs}, journal = {{IEEE} Trans. Parallel Distributed Syst.}, volume = {31}, number = {4}, pages = {766--778}, year = {2020}, url = {https://doi.org/10.1109/TPDS.2019.2944602}, doi = {10.1109/TPDS.2019.2944602}, timestamp = {Fri, 02 Oct 2020 01:00:00 +0200}, }
- Towards Real-Time Single-Channel Singing-Voice Separation with Pruned Multi-Scaled Densenets. 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020, 806–810, IEEE, 2020
@inproceedings{DBLP:conf/icassp/HuberSSRPF20, author = {Huber, Markus and Schindler, G{\"{u}}nther and Sch{\"{o}}rkhuber, Christian and Roth, Wolfgang and Pernkopf, Franz and Fr{\"{o}}ning, Holger}, title = {Towards Real-Time Single-Channel Singing-Voice Separation with Pruned Multi-Scaled Densenets}, booktitle = {2020 {IEEE} International Conference on Acoustics, Speech and Signal Processing, {ICASSP} 2020, Barcelona, Spain, May 4-8, 2020}, pages = {806--810}, publisher = {{IEEE}}, year = {2020}, url = {https://doi.org/10.1109/ICASSP40776.2020.9053542}, doi = {10.1109/ICASSP40776.2020.9053542}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- On Network Locality in MPI-Based HPC Applications. ICPP 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020, 57:1–57:10, ACM, 2020
@inproceedings{DBLP:conf/icpp/ZahnF20, author = {Zahn, Felix and Fr{\"{o}}ning, Holger}, editor = {Amaral, Jos{\'{e}} Nelson and John, Lizy Kurian and Shen, Xipeng}, title = {On Network Locality in MPI-Based {HPC} Applications}, booktitle = {{ICPP} 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020}, pages = {57:1--57:10}, publisher = {{ACM}}, year = {2020}, url = {https://doi.org/10.1145/3404397.3404436}, doi = {10.1145/3404397.3404436}, timestamp = {Wed, 12 Aug 2020 17:44:07 +0200}, }
- Automated Partitioning of Data-Parallel Kernels using Polyhedral Compilation. ICPP Workshops ’20: Workshops, Edmonton, AB, Canada, August 17-20, 2020, 13:1–13:10, ACM, 2020
@inproceedings{DBLP:conf/icppw/MatzDF20, author = {Matz, Alexander and Doerfert, Johannes and Fr{\"{o}}ning, Holger}, editor = {Silla, Federico and Abdelrahman, Tarek S.}, title = {Automated Partitioning of Data-Parallel Kernels using Polyhedral Compilation}, booktitle = {{ICPP} Workshops '20: Workshops, Edmonton, AB, Canada, August 17-20, 2020}, pages = {13:1--13:10}, publisher = {{ACM}}, year = {2020}, url = {https://doi.org/10.1145/3409390.3409403}, doi = {10.1145/3409390.3409403}, timestamp = {Mon, 03 Jan 2022 00:00:00 +0100}, }
- Assessing the Overhead of Offloading Compression Tasks. ICPP Workshops ’20: Workshops, Edmonton, AB, Canada, August 17-20, 2020, 15:1–15:10, ACM, 2020
@inproceedings{DBLP:conf/icppw/PrombergerSF20, author = {Promberger, Laura and Schwemmer, Rainer and Fr{\"{o}}ning, Holger}, editor = {Silla, Federico and Abdelrahman, Tarek S.}, title = {Assessing the Overhead of Offloading Compression Tasks}, booktitle = {{ICPP} Workshops '20: Workshops, Edmonton, AB, Canada, August 17-20, 2020}, pages = {15:1--15:10}, publisher = {{ACM}}, year = {2020}, url = {https://doi.org/10.1145/3409390.3409405}, doi = {10.1145/3409390.3409405}, timestamp = {Wed, 15 Dec 2021 00:00:00 +0100}, }
- On Resource-Efficient Bayesian Network Classifiers and Deep Neural Networks. 25th International Conference on Pattern Recognition, ICPR 2020, Virtual Event / Milan, Italy, January 10-15, 2021, 10297–10304, IEEE, 2020
@inproceedings{DBLP:conf/icpr/RothPSF20, author = {Roth, Wolfgang and Pernkopf, Franz and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger}, title = {On Resource-Efficient Bayesian Network Classifiers and Deep Neural Networks}, booktitle = {25th International Conference on Pattern Recognition, {ICPR} 2020, Virtual Event / Milan, Italy, January 10-15, 2021}, pages = {10297--10304}, publisher = {{IEEE}}, year = {2020}, url = {https://doi.org/10.1109/ICPR48806.2021.9413156}, doi = {10.1109/ICPR48806.2021.9413156}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- Parameterized Structured Pruning for Deep Neural Networks. Machine Learning, Optimization, and Data Science - 6th International Conference, LOD 2020, Siena, Italy, July 19-23, 2020, Revised Selected Papers, Part II (Lecture Notes in Computer Science), 12566, 16–27, Springer, 2020
@inproceedings{DBLP:conf/mod/SchindlerRPF20, author = {Schindler, G{\"{u}}nther and Roth, Wolfgang and Pernkopf, Franz and Fr{\"{o}}ning, Holger}, editor = {Nicosia, Giuseppe and Ojha, Varun and Malfa, Emanuele La and Jansen, Giorgio and Sciacca, Vincenzo and Pardalos, Panos M. and Giuffrida, Giovanni and Umeton, Renato}, title = {Parameterized Structured Pruning for Deep Neural Networks}, booktitle = {Machine Learning, Optimization, and Data Science - 6th International Conference, {LOD} 2020, Siena, Italy, July 19-23, 2020, Revised Selected Papers, Part {II}}, series = {Lecture Notes in Computer Science}, volume = {12566}, pages = {16--27}, publisher = {Springer}, year = {2020}, url = {https://doi.org/10.1007/978-3-030-64580-9_3}, doi = {10.1007/978-3-030-64580-9\_3}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- Search Space Complexity of Iteration Domain Based Instruction Embedding for Deep Learning Accelerators. IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, ITEM 2020, Co-located with ECML/PKDD 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers (Communications in Computer and Information Science), 1325, 213–228, Springer, 2020
@inproceedings{DBLP:conf/pkdd/RieberF20, author = {Rieber, Dennis and Fr{\"{o}}ning, Holger}, editor = {Gama, Jo{\~{a}}o and Pashami, Sepideh and Bifet, Albert and {Sayed Mouchaweh}, Moamar and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Schiele, Gregor and Blott, Michaela}, title = {Search Space Complexity of Iteration Domain Based Instruction Embedding for Deep Learning Accelerators}, booktitle = {IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, {ITEM} 2020, Co-located with {ECML/PKDD} 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers}, series = {Communications in Computer and Information Science}, volume = {1325}, pages = {213--228}, publisher = {Springer}, year = {2020}, url = {https://doi.org/10.1007/978-3-030-66770-2_16}, doi = {10.1007/978-3-030-66770-2\_16}, timestamp = {Wed, 07 Apr 2021 01:00:00 +0200}, }
- On the Difficulty of Designing Processor Arrays for Deep Neural Networks. IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, ITEM 2020, Co-located with ECML/PKDD 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers (Communications in Computer and Information Science), 1325, 229–240, Springer, 2020
@inproceedings{DBLP:conf/pkdd/StehleSF20, author = {Stehle, Kevin and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger}, editor = {Gama, Jo{\~{a}}o and Pashami, Sepideh and Bifet, Albert and {Sayed Mouchaweh}, Moamar and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Schiele, Gregor and Blott, Michaela}, title = {On the Difficulty of Designing Processor Arrays for Deep Neural Networks}, booktitle = {IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, {ITEM} 2020, Co-located with {ECML/PKDD} 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers}, series = {Communications in Computer and Information Science}, volume = {1325}, pages = {229--240}, publisher = {Springer}, year = {2020}, url = {https://doi.org/10.1007/978-3-030-66770-2\_17}, doi = {10.1007/978-3-030-66770-2\_17}, timestamp = {Mon, 15 Feb 2021 00:00:00 +0100}, }
- IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, ITEM 2020, Co-located with ECML/PKDD 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers. Communications in Computer and Information Science, 1325, Springer, 2020
@proceedings{DBLP:conf/pkdd/2020iotstreams, editor = {Gama, Jo{\~{a}}o and Pashami, Sepideh and Bifet, Albert and {Sayed Mouchaweh}, Moamar and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Schiele, Gregor and Blott, Michaela}, title = {IoT Streams for Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning - Second International Workshop, IoT Streams 2020, and First International Workshop, {ITEM} 2020, Co-located with {ECML/PKDD} 2020, Ghent, Belgium, September 14-18, 2020, Revised Selected Papers}, series = {Communications in Computer and Information Science}, volume = {1325}, publisher = {Springer}, year = {2020}, url = {https://doi.org/10.1007/978-3-030-66770-2}, doi = {10.1007/978-3-030-66770-2}, isbn = {978-3-030-66769-6}, timestamp = {Tue, 16 Feb 2021 00:00:00 +0100}, }
- Resource-Efficient Neural Networks for Embedded Systems. CoRR, abs/2001.03048, 2020
@article{DBLP:journals/corr/abs-2001-03048, author = {Roth, Wolfgang and Schindler, G{\"{u}}nther and Z{\"{o}}hrer, Matthias and Pfeifenberger, Lukas and Peharz, Robert and Tschiatschek, Sebastian and Fr{\"{o}}ning, Holger and Pernkopf, Franz and Ghahramani, Zoubin}, title = {Resource-Efficient Neural Networks for Embedded Systems}, journal = {CoRR}, volume = {abs/2001.03048}, year = {2020}, url = {http://arxiv.org/abs/2001.03048}, eprinttype = {arXiv}, eprint = {2001.03048}, timestamp = {Mon, 13 Jan 2020 00:00:00 +0100}, }
- A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of GPU Kernels. CoRR, abs/2001.07104, 2020
@article{DBLP:journals/corr/abs-2001-07104, author = {Braun, Lorenz and Nikas, Sotirios and Song, Chen and Heuveline, Vincent and Fr{\"{o}}ning, Holger}, title = {A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of {GPU} Kernels}, journal = {CoRR}, volume = {abs/2001.07104}, year = {2020}, url = {https://arxiv.org/abs/2001.07104}, eprinttype = {arXiv}, eprint = {2001.07104}, timestamp = {Sat, 23 Jan 2021 00:00:00 +0100}, }
- Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement. CoRR, abs/2007.11477, 2020
@article{DBLP:journals/corr/abs-2007-11477, author = {Pfeifenberger, Lukas and Z{\"{o}}hrer, Matthias and Schindler, G{\"{u}}nther and Roth, Wolfgang and Fr{\"{o}}ning, Holger and Pernkopf, Franz}, title = {Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement}, journal = {CoRR}, volume = {abs/2007.11477}, year = {2020}, url = {https://arxiv.org/abs/2007.11477}, eprinttype = {arXiv}, eprint = {2007.11477}, timestamp = {Wed, 29 Jul 2020 01:00:00 +0200}, }
- Exploring Memory Access Patterns for Graph Processing Accelerators. CoRR, abs/2010.13619, 2020
@article{DBLP:journals/corr/abs-2010-13619, author = {Dann, Jonas and Ritter, Daniel and Fr{\"{o}}ning, Holger}, title = {Exploring Memory Access Patterns for Graph Processing Accelerators}, journal = {CoRR}, volume = {abs/2010.13619}, year = {2020}, url = {https://arxiv.org/abs/2010.13619}, eprinttype = {arXiv}, eprint = {2010.13619}, timestamp = {Mon, 02 Nov 2020 00:00:00 +0100}, }
2019
- Constructing virtual 5-dimensional tori out of lower-dimensional network cards. Concurr. Comput. Pract. Exp., 31(2), 2019
@article{DBLP:journals/concurrency/AndujarVSADF19, author = {Andujar, Francisco J. and Villar, Juan A. and S{\'{a}}nchez, Jos{\'{e}} L. and Alfaro, Francisco J. and Duato, Jos{\'{e}} and Fr{\"{o}}ning, Holger}, title = {Constructing virtual 5-dimensional tori out of lower-dimensional network cards}, journal = {Concurr. Comput. Pract. Exp.}, volume = {31}, number = {2}, year = {2019}, url = {https://doi.org/10.1002/cpe.4361}, doi = {10.1002/CPE.4361}, timestamp = {Mon, 02 Mar 2020 00:00:00 +0100}, }
- On link width scaling for energy-proportional direct interconnection networks. Concurr. Comput. Pract. Exp., 31(2), 2019
@article{DBLP:journals/concurrency/ZahnLF19, author = {Zahn, Felix and Lammel, Steffen and Fr{\"{o}}ning, Holger}, title = {On link width scaling for energy-proportional direct interconnection networks}, journal = {Concurr. Comput. Pract. Exp.}, volume = {31}, number = {2}, year = {2019}, url = {https://doi.org/10.1002/cpe.4439}, doi = {10.1002/CPE.4439}, timestamp = {Mon, 02 Mar 2020 00:00:00 +0100}, }
- Metric Selection for GPU Kernel Classification. ACM Trans. Archit. Code Optim., 15(4), 68:1–68:27, 2019
@article{DBLP:journals/taco/ShekoftehNNYF19, author = {Shekofteh, S. Kazem and Noori, Hamid and Naghibzadeh, Mahmoud and Yazdi, Hadi Sadoghi and Fr{\"{o}}ning, Holger}, title = {Metric Selection for {GPU} Kernel Classification}, journal = {{ACM} Trans. Archit. Code Optim.}, volume = {15}, number = {4}, pages = {68:1--68:27}, year = {2019}, url = {https://doi.org/10.1145/3295690}, doi = {10.1145/3295690}, timestamp = {Sat, 08 Jan 2022 00:00:00 +0100}, }
- Quantifying the NUMA Behavior of Partitioned GPGPU Applications. 12th Workshop on General Purpose Processing Using GPUs, GPGPU@ASPLOS 2019, Providence, RI, USA, April 13, 2019, 53–62, ACM, 2019
@inproceedings{DBLP:conf/asplos/MatzF19, author = {Matz, Alexander and Fr{\"{o}}ning, Holger}, editor = {Jog, Adwait and Kayiran, Onur}, title = {Quantifying the {NUMA} Behavior of Partitioned {GPGPU} Applications}, booktitle = {12th Workshop on General Purpose Processing Using GPUs, GPGPU@ASPLOS 2019, Providence, RI, USA, April 13, 2019}, pages = {53--62}, publisher = {{ACM}}, year = {2019}, url = {https://doi.org/10.1145/3300053.3319420}, doi = {10.1145/3300053.3319420}, timestamp = {Tue, 16 Apr 2019 17:25:22 +0200}, }
- Effects of Congestion Management on Energy Saving Techniques in Interconnection Networks. The 5th International Workshop on High-Performance Interconnection Networks in the ExaScale and Big-Data Era, HiPINEB@HPCA 2019, 17 February 2019, Washington, DC, USA, 9–16, IEEE Computer Society, 2019
@inproceedings{DBLP:conf/hpca/ZahnYEGF19, author = {Zahn, Felix and Y{\'{e}}benes, Pedro and Escudero{-}Sahuquillo, Jes{\'{u}}s and Garc{\'{\i}}a, Pedro Javier and Fr{\"{o}}ning, Holger}, title = {Effects of Congestion Management on Energy Saving Techniques in Interconnection Networks}, booktitle = {The 5th International Workshop on High-Performance Interconnection Networks in the ExaScale and Big-Data Era, HiPINEB@HPCA 2019, 17 February 2019, Washington, DC, {USA}}, pages = {9--16}, publisher = {{IEEE} Computer Society}, year = {2019}, url = {https://doi.org/10.1109/HiPINEB.2019.00009}, doi = {10.1109/HIPINEB.2019.00009}, timestamp = {Mon, 03 Jan 2022 00:00:00 +0100}, }
- Software-Based Buffering of Associative Operations on Random Memory Addresses. 2019 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2019, Rio de Janeiro, Brazil, May 20-24, 2019, 943–952, IEEE, 2019
@inproceedings{DBLP:conf/ipps/HauckPF19, author = {Hauck, Matthias and Paradies, Marcus and Fr{\"{o}}ning, Holger}, title = {Software-Based Buffering of Associative Operations on Random Memory Addresses}, booktitle = {2019 {IEEE} International Parallel and Distributed Processing Symposium, {IPDPS} 2019, Rio de Janeiro, Brazil, May 20-24, 2019}, pages = {943--952}, publisher = {{IEEE}}, year = {2019}, url = {https://doi.org/10.1109/IPDPS.2019.00102}, doi = {10.1109/IPDPS.2019.00102}, timestamp = {Wed, 16 Oct 2019 14:14:51 +0200}, }
- Training Discrete-Valued Neural Networks with Sign Activations Using Weight Distributions. Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2019, Würzburg, Germany, September 16-20, 2019, Proceedings, Part II (Lecture Notes in Computer Science), 11907, 382–398, Springer, 2019
@inproceedings{DBLP:conf/pkdd/RothSFP19, author = {Roth, Wolfgang and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger and Pernkopf, Franz}, editor = {Brefeld, Ulf and Fromont, {\'{E}}lisa and Hotho, Andreas and Knobbe, Arno J. and Maathuis, Marloes H. and Robardet, C{\'{e}}line}, title = {Training Discrete-Valued Neural Networks with Sign Activations Using Weight Distributions}, booktitle = {Machine Learning and Knowledge Discovery in Databases - European Conference, {ECML} {PKDD} 2019, W{\"{u}}rzburg, Germany, September 16-20, 2019, Proceedings, Part {II}}, series = {Lecture Notes in Computer Science}, volume = {11907}, pages = {382--398}, publisher = {Springer}, year = {2019}, url = {https://doi.org/10.1007/978-3-030-46147-8\_23}, doi = {10.1007/978-3-030-46147-8\_23}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- CUDA Flux: A Lightweight Instruction Profiler for CUDA Applications. 2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, PMBS@SC 2019, Denver, CO, USA, November 18, 2019, 73–81, IEEE, 2019
@inproceedings{DBLP:conf/sc/BraunF19, author = {Braun, Lorenz and Fr{\"{o}}ning, Holger}, title = {{CUDA} Flux: {A} Lightweight Instruction Profiler for {CUDA} Applications}, booktitle = {2019 {IEEE/ACM} Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, PMBS@SC 2019, Denver, CO, USA, November 18, 2019}, pages = {73--81}, publisher = {{IEEE}}, year = {2019}, url = {https://doi.org/10.1109/PMBS49563.2019.00014}, doi = {10.1109/PMBS49563.2019.00014}, timestamp = {Wed, 22 Apr 2020 16:43:07 +0200}, }
2018
- Heterogeneous and unconventional cluster architectures and applications. Concurr. Comput. Pract. Exp., 30(17), 2018
@article{DBLP:journals/concurrency/FroningS18, author = {Fr{\"{o}}ning, Holger and Silla, Federico}, title = {Heterogeneous and unconventional cluster architectures and applications}, journal = {Concurr. Comput. Pract. Exp.}, volume = {30}, number = {17}, year = {2018}, url = {https://doi.org/10.1002/cpe.4661}, doi = {10.1002/CPE.4661}, timestamp = {Mon, 02 Mar 2020 00:00:00 +0100}, }
- Buffer Provisioning for Large-Scale Data-Acquisition Systems. 12th ACM International Conference on Distributed and Event-based Systems, DEBS 2018, Hamilton, New Zealand, June 25-29, 2018, 100–111, ACM, 2018
@inproceedings{DBLP:conf/debs/SantosVGF18, author = {Santos, Alejandro and Vandelli, Wainer and Garc{\'{\i}}a, Pedro Javier and Fr{\"{o}}ning, Holger}, editor = {Hinze, Annika and Eyers, David M. and Hirzel, Martin and Weidlich, Matthias and Bhowmik, Sukanya}, title = {Buffer Provisioning for Large-Scale Data-Acquisition Systems}, booktitle = {12th {ACM} International Conference on Distributed and Event-based Systems, {DEBS} 2018, Hamilton, New Zealand, June 25-29, 2018}, pages = {100--111}, publisher = {{ACM}}, year = {2018}, url = {https://doi.org/10.1145/3210284.3210288}, doi = {10.1145/3210284.3210288}, timestamp = {Fri, 26 May 2023 07:40:34 +0200}, }
- Evaluating Energy-Saving Strategies on Torus, K-Ary N-Tree, and Dragonfly. 4th IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2018, Vienna, Austria, February 24, 2018, 16–23, IEEE Computer Society, 2018
@inproceedings{DBLP:conf/hpca/ZahnSF18, author = {Zahn, Felix and Schoffer, Armin and Fr{\"{o}}ning, Holger}, title = {Evaluating Energy-Saving Strategies on Torus, K-Ary N-Tree, and Dragonfly}, booktitle = {4th {IEEE} International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2018, Vienna, Austria, February 24, 2018}, pages = {16--23}, publisher = {{IEEE} Computer Society}, year = {2018}, url = {https://doi.org/10.1109/HiPINEB.2018.00011}, doi = {10.1109/HIPINEB.2018.00011}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Resource Efficient Deep Eigenvector Beamforming. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2018, Calgary, AB, Canada, April 15-20, 2018, 3354–3358, IEEE, 2018
@inproceedings{DBLP:conf/icassp/ZohrerPSFP18, author = {Z{\"{o}}hrer, Matthias and Pfeifenberger, Lukas and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger and Pernkopf, Franz}, title = {Resource Efficient Deep Eigenvector Beamforming}, booktitle = {2018 {IEEE} International Conference on Acoustics, Speech and Signal Processing, {ICASSP} 2018, Calgary, AB, Canada, April 15-20, 2018}, pages = {3354--3358}, publisher = {{IEEE}}, year = {2018}, url = {https://doi.org/10.1109/ICASSP.2018.8462503}, doi = {10.1109/ICASSP.2018.8462503}, timestamp = {Wed, 16 Oct 2019 14:14:52 +0200}, }
- Towards Efficient Forward Propagation on Resource-Constrained Systems. Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2018, Dublin, Ireland, September 10-14, 2018, Proceedings, Part I (Lecture Notes in Computer Science), 11051, 426–442, Springer, 2018
@inproceedings{DBLP:conf/pkdd/SchindlerZPF18, author = {Schindler, G{\"{u}}nther and Z{\"{o}}hrer, Matthias and Pernkopf, Franz and Fr{\"{o}}ning, Holger}, editor = {Berlingerio, Michele and Bonchi, Francesco and G{\"{a}}rtner, Thomas and Hurley, Neil and Ifrim, Georgiana}, title = {Towards Efficient Forward Propagation on Resource-Constrained Systems}, booktitle = {Machine Learning and Knowledge Discovery in Databases - European Conference, {ECML} {PKDD} 2018, Dublin, Ireland, September 10-14, 2018, Proceedings, Part {I}}, series = {Lecture Notes in Computer Science}, volume = {11051}, pages = {426--442}, publisher = {Springer}, year = {2018}, url = {https://doi.org/10.1007/978-3-030-10925-7\_26}, doi = {10.1007/978-3-030-10925-7\_26}, timestamp = {Tue, 21 Mar 2023 00:00:00 +0100}, }
- Efficient and Robust Machine Learning for Real-World Systems. CoRR, abs/1812.02240, 2018
@article{DBLP:journals/corr/abs-1812-02240, author = {Pernkopf, Franz and Roth, Wolfgang and Z{\"{o}}hrer, Matthias and Pfeifenberger, Lukas and Schindler, G{\"{u}}nther and Fr{\"{o}}ning, Holger and Tschiatschek, Sebastian and Peharz, Robert and Mattina, Matthew and Ghahramani, Zoubin}, title = {Efficient and Robust Machine Learning for Real-World Systems}, journal = {CoRR}, volume = {abs/1812.02240}, year = {2018}, url = {http://arxiv.org/abs/1812.02240}, eprinttype = {arXiv}, eprint = {1812.02240}, timestamp = {Tue, 01 Jan 2019 00:00:00 +0100}, }
2017
- InfiniBand Verbs on GPU: a case study of controlling an InfiniBand network device from the GPU. Int. J. High Perform. Comput. Appl., 31(4), 274–284, 2017
@article{DBLP:journals/ijhpca/OdenF17, author = {Oden, Lena and Fr{\"{o}}ning, Holger}, title = {InfiniBand Verbs on {GPU:} a case study of controlling an InfiniBand network device from the {GPU}}, journal = {Int. J. High Perform. Comput. Appl.}, volume = {31}, number = {4}, pages = {274--284}, year = {2017}, url = {https://doi.org/10.1177/1094342015588142}, doi = {10.1177/1094342015588142}, timestamp = {Thu, 12 Mar 2020 00:00:00 +0100}, }
- Linking Application Description with Efficient SIMD Code Generation for Low-Precision Signed-Integer GEMM. Euro-Par 2017: Parallel Processing Workshops - Euro-Par 2017 International Workshops, Santiago de Compostela, Spain, August 28-29, 2017, Revised Selected Papers (Lecture Notes in Computer Science), 10659, 688–699, Springer, 2017
@inproceedings{DBLP:conf/europar/SchindlerMF17, author = {Schindler, G{\"{u}}nther and M{\"{u}}cke, Manfred and Fr{\"{o}}ning, Holger}, editor = {Heras, Dora Blanco and Boug{\'{e}}, Luc and Mencagli, Gabriele and Jeannot, Emmanuel and Sakellariou, Rizos and Badia, Rosa M. and Barbosa, Jorge G. and Ricci, Laura and Scott, Stephen L. and Lankes, Stefan and Weidendorfer, Josef}, title = {Linking Application Description with Efficient {SIMD} Code Generation for Low-Precision Signed-Integer {GEMM}}, booktitle = {Euro-Par 2017: Parallel Processing Workshops - Euro-Par 2017 International Workshops, Santiago de Compostela, Spain, August 28-29, 2017, Revised Selected Papers}, series = {Lecture Notes in Computer Science}, volume = {10659}, pages = {688--699}, publisher = {Springer}, year = {2017}, url = {https://doi.org/10.1007/978-3-319-75178-8\_55}, doi = {10.1007/978-3-319-75178-8\_55}, timestamp = {Thu, 14 Oct 2021 10:28:38 +0200}, }
- Can Modern Graph Processing Engines Run Concurrent Queries Efficiently? Fifth International Workshop on Graph Data-management Experiences & Systems, GRADES@SIGMOD/PODS 2017, Chicago, IL, USA, May 14 - 19, 2017, 5:1–5:6, ACM, 2017
@inproceedings{DBLP:conf/grades/HauckPF17, author = {Hauck, Matthias and Paradies, Marcus and Fr{\"{o}}ning, Holger}, editor = {Boncz, Peter A. and Larriba{-}Pey, Josep Llu{\'{\i}}s}, title = {Can Modern Graph Processing Engines Run Concurrent Queries Efficiently?}, booktitle = {Fifth International Workshop on Graph Data-management Experiences {\&} Systems, GRADES@SIGMOD/PODS 2017, Chicago, IL, USA, May 14 - 19, 2017}, pages = {5:1--5:6}, publisher = {{ACM}}, year = {2017}, url = {https://doi.org/10.1145/3078447.3078452}, doi = {10.1145/3078447.3078452}, timestamp = {Thu, 10 Dec 2020 13:35:15 +0100}, }
- A Case Study on Implementing Virtual 5D Torus Networks Using Network Components of Lower Dimensionality. 3rd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2017, Austin, TX, USA, February 5, 2017, 9–16, IEEE Computer Society, 2017
@inproceedings{DBLP:conf/hpca/AndujarV0ADF17, author = {Andujar, Francisco J. and Villar, Juan A. and S{\'{a}}nchez, Jos{\'{e}} L. and Alfaro, Francisco J. and Duato, Jos{\'{e}} and Fr{\"{o}}ning, Holger}, editor = {Escudero{-}Sahuquillo, Jes{\'{u}}s and Garc{\'{\i}}a, Pedro Javier}, title = {A Case Study on Implementing Virtual 5D Torus Networks Using Network Components of Lower Dimensionality}, booktitle = {3rd {IEEE} International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2017, Austin, TX, USA, February 5, 2017}, pages = {9--16}, publisher = {{IEEE} Computer Society}, year = {2017}, url = {https://doi.org/10.1109/HiPINEB.2017.7}, doi = {10.1109/HIPINEB.2017.7}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Early Experiences with Saving Energy in Direct Interconnection Networks. 3rd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2017, Austin, TX, USA, February 5, 2017, 33–40, IEEE Computer Society, 2017
@inproceedings{DBLP:conf/hpca/ZahnLF17, author = {Zahn, Felix and Lammel, Steffen and Fr{\"{o}}ning, Holger}, editor = {Escudero{-}Sahuquillo, Jes{\'{u}}s and Garc{\'{\i}}a, Pedro Javier}, title = {Early Experiences with Saving Energy in Direct Interconnection Networks}, booktitle = {3rd {IEEE} International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era, HiPINEB@HPCA 2017, Austin, TX, USA, February 5, 2017}, pages = {33--40}, publisher = {{IEEE} Computer Society}, year = {2017}, url = {https://doi.org/10.1109/HiPINEB.2017.10}, doi = {10.1109/HIPINEB.2017.10}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Modeling and Validating Time, Buffering, and Utilization of a Large-Scale, Real-Time Data Acquisition System. 2017 International Conference on High Performance Computing & Simulation, HPCS 2017, Genoa, Italy, July 17-21, 2017, 519–525, IEEE, 2017
@inproceedings{DBLP:conf/ieeehpcs/SantosGVF17, author = {Santos, Alejandro and Garc{\'{\i}}a, Pedro Javier and Vandelli, Wainer and Fr{\"{o}}ning, Holger}, title = {Modeling and Validating Time, Buffering, and Utilization of a Large-Scale, Real-Time Data Acquisition System}, booktitle = {2017 International Conference on High Performance Computing {\&} Simulation, {HPCS} 2017, Genoa, Italy, July 17-21, 2017}, pages = {519--525}, publisher = {{IEEE}}, year = {2017}, url = {https://doi.org/10.1109/HPCS.2017.83}, doi = {10.1109/HPCS.2017.83}, timestamp = {Wed, 16 Oct 2019 14:14:54 +0200}, }
- Relaxations for High-Performance Message Passing on Massively Parallel SIMT Processors. 2017 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2017, Orlando, FL, USA, May 29 - June 2, 2017, 855–865, IEEE Computer Society, 2017
@inproceedings{DBLP:conf/ipps/KlenkFED17, author = {Klenk, Benjamin and Fr{\"{o}}ning, Holger and Eberle, Hans and Dennison, Larry}, title = {Relaxations for High-Performance Message Passing on Massively Parallel {SIMT} Processors}, booktitle = {2017 {IEEE} International Parallel and Distributed Processing Symposium, {IPDPS} 2017, Orlando, FL, USA, May 29 - June 2, 2017}, pages = {855--865}, publisher = {{IEEE} Computer Society}, year = {2017}, url = {https://doi.org/10.1109/IPDPS.2017.94}, doi = {10.1109/IPDPS.2017.94}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- An Overview of MPI Characteristics of Exascale Proxy Applications. High Performance Computing - 32nd International Conference, ISC High Performance 2017, Frankfurt, Germany, June 18-22, 2017, Proceedings (Lecture Notes in Computer Science), 10266, 217–236, Springer, 2017
@inproceedings{DBLP:conf/supercomputer/KlenkF17, author = {Klenk, Benjamin and Fr{\"{o}}ning, Holger}, editor = {Kunkel, Julian M. and Yokota, Rio and Balaji, Pavan and Keyes, David E.}, title = {An Overview of {MPI} Characteristics of Exascale Proxy Applications}, booktitle = {High Performance Computing - 32nd International Conference, {ISC} High Performance 2017, Frankfurt, Germany, June 18-22, 2017, Proceedings}, series = {Lecture Notes in Computer Science}, volume = {10266}, pages = {217--236}, publisher = {Springer}, year = {2017}, url = {https://doi.org/10.1007/978-3-319-58667-0\_12}, doi = {10.1007/978-3-319-58667-0\_12}, timestamp = {Tue, 14 May 2019 10:00:40 +0200}, }
2016
- Heterogeneous cluster architectures and applications. Concurr. Comput. Pract. Exp., 28(8), 2319–2321, 2016
@article{DBLP:journals/concurrency/SillaF16, author = {Silla, Federico and Fr{\"{o}}ning, Holger}, title = {Heterogeneous cluster architectures and applications}, journal = {Concurr. Comput. Pract. Exp.}, volume = {28}, number = {8}, pages = {2319--2321}, year = {2016}, url = {https://doi.org/10.1002/cpe.3762}, doi = {10.1002/CPE.3762}, timestamp = {Mon, 02 Mar 2020 00:00:00 +0100}, }
- Analyzing GPU-controlled communication with dynamic parallelism in terms of performance and energy. Parallel Comput., 57, 125–134, 2016
@article{DBLP:journals/pc/OdenKF16, author = {Oden, Lena and Klenk, Benjamin and Fr{\"{o}}ning, Holger}, title = {Analyzing GPU-controlled communication with dynamic parallelism in terms of performance and energy}, journal = {Parallel Comput.}, volume = {57}, pages = {125--134}, year = {2016}, url = {https://doi.org/10.1016/j.parco.2016.02.005}, doi = {10.1016/J.PARCO.2016.02.005}, timestamp = {Sat, 22 Feb 2020 00:00:00 +0100}, }
- Optimizing the data-collection time of a large-scale data-acquisition system through a simulation framework. J. Supercomput., 72(12), 4546–4572, 2016
@article{DBLP:journals/tjs/ColomboFGV16, author = {Colombo, Tommaso and Fr{\"{o}}ning, Holger and Garc{\'{\i}}a, Pedro Javier and Vandelli, Wainer}, title = {Optimizing the data-collection time of a large-scale data-acquisition system through a simulation framework}, journal = {J. Supercomput.}, volume = {72}, number = {12}, pages = {4546--4572}, year = {2016}, url = {https://doi.org/10.1007/s11227-016-1764-1}, doi = {10.1007/S11227-016-1764-1}, timestamp = {Fri, 22 May 2020 01:00:00 +0200}, }
- Analyzing the Energy (Dis-) Proportionality of Scalable Interconnection Networks. 2nd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era HiPINEB@HPCA 2016, Barcelona, Spain, March 12, 2016, 25–32, IEEE Computer Society, 2016
@inproceedings{DBLP:conf/hpca/ZahnYLGF16, author = {Zahn, Felix and Y{\'{e}}benes, Pedro and Lammel, Steffen and Garc{\'{\i}}a, Pedro Javier and Fr{\"{o}}ning, Holger}, editor = {Escudero{-}Sahuquillo, Jes{\'{u}}s and Garc{\'{\i}}a, Pedro Javier}, title = {Analyzing the Energy (Dis-) Proportionality of Scalable Interconnection Networks}, booktitle = {2nd {IEEE} International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era HiPINEB@HPCA 2016, Barcelona, Spain, March 12, 2016}, pages = {25--32}, publisher = {{IEEE} Computer Society}, year = {2016}, url = {https://doi.org/10.1109/HIPINEB.2016.13}, doi = {10.1109/HIPINEB.2016.13}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Optimizing communication for a 2D-partitioned scalable BFS. IEEE High Performance Extreme Computing Conference, HPEC 2016, Waltham, MA, USA, September 13-15, 2016, 1–7, IEEE, 2016
@inproceedings{DBLP:conf/hpec/YoungRHF16, author = {Young, Jeffrey S. and Romera, Julian and Hauck, Matthias and Fr{\"{o}}ning, Holger}, title = {Optimizing communication for a 2D-partitioned scalable {BFS}}, booktitle = {{IEEE} High Performance Extreme Computing Conference, {HPEC} 2016, Waltham, MA, USA, September 13-15, 2016}, pages = {1--7}, publisher = {{IEEE}}, year = {2016}, url = {https://doi.org/10.1109/HPEC.2016.7761596}, doi = {10.1109/HPEC.2016.7761596}, timestamp = {Sun, 12 Nov 2023 00:00:00 +0100}, }
- Exploring Time and Energy for Complex Accesses to a Hybrid Memory Cube. Second International Symposium on Memory Systems, MEMSYS 2016, Alexandria, VA, USA, October 3-6, 2016, 142–150, ACM, 2016
@inproceedings{DBLP:conf/memsys/SchmidtFB16, author = {Schmidt, Juri and Fr{\"{o}}ning, Holger and Br{\"{u}}ning, Ulrich}, editor = {Jacob, Bruce L.}, title = {Exploring Time and Energy for Complex Accesses to a Hybrid Memory Cube}, booktitle = {Second International Symposium on Memory Systems, {MEMSYS} 2016, Alexandria, VA, USA, October 3-6, 2016}, pages = {142--150}, publisher = {{ACM}}, year = {2016}, url = {https://doi.org/10.1145/2989081.2989099}, doi = {10.1145/2989081.2989099}, timestamp = {Fri, 13 Nov 2020 09:24:44 +0100}, }
- SONAR: Automated Communication Characterization for HPC Applications. High Performance Computing - ISC High Performance 2016 International Workshops, ExaComm, E-MuCoCoS, HPC-IODC, IXPUG, IWOPH, P^3MA, VHPC, WOPSSS, Frankfurt, Germany, June 19-23, 2016, Revised Selected Papers (Lecture Notes in Computer Science), 9945, 98–114, 2016
@inproceedings{DBLP:conf/supercomputer/LammelZF16, author = {Lammel, Steffen and Zahn, Felix and Fr{\"{o}}ning, Holger}, editor = {Taufer, Michela and Mohr, Bernd and Kunkel, Julian M.}, title = {{SONAR:} Automated Communication Characterization for {HPC} Applications}, booktitle = {High Performance Computing - {ISC} High Performance 2016 International Workshops, ExaComm, E-MuCoCoS, HPC-IODC, IXPUG, IWOPH, P{\^{}}3MA, VHPC, WOPSSS, Frankfurt, Germany, June 19-23, 2016, Revised Selected Papers}, series = {Lecture Notes in Computer Science}, volume = {9945}, pages = {98--114}, year = {2016}, url = {https://doi.org/10.1007/978-3-319-46079-6\_8}, doi = {10.1007/978-3-319-46079-6\_8}, timestamp = {Wed, 25 Sep 2019 18:17:53 +0200}, }
2015
- On the design of a new dynamic credit-based end-to-end flow control mechanism for HPC clusters. Parallel Comput., 46, 32–59, 2015
@article{DBLP:journals/pc/PradesSFND15, author = {Prades, Javier and Silla, Federico and Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian and Duato, Jos{\'{e}}}, title = {On the design of a new dynamic credit-based end-to-end flow control mechanism for {HPC} clusters}, journal = {Parallel Comput.}, volume = {46}, pages = {32--59}, year = {2015}, url = {https://doi.org/10.1016/j.parco.2015.03.006}, doi = {10.1016/J.PARCO.2015.03.006}, timestamp = {Sun, 02 Oct 2022 01:00:00 +0200}, }
- Modeling a Large Data-Acquisition Network in a Simulation Framework. 2015 IEEE International Conference on Cluster Computing, CLUSTER 2015, Chicago, IL, USA, September 8-11, 2015, 809–816, IEEE Computer Society, 2015
@inproceedings{DBLP:conf/cluster/ColomboFGV15, author = {Colombo, Tommaso and Fr{\"{o}}ning, Holger and Garc{\'{\i}}a, Pedro Javier and Vandelli, Wainer}, title = {Modeling a Large Data-Acquisition Network in a Simulation Framework}, booktitle = {2015 {IEEE} International Conference on Cluster Computing, {CLUSTER} 2015, Chicago, IL, USA, September 8-11, 2015}, pages = {809--816}, publisher = {{IEEE} Computer Society}, year = {2015}, url = {https://doi.org/10.1109/CLUSTER.2015.137}, doi = {10.1109/CLUSTER.2015.137}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
- Highspeed Graph Processing Exploiting Main-Memory Column Stores. Euro-Par 2015: Parallel Processing Workshops - Euro-Par 2015 International Workshops, Vienna, Austria, August 24-25, 2015, Revised Selected Papers (Lecture Notes in Computer Science), 9523, 503–514, Springer, 2015
@inproceedings{DBLP:conf/europar/HauckPFLR15, author = {Hauck, Matthias and Paradies, Marcus and Fr{\"{o}}ning, Holger and Lehner, Wolfgang and Rauhe, Hannes}, editor = {Hunold, Sascha and Costan, Alexandru and Gim{\'{e}}nez, Domingo and Iosup, Alexandru and Ricci, Laura and Requena, Mar{\'{\i}}a Engracia G{\'{o}}mez and Scarano, Vittorio and Varbanescu, Ana Lucia and Scott, Stephen L. and Lankes, Stefan and Weidendorfer, Josef and Alexander, Michael}, title = {Highspeed Graph Processing Exploiting Main-Memory Column Stores}, booktitle = {Euro-Par 2015: Parallel Processing Workshops - Euro-Par 2015 International Workshops, Vienna, Austria, August 24-25, 2015, Revised Selected Papers}, series = {Lecture Notes in Computer Science}, volume = {9523}, pages = {503--514}, publisher = {Springer}, year = {2015}, url = {https://doi.org/10.1007/978-3-319-27308-2\_41}, doi = {10.1007/978-3-319-27308-2\_41}, timestamp = {Tue, 14 May 2019 10:00:46 +0200}, }
- Analyzing communication models for distributed thread-collaborative processors in terms of energy and time. 2015 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2015, Philadelphia, PA, USA, March 29-31, 2015, 318–327, IEEE Computer Society, 2015
@inproceedings{DBLP:conf/ispass/KlenkOF15, author = {Klenk, Benjamin and Oden, Lena and Fr{\"{o}}ning, Holger}, title = {Analyzing communication models for distributed thread-collaborative processors in terms of energy and time}, booktitle = {2015 {IEEE} International Symposium on Performance Analysis of Systems and Software, {ISPASS} 2015, Philadelphia, PA, USA, March 29-31, 2015}, pages = {318--327}, publisher = {{IEEE} Computer Society}, year = {2015}, url = {https://doi.org/10.1109/ISPASS.2015.7095817}, doi = {10.1109/ISPASS.2015.7095817}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
2014
- Special issue on unconventional cluster architectures and applications. Clust. Comput., 17(2), 291, 2014
@article{DBLP:journals/cluster/SillaF14, author = {Silla, Federico and Fr{\"{o}}ning, Holger}, title = {Special issue on unconventional cluster architectures and applications}, journal = {Clust. Comput.}, volume = {17}, number = {2}, pages = {291}, year = {2014}, url = {https://doi.org/10.1007/s10586-013-0291-6}, doi = {10.1007/S10586-013-0291-6}, timestamp = {Tue, 29 Sep 2020 01:00:00 +0200}, }
- Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs. 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2014, Chicago, IL, USA, May 26-29, 2014, 483–492, IEEE Computer Society, 2014
@inproceedings{DBLP:conf/ccgrid/OdenKF14, author = {Oden, Lena and Klenk, Benjamin and Fr{\"{o}}ning, Holger}, title = {Energy-Efficient Collective Reduce and Allreduce Operations on Distributed GPUs}, booktitle = {14th {IEEE/ACM} International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2014, Chicago, IL, USA, May 26-29, 2014}, pages = {483--492}, publisher = {{IEEE} Computer Society}, year = {2014}, url = {https://doi.org/10.1109/CCGrid.2014.21}, doi = {10.1109/CCGRID.2014.21}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Analyzing Put/Get APIs for Thread-Collaborative Processors. 43rd International Conference on Parallel Processing Workshops, ICPPW 2014, Minneapolis, MN, USA, September 9-12, 2014, 411–418, IEEE Computer Society, 2014
@inproceedings{DBLP:conf/icppw/KlenkOF14, author = {Klenk, Benjamin and Oden, Lena and Fr{\"{o}}ning, Holger}, title = {Analyzing Put/Get APIs for Thread-Collaborative Processors}, booktitle = {43rd International Conference on Parallel Processing Workshops, {ICPPW} 2014, Minneapolis, MN, USA, September 9-12, 2014}, pages = {411--418}, publisher = {{IEEE} Computer Society}, year = {2014}, url = {https://doi.org/10.1109/ICPPW.2014.61}, doi = {10.1109/ICPPW.2014.61}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Infiniband-Verbs on GPU: A Case Study of Controlling an Infiniband Network Device from the GPU. 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, Phoenix, AZ, USA, May 19-23, 2014, 976–983, IEEE Computer Society, 2014
@inproceedings{DBLP:conf/ipps/OdenFP14, author = {Oden, Lena and Fr{\"{o}}ning, Holger and Pfreundt, Franz{-}Josef}, title = {Infiniband-Verbs on {GPU:} {A} Case Study of Controlling an Infiniband Network Device from the {GPU}}, booktitle = {2014 {IEEE} International Parallel {\&} Distributed Processing Symposium Workshops, Phoenix, AZ, USA, May 19-23, 2014}, pages = {976--983}, publisher = {{IEEE} Computer Society}, year = {2014}, url = {https://doi.org/10.1109/IPDPSW.2014.111}, doi = {10.1109/IPDPSW.2014.111}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Energy-efficient stencil computations on distributed GPUs using dynamic parallelism and GPU-controlled communication. 2nd International Workshop on Energy Efficient Supercomputing, E2SC ’14, New Orleans, Louisiana, USA, November 16-21, 2014, 31–40, IEEE Computer Society, 2014
@inproceedings{DBLP:conf/sc/OdenKF14, author = {Oden, Lena and Klenk, Benjamin and Fr{\"{o}}ning, Holger}, editor = {Cameron, Kirk W. and Hoisie, Adolfy and Kerbyson, Darren J. and Lowenthal, David K. and Nikolopoulos, Dimitrios S. and Yalamanchili, Sudha and Marquez, Andres}, title = {Energy-efficient stencil computations on distributed GPUs using dynamic parallelism and GPU-controlled communication}, booktitle = {2nd International Workshop on Energy Efficient Supercomputing, {E2SC} '14, New Orleans, Louisiana, USA, November 16-21, 2014}, pages = {31--40}, publisher = {{IEEE} Computer Society}, year = {2014}, url = {https://doi.org/10.1109/E2SC.2014.14}, doi = {10.1109/E2SC.2014.14}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
2013
- On Achieving High Message Rates. 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2013, Delft, Netherlands, May 13-16, 2013, 498–505, IEEE Computer Society, 2013
@inproceedings{DBLP:conf/ccgrid/FroningNLLB13, author = {Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian and Litz, Heiner and Leber, Christian and Br{\"{u}}ning, Ulrich}, title = {On Achieving High Message Rates}, booktitle = {13th {IEEE/ACM} International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2013, Delft, Netherlands, May 13-16, 2013}, pages = {498--505}, publisher = {{IEEE} Computer Society}, year = {2013}, url = {https://doi.org/10.1109/CCGrid.2013.43}, doi = {10.1109/CCGRID.2013.43}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- GGAS: Global GPU address spaces for efficient communication in heterogeneous clusters. 2013 IEEE International Conference on Cluster Computing, CLUSTER 2013, Indianapolis, IN, USA, September 23-27, 2013, 1–8, IEEE Computer Society, 2013
@inproceedings{DBLP:conf/cluster/OdenF13, author = {Oden, Lena and Fr{\"{o}}ning, Holger}, title = {{GGAS:} Global {GPU} address spaces for efficient communication in heterogeneous clusters}, booktitle = {2013 {IEEE} International Conference on Cluster Computing, {CLUSTER} 2013, Indianapolis, IN, USA, September 23-27, 2013}, pages = {1--8}, publisher = {{IEEE} Computer Society}, year = {2013}, url = {https://doi.org/10.1109/CLUSTER.2013.6702638}, doi = {10.1109/CLUSTER.2013.6702638}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
- Oncilla: A GAS runtime for efficient resource allocation and data movement in accelerated clusters. 2013 IEEE International Conference on Cluster Computing, CLUSTER 2013, Indianapolis, IN, USA, September 23-27, 2013, 1–8, IEEE Computer Society, 2013
@inproceedings{DBLP:conf/cluster/YoungSYMSF13, author = {Young, Jeffrey S. and Shon, Se Hoon and Yalamanchili, Sudhakar and Merritt, Alex and Schwan, Karsten and Fr{\"{o}}ning, Holger}, title = {Oncilla: {A} {GAS} runtime for efficient resource allocation and data movement in accelerated clusters}, booktitle = {2013 {IEEE} International Conference on Cluster Computing, {CLUSTER} 2013, Indianapolis, IN, USA, September 23-27, 2013}, pages = {1--8}, publisher = {{IEEE} Computer Society}, year = {2013}, url = {https://doi.org/10.1109/CLUSTER.2013.6702679}, doi = {10.1109/CLUSTER.2013.6702679}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
- Data Movement Options in Accelerated Clusters. Euro-Par 2013: Parallel Processing Workshops - BigDataCloud, DIHC, FedICI, HeteroPar, HiBB, LSDVE, MHPC, OMHI, PADABS, PROPER, Resilience, ROME, and UCHPC 2013, Aachen, Germany, August 26-27, 2013. Revised Selected Papers (Lecture Notes in Computer Science), 8374, 418–422, Springer, 2013
@inproceedings{DBLP:conf/europar/Froning13, author = {Fr{\"{o}}ning, Holger}, editor = {an Mey, Dieter and Alexander, Michael and Bientinesi, Paolo and Cannataro, Mario and Clauss, Carsten and Costan, Alexandru and Kecskemeti, Gabor and Morin, Christine and Ricci, Laura and Sahuquillo, Julio and Schulz, Martin and Scarano, Vittorio and Scott, Stephen L. and Weidendorfer, Josef}, title = {Data Movement Options in Accelerated Clusters}, booktitle = {Euro-Par 2013: Parallel Processing Workshops - BigDataCloud, DIHC, FedICI, HeteroPar, HiBB, LSDVE, MHPC, OMHI, PADABS, PROPER, Resilience, ROME, and {UCHPC} 2013, Aachen, Germany, August 26-27, 2013. Revised Selected Papers}, series = {Lecture Notes in Computer Science}, volume = {8374}, pages = {418--422}, publisher = {Springer}, year = {2013}, url = {https://doi.org/10.1007/978-3-642-54420-0\_41}, doi = {10.1007/978-3-642-54420-0\_41}, timestamp = {Wed, 19 Feb 2020 14:52:57 +0100}, }
2012
- A new degree of freedom for memory allocation in clusters. Clust. Comput., 15(2), 101–123, 2012
@article{DBLP:journals/cluster/MontanerSFD12, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, title = {A new degree of freedom for memory allocation in clusters}, journal = {Clust. Comput.}, volume = {15}, number = {2}, pages = {101--123}, year = {2012}, url = {https://doi.org/10.1007/s10586-010-0150-7}, doi = {10.1007/S10586-010-0150-7}, timestamp = {Sun, 02 Oct 2022 01:00:00 +0200}, }
- A New End-to-End Flow-Control Mechanism for High Performance Computing Clusters. 2012 IEEE International Conference on Cluster Computing, CLUSTER 2012, Beijing, China, September 24-28, 2012, 320–328, IEEE Computer Society, 2012
@inproceedings{DBLP:conf/cluster/PradesSDFN12, author = {Prades, Javier and Silla, Federico and Duato, Jos{\'{e}} and Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian}, title = {A New End-to-End Flow-Control Mechanism for High Performance Computing Clusters}, booktitle = {2012 {IEEE} International Conference on Cluster Computing, {CLUSTER} 2012, Beijing, China, September 24-28, 2012}, pages = {320--328}, publisher = {{IEEE} Computer Society}, year = {2012}, url = {https://doi.org/10.1109/CLUSTER.2012.15}, doi = {10.1109/CLUSTER.2012.15}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
2011
- MEMSCALE: in-cluster-memory databases. 20th ACM Conference on Information and Knowledge Management, CIKM 2011, Glasgow, United Kingdom, October 24-28, 2011, 2569–2572, ACM, 2011
@inproceedings{DBLP:conf/cikm/MontanerSFD11, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, editor = {Macdonald, Craig and Ounis, Iadh and Ruthven, Ian}, title = {{MEMSCALE:} in-cluster-memory databases}, booktitle = {20th {ACM} Conference on Information and Knowledge Management, {CIKM} 2011, Glasgow, United Kingdom, October 24-28, 2011}, pages = {2569--2572}, publisher = {{ACM}}, year = {2011}, url = {https://doi.org/10.1145/2063576.2064022}, doi = {10.1145/2063576.2064022}, timestamp = {Sun, 02 Oct 2022 01:00:00 +0200}, }
- Highly scalable barriers for future high-performance computing clusters. 18th International Conference on High Performance Computing, HiPC 2011, Bengaluru, India, December 18-21, 2011, 1–10, IEEE Computer Society, 2011
@inproceedings{DBLP:conf/hipc/FroningGMSD11, author = {Fr{\"{o}}ning, Holger and Giese, Alexander and Montaner, H{\'{e}}ctor and Silla, Federico and Duato, Jos{\'{e}}}, title = {Highly scalable barriers for future high-performance computing clusters}, booktitle = {18th International Conference on High Performance Computing, HiPC 2011, Bengaluru, India, December 18-21, 2011}, pages = {1--10}, publisher = {{IEEE} Computer Society}, year = {2011}, url = {https://doi.org/10.1109/HiPC.2011.6152729}, doi = {10.1109/HIPC.2011.6152729}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Unleash Your Memory-Constrained Applications: A 32-Node Non-coherent Distributed-Memory Prototype Cluster. 13th IEEE International Conference on High Performance Computing & Communication, HPCC 2011, Banff, Alberta, Canada, September 2-4, 2011, 9–16, IEEE, 2011
@inproceedings{DBLP:conf/hpcc/MontanerSFD11, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, editor = {Thulasiraman, Parimala and Yang, Laurence Tianruo and Pan, Qiwen and Liu, Xingang and Chen, Yaw{-}Chung and Huang, Yo{-}Ping and Chang, Lin{-}Huang and Hung, Che{-}Lun and Lee, Che{-}Rung and Shi, Justin Y. and Zhang, Ying}, title = {Unleash Your Memory-Constrained Applications: {A} 32-Node Non-coherent Distributed-Memory Prototype Cluster}, booktitle = {13th {IEEE} International Conference on High Performance Computing {\&} Communication, {HPCC} 2011, Banff, Alberta, Canada, September 2-4, 2011}, pages = {9--16}, publisher = {{IEEE}}, year = {2011}, url = {https://doi.org/10.1109/HPCC.2011.12}, doi = {10.1109/HPCC.2011.12}, timestamp = {Sun, 02 Oct 2022 01:00:00 +0200}, }
- MEMSCALE™: A Scalable Environment for Databases. 13th IEEE International Conference on High Performance Computing & Communication, HPCC 2011, Banff, Alberta, Canada, September 2-4, 2011, 339–346, IEEE, 2011
@inproceedings{DBLP:conf/hpcc/MontanerSFD11a, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, editor = {Thulasiraman, Parimala and Yang, Laurence Tianruo and Pan, Qiwen and Liu, Xingang and Chen, Yaw{-}Chung and Huang, Yo{-}Ping and Chang, Lin{-}Huang and Hung, Che{-}Lun and Lee, Che{-}Rung and Shi, Justin Y. and Zhang, Ying}, title = {MEMSCALE\({}^{\mbox{TM}}\): {A} Scalable Environment for Databases}, booktitle = {13th {IEEE} International Conference on High Performance Computing {\&} Communication, {HPCC} 2011, Banff, Alberta, Canada, September 2-4, 2011}, pages = {339--346}, publisher = {{IEEE}}, year = {2011}, url = {https://doi.org/10.1109/HPCC.2011.51}, doi = {10.1109/HPCC.2011.51}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- Network Interfaces. Encyclopedia of Parallel Computing, 1292–1298, Springer, 2011
@incollection{DBLP:reference/parallel/Froning11, author = {Fr{\"{o}}ning, Holger}, editor = {Padua, David A.}, title = {Network Interfaces}, booktitle = {Encyclopedia of Parallel Computing}, pages = {1292--1298}, publisher = {Springer}, year = {2011}, url = {https://doi.org/10.1007/978-0-387-09766-4\_319}, doi = {10.1007/978-0-387-09766-4\_319}, timestamp = {Wed, 12 Jul 2017 01:00:00 +0200}, }
2010
- Getting Rid of Coherency Overhead for Memory-Hungry Applications. IEEE International Conference on Cluster Computing, Heraklion, Crete, Greece, 20-24 September, 2010, 48–57, IEEE Computer Society, 2010
@inproceedings{DBLP:conf/cluster/MontanerSFD10, author = {Montaner, H{\'{e}}ctor and Silla, Federico and Fr{\"{o}}ning, Holger and Duato, Jos{\'{e}}}, title = {Getting Rid of Coherency Overhead for Memory-Hungry Applications}, booktitle = {{IEEE} International Conference on Cluster Computing, Heraklion, Crete, Greece, 20-24 September, 2010}, pages = {48--57}, publisher = {{IEEE} Computer Society}, year = {2010}, url = {https://doi.org/10.1109/CLUSTER.2010.14}, doi = {10.1109/CLUSTER.2010.14}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
- Efficient hardware support for the Partitioned Global Address Space. 24th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Workshop Proceedings, 1–6, IEEE, 2010
@inproceedings{DBLP:conf/ipps/FroningL10, author = {Fr{\"{o}}ning, Holger and Litz, Heiner}, title = {Efficient hardware support for the Partitioned Global Address Space}, booktitle = {24th {IEEE} International Symposium on Parallel and Distributed Processing, {IPDPS} 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Workshop Proceedings}, pages = {1--6}, publisher = {{IEEE}}, year = {2010}, url = {https://doi.org/10.1109/IPDPSW.2010.5470851}, doi = {10.1109/IPDPSW.2010.5470851}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
2009
- A HyperTransport 3 Physical Layer Interface for FPGAs. Reconfigurable Computing: Architectures, Tools and Applications, 5th International Workshop, ARC 2009, Karlsruhe, Germany, March 16-18, 2009. Proceedings (Lecture Notes in Computer Science), 5453, 4–14, Springer, 2009
@inproceedings{DBLP:conf/arc/LitzFB09, author = {Litz, Heiner and Fr{\"{o}}ning, Holger and Br{\"{u}}ning, Ulrich}, editor = {Becker, J{\"{u}}rgen and Woods, Roger F. and Athanas, Peter M. and Morgan, Fearghal}, title = {A HyperTransport 3 Physical Layer Interface for FPGAs}, booktitle = {Reconfigurable Computing: Architectures, Tools and Applications, 5th International Workshop, {ARC} 2009, Karlsruhe, Germany, March 16-18, 2009. Proceedings}, series = {Lecture Notes in Computer Science}, volume = {5453}, pages = {4--14}, publisher = {Springer}, year = {2009}, url = {https://doi.org/10.1007/978-3-642-00641-8\_4}, doi = {10.1007/978-3-642-00641-8\_4}, timestamp = {Fri, 19 Jul 2019 13:02:47 +0200}, }
- An FPGA based verification platform for HyperTransport 3.x. 19th International Conference on Field Programmable Logic and Applications, FPL 2009, August 31 - September 2, 2009, Prague, Czech Republic, 631–634, IEEE, 2009
@inproceedings{DBLP:conf/fpl/LitzFTB09, author = {Litz, Heiner and Fr{\"{o}}ning, Holger and Th{\"{u}}rmer, Maximilian and Br{\"{u}}ning, Ulrich}, editor = {Danek, Martin and Kadlec, Jiri and Nelson, Brent E.}, title = {An {FPGA} based verification platform for HyperTransport 3.x}, booktitle = {19th International Conference on Field Programmable Logic and Applications, {FPL} 2009, August 31 - September 2, 2009, Prague, Czech Republic}, pages = {631--634}, publisher = {{IEEE}}, year = {2009}, url = {https://doi.org/10.1109/FPL.2009.5272393}, doi = {10.1109/FPL.2009.5272393}, timestamp = {Wed, 16 Oct 2019 14:14:53 +0200}, }
- Efficient Virtualization of High-Performance Network Interfaces. The Eighth International Conference on Networks, ICN 2009, 1-6 March 2009, Gosier, Guadeloupe, France, 434–439, IEEE Computer Society, 2009
@inproceedings{DBLP:conf/icn/FroningLB09, author = {Fr{\"{o}}ning, Holger and Litz, Heiner and Br{\"{u}}ning, Ulrich}, editor = {Bestak, Robert and George, Laurent and Zaborovsky, Vladimir S. and Dini, Cosmin}, title = {Efficient Virtualization of High-Performance Network Interfaces}, booktitle = {The Eighth International Conference on Networks, {ICN} 2009, 1-6 March 2009, Gosier, Guadeloupe, France}, pages = {434--439}, publisher = {{IEEE} Computer Society}, year = {2009}, url = {https://doi.org/10.1109/ICN.2009.23}, doi = {10.1109/ICN.2009.23}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
- An FPGA-Based Custom High Performance Interconnection Network. ReConFig’09: 2009 International Conference on Reconfigurable Computing and FPGAs, Cancun, Quintana Roo, Mexico, 9-11 December 2009, Proceedings, 113–118, IEEE Computer Society, 2009
@inproceedings{DBLP:conf/reconfig/NussleGFB09, author = {N{\"{u}}ssle, Mondrian and Geib, Benjamin and Fr{\"{o}}ning, Holger and Br{\"{u}}ning, Ulrich}, editor = {Prasanna, Viktor K. and Torres, Lionel and Cumplido, Ren{\'{e}}}, title = {An FPGA-Based Custom High Performance Interconnection Network}, booktitle = {ReConFig'09: 2009 International Conference on Reconfigurable Computing and FPGAs, Cancun, Quintana Roo, Mexico, 9-11 December 2009, Proceedings}, pages = {113--118}, publisher = {{IEEE} Computer Society}, year = {2009}, url = {https://doi.org/10.1109/ReConFig.2009.23}, doi = {10.1109/RECONFIG.2009.23}, timestamp = {Thu, 23 Mar 2023 00:00:00 +0100}, }
2008
- VELO: A Novel Communication Engine for Ultra-Low Latency Message Transfers. 2008 International Conference on Parallel Processing, ICPP 2008, September 8-12, 2008, Portland, Oregon, USA, 238–245, IEEE Computer Society, 2008
@inproceedings{DBLP:conf/icpp/LitzFNB08, author = {Litz, Heiner and Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian and Br{\"{u}}ning, Ulrich}, title = {{VELO:} {A} Novel Communication Engine for Ultra-Low Latency Message Transfers}, booktitle = {2008 International Conference on Parallel Processing, {ICPP} 2008, September 8-12, 2008, Portland, Oregon, {USA}}, pages = {238--245}, publisher = {{IEEE} Computer Society}, year = {2008}, url = {https://doi.org/10.1109/ICPP.2008.85}, doi = {10.1109/ICPP.2008.85}, timestamp = {Fri, 24 Mar 2023 00:00:00 +0100}, }
2005
- Performance Evaluation of the ATOLL Interconnect. IASTED International Conference on Parallel and Distributed Computing and Networks, part of the 23rd Multi-Conference on Applied Informatics, Innsbruck, Austria, February 15-17, 2005, 129–134, IASTED/ACTA Press, 2005
@inproceedings{DBLP:conf/pdcn/FroningNSHB05, author = {Fr{\"{o}}ning, Holger and N{\"{u}}ssle, Mondrian and Slogsnat, David and Haspel, Patrick R. and Br{\"{u}}ning, Ulrich}, editor = {Fahringer, Thomas and Hamza, M. H.}, title = {Performance Evaluation of the {ATOLL} Interconnect}, booktitle = {{IASTED} International Conference on Parallel and Distributed Computing and Networks, part of the 23rd Multi-Conference on Applied Informatics, Innsbruck, Austria, February 15-17, 2005}, pages = {129--134}, publisher = {{IASTED/ACTA} Press}, year = {2005}, timestamp = {Sat, 04 Aug 2018 01:00:00 +0200}, }
- Swordfish: A Simulator for High-Performance Networks. International Conference on Parallel and Distributed Computing Systems, PDCS 2005, November 14-16, 2005, Phoenix, AZ, USA, 530–535, IASTED/ACTA Press, 2005
@inproceedings{DBLP:conf/pdcs/NussleFB05, author = {N{\"{u}}ssle, Mondrian and Fr{\"{o}}ning, Holger and Br{\"{u}}ning, Ulrich}, editor = {Zheng, S. Q.}, title = {Swordfish: {A} Simulator for High-Performance Networks}, booktitle = {International Conference on Parallel and Distributed Computing Systems, {PDCS} 2005, November 14-16, 2005, Phoenix, AZ, {USA}}, pages = {530--535}, publisher = {{IASTED/ACTA} Press}, year = {2005}, timestamp = {Wed, 09 Nov 2022 13:58:44 +0100}, }
2002
- ATOLL: Performance and Cost Optimization of a SAN Interconnect. International Conference on Parallel and Distributed Computing Systems, PDCS 2002, November 4-6, 2002, Cambridge, USA, 496–501, IASTED/ACTA Press, 2002
@inproceedings{DBLP:conf/pdcs/BruningFSR02, author = {Br{\"{u}}ning, Ulrich and Fr{\"{o}}ning, Holger and Schulz, Patrick R. and Rzymianowicz, Lars}, editor = {Akl, Selim G. and Gonzalez, Teofilo F.}, title = {{ATOLL:} Performance and Cost Optimization of a {SAN} Interconnect}, booktitle = {International Conference on Parallel and Distributed Computing Systems, {PDCS} 2002, November 4-6, 2002, Cambridge, {USA}}, pages = {496--501}, publisher = {{IASTED/ACTA} Press}, year = {2002}, timestamp = {Sat, 04 Aug 2018 01:00:00 +0200}, }
Theses
- Architectural improvements of interconnection network interfaces, 2007
@phdthesis{DBLP:phd/de/Froning2007, author = {Fr{\"{o}}ning, Holger}, title = {Architectural improvements of interconnection network interfaces}, school = {University of Mannheim, Germany}, year = {2007}, url = {https://nbn-resolving.org/urn:nbn:de:bsz:180-madoc-14307}, urn = {urn:nbn:de:bsz:180-madoc-14307}, timestamp = {Sat, 17 Jul 2021 01:00:00 +0200}, }