Interactive Exploratory Graph-Enabled Data Analytics at High-Performance Computing Scales

Roger Pearce | 21-ERD-020

Executive Summary

We will develop a next-generation high-performance computing data analytics system to enable interactive hybrid graph and data analytics. If successful, this system will be able to analyze research problems relevant to national missions such as space security and cybersecurity at a scale much larger than the current state of the art.

Publications, Presentations, and Patents

Pearce, R., and G. Sanders. 2022. "Persistent Memory as the Substrate for HPC-Scale Graph Analytics." The Next Wave 23(2): 33-39. ISSN 2640-1789 [online], 2640-1797 [print]. Available at: www.nsa.gov/thenextwave.

Iwabuchi, Keita, Karim Youssef, Kaushik Velusamy, Maya Gokhale, and Roger Pearce. 2022. "Metall: A Persistent Memory Allocator for Data-Centric Analytics." Parallel Computing 111: 102905.

Reza, T., G. Sanders, and R. Pearce. "Towards Distributed 2-Approximation Steiner Minimal Trees in Billion-Edge Graphs." 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Lyon, France, pp. 549-559. May 2022. doi: 10.1109/IPDPS53621.2022.00060.

Pirkelbauer, Peter, Seth Bromberger, Keita Iwabuchi, and Roger Pearce. "Towards Scalable Data Processing in Python with CLIPPy." 2021 IEEE/ACM 11th Workshop on Irregular Applications: Architectures and Algorithms (IA3), Dallas, TX, pp. 43-52. IEEE. Nov. 2021.

Steil, Trevor, Tahsin Reza, Keita Iwabuchi, Benjamin W. Priest, Geoffrey Sanders, and Roger Pearce. "TriPoll: Computing Surveys of Triangles in Massive-Scale Temporal Graphs with Metadata." SC '21: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, Dallas, TX. November 2021.

Steil, Trevor, Geoffrey Sanders, and Roger Pearce. "Towards Distributed Square Counting in Large Graphs." 2021 IEEE High Performance Extreme Computing Conference (HPEC). September 2021.

Reza, Tahsin, Hassan Halawa, Matei Ripeanu, Geoffrey Sanders, and Roger A. Pearce. 2021. "Scalable Pattern Matching in Metadata Graphs via Constraint Checking." ACM Transactions on Parallel Computing 8, no. 1 (January 4, 2021): 2:1-2:45. https://doi.org/10.1145/3434391.

Youssef, Karim, Keita Iwabuchi, Wu-Chun Feng, and Roger Pearce. "Privateer: Multi-versioned Memory-Mapped Data Stores for High-Performance Data Science." 2021 IEEE High Performance Extreme Computing Conference (HPEC). September 2021.