Reliable Decentralized Solvers in Unreliable Computing Environments

Alyson Fox | 22-ERD-045

Executive Summary

We will develop resilient, asynchronous, decentralized algorithms for linear systems and eigenvalue problems by generalizing previously inapplicable resilience-enhancement techniques to enrich conjugate gradient-based numerical solvers. By enabling decentralized systems to continue functioning despite hardware malfunctions and cyber intrusions, we can enhance our nation's ability to create scalable algorithms for exascale computing, as well as reliable infrastructure and communication systems.

Publications, Presentations, and Patents

C. Vogl, Z. Atkins, A. Miedlar, C. Ponce, A. Fox. 2022. "Asynchronous Jacobi Methods with Resilience to Data Corruption and Robustness to Data Delay." Abstract. Copper Mountain Conference on Iterative Methods. LLNL-ABS-831426. April 2022.

Z. Atkins, C. Vogl, A. Miedlar, C. Ponce, A. Fox. 2022. "The Algorithmic Development of a Fully Asynchronous Conjugate Gradient Method." Abstract. Copper Mountain Conference on Iterative Methods. LLNL-ABS-831528. April 2022.

A. Miedlar, A. Fox, C. Vogl, A. Atkins, C. Ponce. 2022. "Towards Resilient and Robust Asynchronous Linear Systems Solvers for Edge Computing." Abstract. A journey in Numerical Linear Algebra: a Workshop in Honor of Michele Benzi’s 60th Birthday. LLNL-ABS-833997

C. Ponce, Z. Atkins, A. Fox, C. Vogl. 2022. "Resilient Algorithmic Building Blocks for Decentralized Iterative Computing in the Skynet Software." Abstract. SIAM Annual Meeting 2022.

A. Fox, C. Vogl, C. Ponce, Z. Atkins, C. Gillette, A. Miedlar. 2022. "Reliable Decentralized Solvers in Unreliable Computing Environments." Abstract. PNNL Granville Seminar. LLNL-ABS-837309

C. Gillette, A. Fox. 2022. "Towards a Dominant SVD Computation via ADMM-like Decentralized Consensus Optimization with Projection Splitting." Abstract. GEM Annual Board Meeting & Conference. Philadelphia, PA. Sept. 14-16, 2023. LLNL-ABS-839494

C. Gillette, A. Fox. 2023. “Towards a Dominant SVD Computation via ADMM-like Decentralized Consensus Optimization with Projection Splitting.” Abstract. SIAM CSE Conference 2023. Amsterdam, Netherlands, Feb 26-Mar 3, 2023. LLNL-ABS-840146

A. Fox, C. Vogl, C. Ponce, L. Erlandson, A. Miedlar, Z. Atkins, C. Gillette. 2023. “Creating Reliable Solvers in Unreliable Edge Computing Environments.” Abstract. VT Department of Mathematics Colloquium. LLNL-ABS-841487

L. Erlandson, A. Fox, A. Miedlar, C. Ponce, C. Vogl. 2023. “Handling Corruption in S-Approximate Conjugate Directions.” Abstract. SIAM CSE Conference 2023. Amsterdam, Netherlands, Feb 26-Mar 3, 2023. LLNL-ABS-84024

L. Erlandson, A. Miedlar. 2023. “New Algorithmic Developments for Heterogeneous Computing Environments.” Abstract. SIAM CSE Conference 2023. Amsterdam, Netherlands, Feb 26-Mar 3, 2023. LLNL-ABS-839348

L. Erlandson, A. Fox, A. Miedlar, C. Ponce, C. Vogl, “Resilient s-ACD for Collaborative Decentralized Linear Solves” (Poster Presentation, LLNL Postdoc Poster Symposium, Livermore, CA, 2023). LLNL-POST-848121

A. Fox, C. Ponce, C. Vogl, L. Erlandson, A. Miedlar, “Collaborative Autonomy: Reliable Decentralized Solvers” (Poster Presentation, Computing External Review, Livermore, CA, 2023). LLNL-POST-846619

C. Vogl, Z. Atkins, A. Miedlar, C. Ponce, A. Fox, "Asynchronous Jacobi Methods with Resilience to Data Corruption and Robustness to Data Delay" (Presentation, Copper Mountain Conference on Iterative Methods, Copper Mountain, CO, 2022). LLNL-PRES-833543.

Z. Atkins, C. Vogl, A. Miedlar, C. Ponce, A. Fox, "The Algorithmic Development of a Fully Asynchronous Conjugate Gradient Method" (Presentation, Copper Mountain Conference on Iterative Methods, Copper Mountain, CO, 2022). LLNL-PRES-833329

A. Miedlar, A. Fox, C. Vogl, A. Atkins, C. Ponce, "Towards Resilient and Robust Asynchronous Linear Systems Solvers for Edge Computing" (Presentation, A journey in Numerical Linear Algebra: a Workshop in Honor of Michele Benzi’s 60th Birthday, Livermore, CA, 2022). LLNL-PRES-836025

C. Ponce, Z. Atkins, A. Fox, C. Vogl, "Resilient Algorithmic Building Blocks for Decentralized Iterative Computing in the Skynet Software" (Presentation, SIAM Annual Meeting 2022, July 22, 2022, Pittsburgh, PA, 2022). LLNL-PRES-833575

A. Fox, C. Vogl, C. Ponce, A. Miedlar, "Reliable Decentralized Solvers in Unreliable Computing Environments" (Presentation, DoD Board of Governors Meeting, Livermore, CA, 2022). LLNL-PRES-834159

A. Fox, C. Vogl, C. Ponce, Z. Atkins, C. Gillette, A. Miedlar, "Reliable Decentralized Solvers in Unreliable Computing Environments" (Presentation, PNNL Granville Seminar, 2022). LLNL-PRES-838417

A. Fox, C. Vogl, C. Ponce, Z. Atkins, C. Gillette, A. Miedlar, "Reliable Decentralized Solvers in Unreliable Computing Environments" (Presentation, CASIS LLNL Workshop, 2022). LLNL-PRES-839495

C. Gillette, A. Fox, "Towards a Dominant SVD Computation via ADMM-like Decentralized Consensus Optimization with Projection Splitting" (Presentation, GEM Annual Board Meeting & Conference, 2022). LLNL-PRES-839340

C. Gillette, A. Fox, “Towards a Dominant SVD Computation via ADMM-like Decentralized Consensus Optimization with Projection Splitting” (Presentation, SIAM CSE Conference 2023). LLNL-PRES-845651

L. Erlandson, A. Fox, A. Miedlar, C. Ponce, C. Vogl, “Handling Corruption in S-Approximate Conjugate Directions” (Presentation, SIAM CSE Conference, 2023). LLNL-PRES-845085

A. Fox, L. Erlandson, C. Vogl, C. Ponce, A. Miedlar, Z. Atkins, “Reliable Decentralized Solvers in Unreliable Computing Environments: LDRD mid-project review” (Presentation, LDRD Mid-project review, 2023). LLNL-PRES-848549

A. Fox, C. Vogl, C. Ponce, L. Erlandson, A. Miedlar, Z. Atkins, C. Gillette, “Creating Reliable Solvers in Unreliable Edge Computing Environments” (Presentation, VT Department of Mathematics Colloquium, 2023). LLNL-PRES-841718

L. Erlandson, A. Fox, A. Miedlar, C. Ponce, C. Vogl, “Resilient s-ACD for Asynchronous Collaborative Solutions of Systems of Linear Equations” (Presentation, 18th Conference on Computer Science and Intelligence Systems FedCSIS 2023, Sept. 17-23, 2023, Warsaw, Poland, 2023). LLNL-PRES-854113

A. Fox, C. Vogl, C. Ponce, A. Miedlar, JP. Watson, S. Chapin, "Algorithmic Development for Unreliable Computing Environments" (Presentation, ASCR Workshop on Cybersecurity and Privacy for Scientific Computing Ecosystems, Livermore, CA, 2022). LLNL-MI-827865

L. Erlandson, Z. Atkins, A. Fox, C.J. Vogl, A. Miedlar, C. Ponce, “Resilient s-ACD for Asynchronous Collaborative Solutions of Systems of Linear Equations” (Presentation, 18th Conference on Computer Science and Intelligence Systems FedCSIS 2023, Sept. 17-23, 2023, Warsaw, Poland, 2023). Accepted. LLNL-CONF-849356