A list of software we use to make things easiers
This is the multi-page printable view of this section. Click here to print.
Software
1 - cloudmesh
Overview
Cloudmesh allows the creation of an extensible commandline and commandshell tool based internally on a number of python APIs that can be loaded conveniently through plugins.
Plugins useful for this effort include
- cloudmesh-vpn1 – a convenient way to configure VPN
- cloudmesh-common2 – useful common libraries including a StopWatch for benchmarking
- cloudmesh-cmd53 – a plugin manager that allows plugins to be integrated as commandline tool or command shell
- cloudmesh-ee4 – A pluging to create AI grid searchs using LSF and SLURM jobs
- cloudmesh-cc5 – A plugin to allow benchmarks to be run in coordination on heterogeneous compute resources and multiple clusters
- cloudmesh-apptainer6 – mangae apptainers via a Python API
Cloudmesh has over 100 plugins coordinated at http://github.com/cloudmesh
References
Gregor von Laszewski, J. P. Fleischer, and Geoffrey C. Fox. 2022. Hybrid Reusable Computational Analytics Workflow Management with Cloudmesh. https://doi.org/10.48550/ARXIV.2210.16941 ↩︎
2 - sabath
Introduction
SABATH provides benchmarking infrastructure for evaluating scientific ML/AI models. It contains support for scientific machine learning surrogates from external repositories such as SciML-Bench.
The software dependences are explicitly exposed in the surrogate model definition, which allows the use of advanced optimization, communication, and hardware features. For example, distributed, multi-GPU training may be enabled with Horovod. Surrogate models may be implemented using TensorFlow, PyTorch, or MXNET frameworks.
Models
Models are collected so far at