This is the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

Software

Some software that we developed

A list of software we use to make things easiers

1 - cloudmesh

cloudmesh is a flexible framework to develop cloud and HPC programs using python. It is based on a number of plugins.

Overview

Cloudmesh allows the creation of an extensible commandline and commandshell tool based internally on a number of python APIs that can be loaded conveniently through plugins.

Plugins useful for this effort include

  • cloudmesh-vpn1 – a convenient way to configure VPN
  • cloudmesh-common2 – useful common libraries including a StopWatch for benchmarking
  • cloudmesh-cmd53 – a plugin manager that allows plugins to be integrated as commandline tool or command shell
  • cloudmesh-ee4 – A pluging to create AI grid searchs using LSF and SLURM jobs
  • cloudmesh-cc5 – A plugin to allow benchmarks to be run in coordination on heterogeneous compute resources and multiple clusters
  • cloudmesh-apptainer6 – mangae apptainers via a Python API

Cloudmesh has over 100 plugins coordinated at http://github.com/cloudmesh

7

References

2 - sabath

SABATH provides benchmarking infrastructure for evaluating scientific ML/AI models. It contains support for scientific machine learning surrogates from external repositories such as SciML-Bench.

Introduction

SABATH provides benchmarking infrastructure for evaluating scientific ML/AI models. It contains support for scientific machine learning surrogates from external repositories such as SciML-Bench.

The software dependences are explicitly exposed in the surrogate model definition, which allows the use of advanced optimization, communication, and hardware features. For example, distributed, multi-GPU training may be enabled with Horovod. Surrogate models may be implemented using TensorFlow, PyTorch, or MXNET frameworks.

Models

Models are collected so far at

1

References