Meeting Notes 07-26-2021

Meeting Notes from 07-26-2021

Minutes of Meeting July 26, 2021

Shantenu led a discussion of surrogates noting his work was delayed by a loss of a postdoc. Shantenu divided Surrogates into 3 areas

Shantenu presented PY2 and PY3 plans

In PY2 primary goals are:

  • (mini-)Review of surrogates in HPC – Volunteers? See later
  • Formalizing Performance measures (MLinHPC)
    • Three scenarios discussed above: Climate, Docking, Potentials
  • Experimenting with Performance (MLoutHPC)
    • Use DeepDriveMD to support different surrogates (Table 1) for common physical model (system)

In PY3

  • tackle (more) complex problem of MLoutHPC

AlphaFold2 (Google) and RoseTTaFold (Baker at Washington) DeepMind’s AI for protein structure is coming to the masses news BOTH released

CASP said protein folding solved from AlphaFold2 but RosettaFold is cheaper and as good as AlphaFold2. This could be an opportunity

Beckman noted we see a science transformation using FAIR Methodology.

Rick Stevens has challenged “How much did Go AI cost”

Dataset size is a serious issue.

  • deepmind/alphafold: Open source code for AlphaFold. notes The total download size for the full databases is around 415 GB and the total size when unzipped is 2.2 TB. Please make sure you have a large enough hard drive space, bandwidth and time to download. We recommend using an SSD for better genetic search performance.
  • Hurricane simulation will become inference
  • Doe strategy train leave data where it is similar to medical federated learning
  • Vikram noted that material science led to smaller datasets as just output final results and not the full trajectory

We discussed having a session at The Argonne Training Program on Extreme-Scale Computing (ATPESC) in 2022

Next month we will consider Implications for the project. Vikram and Shantenu volunteered

Last modified January 26, 2024: add notes (fa4a2ea)