mir-group / flare
An open-source Python package for creating fast and accurate interatomic potentials.
Home Page: https://mir-group.github.io/flare
License: MIT License
#27 snuck a call to set_L_alpha() into the update_db method of gp. This leads to inefficiencies when constructing large GP models from hundreds of structures, since the covariance matrix must be rebuilt from scratch every time a new structure is added to the training set. Removing the call to set_L_alpha resolves the issue.
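A toy sketch of why the incremental approach matters (the function names and the scalar kernel here are illustrative only, not flare's actual API): extending an existing covariance matrix by one row and column costs O(n) kernel evaluations per added point, while rebuilding from scratch costs O(n^2).

```python
import math

def build_cov(points, kernel):
    """Recompute the full covariance matrix (the set_L_alpha-style path)."""
    n = len(points)
    return [[kernel(points[i], points[j]) for j in range(n)] for i in range(n)]

def grow_cov(cov, points, new_point, kernel):
    """Extend an existing covariance matrix by one row/column
    (the update-style path): O(n) kernel calls instead of O(n^2)."""
    new_row = [kernel(new_point, p) for p in points]
    for row, k in zip(cov, new_row):
        row.append(k)
    cov.append(new_row + [kernel(new_point, new_point)])
    return cov

# Toy squared-exponential kernel on scalars.
kernel = lambda x, y: math.exp(-0.5 * (x - y) ** 2)
pts = [0.0, 1.0]
cov = build_cov(pts, kernel)
cov = grow_cov(cov, pts, 2.0, kernel)
pts.append(2.0)
assert cov == build_cov(pts, kernel)  # same matrix either way
```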
Docstrings annotating arguments and outputs to various methods would improve user readability.
To test that it works for the current version of otf.py (and its output conventions), the test should be based on a current output file, e.g. the ones generated by test_OTF.py.
Would be a three-liner in util.py to have a rough version:

_Z_to_element = {z: elt for elt, z in _element_to_Z.items()}

def Z_to_element(Z):
    return _Z_to_element[Z]
Would be useful for mapping 'coded species' integers to species names, as @nw13slx mentioned in the comments for issue 28. I'll add this later once I learn how to use branches (or somebody else is welcome to add it, no need to wait for me :) )
Write a flare.io module, which is an input/output module specifically for flare data structures. It should take md_trajectory_to/from_file from vasp_util.py and move it here.
to "mgp" (Mapped Gaussian Process) (credit to Jon)
Travis or CircleCI would be useful to look into for continuous integration, so we can eliminate friction with our pull requests and development.
Methods for serializing certain objects which are passed between models (e.g. atomic environments, structures, etc.), or even models themselves, would be useful. The advantage of this over pickled objects is that they can be more human-readable (and I understand that pickled objects have some security risks associated with them).
One example application is that JSON objects are easily storable in certain database architectures. This might be relevant for e.g. FOOGA in the near future if we want to automate the process of training GP models for different datasets, as this would let us store them more easily.
In my development branch I've done this for the AtomicEnvironment object. There are ways we could standardize this or easily implement it across our codebase (e.g. by using Monty, which has an object type which allows for effortless JSON serialization of different Python objects).
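A minimal sketch of the Monty-style as_dict/from_dict pattern described above. The class and field names here are illustrative assumptions, not flare's actual AtomicEnvironment API:

```python
import json

class AtomicEnvironmentSketch:
    """Illustrative stand-in for a serializable flare object."""
    def __init__(self, species, positions, cutoff):
        self.species = species        # e.g. ["H", "O"]
        self.positions = positions    # list of [x, y, z]
        self.cutoff = cutoff

    def as_dict(self):
        # Record the class name so a generic loader can dispatch on it.
        return {"@class": type(self).__name__,
                "species": self.species,
                "positions": self.positions,
                "cutoff": self.cutoff}

    @classmethod
    def from_dict(cls, d):
        return cls(d["species"], d["positions"], d["cutoff"])

env = AtomicEnvironmentSketch(["H", "O"], [[0.0, 0.0, 0.0], [0.0, 0.0, 1.0]], 5.0)
blob = json.dumps(env.as_dict())  # human-readable, database-friendly
restored = AtomicEnvironmentSketch.from_dict(json.loads(blob))
assert restored.species == env.species
```

Monty's MSONable base class provides this pattern (plus to_json helpers) automatically, which is the "effortless" route mentioned above.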
set_L_alpha gives the option to compute the covariance matrix in parallel. It would be helpful to have the same option for update_L_alpha, especially for large GPs
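The parallel structure is the same either way: map a row-computing worker over the environments. A dependency-free sketch (names are assumptions; a ThreadPoolExecutor is used here only to keep the example self-contained, whereas CPU-bound kernel evaluations would want a ProcessPoolExecutor or multiprocessing pool):

```python
import math
from concurrent.futures import ThreadPoolExecutor

def cov_row(args):
    """Compute one row of the covariance matrix."""
    x, points, kernel = args
    return [kernel(x, y) for y in points]

def parallel_cov(points, kernel, workers=4):
    """Map the row worker over all points; the same pattern applies to
    only the new rows an update_L_alpha-style method has to add."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(cov_row, ((x, points, kernel) for x in points)))

kernel = lambda x, y: math.exp(-0.5 * (x - y) ** 2)
pts = [0.0, 0.5, 1.0]
cov = parallel_cov(pts, kernel)
assert cov[0][1] == cov[1][0]  # symmetric kernel gives a symmetric matrix
```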
The training process can overload compute nodes when both the GP and the predictions are parallelized. This should not happen, because the parallelized functions are never called in the same step, but it could be caused by using multithreading and concurrency at the same time.
As is, the tests which involve calling Espresso fail if the PWSCF command is not found.
It would be nice if, when the PWSCF command is not detected, the unit tests that call QE were skipped rather than failed. This may be more informative to the user, since there are other reasons a call to QE could fail that would require debugging.
We could also print a message encouraging the user to fix their environment variable.
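The skip-instead-of-fail behavior could be sketched with the standard library's unittest (a pytest suite could do the same with pytest.mark.skipif); the executable name pw.x is an assumption here:

```python
import shutil
import unittest

# Skip QE-dependent tests when PWSCF is absent, instead of failing them,
# and point the user at their environment setup in the skip message.
needs_qe = unittest.skipUnless(
    shutil.which("pw.x") is not None,
    "PWSCF (pw.x) not found on PATH; check your QE environment variable")

class TestQE(unittest.TestCase):
    @needs_qe
    def test_qe_call(self):
        pass  # the real test would shell out to Quantum ESPRESSO here
```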
Before officially releasing the first version of the code, it would be great to include detailed tutorials in the documentation explaining how to use key features (especially otf, mff, and the mff pairstyle in LAMMPS).
Currently, the likelihood stops being printed after the hyperparameter training phase.
Would improve flexibility for the user, as Pymatgen structures have a lot of terrific methods and enjoy a wide user base.
One feature that would help to accelerate the pipeline of GP from AIMD workflows would be helper functions which parse DFT outputs (like VASP) and turn them into a file of serialized structures decorated with force information. A second helper function could generate atomic environments from a .json file. The use of functions like pymatgen for parsing would be extremely welcome here, as they have very high quality and externally maintained parsers for VASP. These wrappers would be simple to implement and useful.
Relevant to #19.
Before trying to make any kernel optimizations, I thought it would be good to make a suite of benchmarks so we can easily and consistently measure any performance boosts.
I am thinking a two phase benchmark would work well.
There is some redundancy in certain methods setting the `like` (or `likelihood`) variable in different places.
The tests should check that the resulting matrices agree with set_L_alpha.
Hat tip to Lixin and Yu, who discovered the following problem after laborious debugging: apparently parallelization in Python fails when multiple processes operate on the same instance of the same class object.
For instance, the module I'm developing of gp_from_aimd has great cause to use the predict functions, and to avoid duplicating code, having them be in a different file allows them to be called without an OTF instance. I currently have implemented this in my development branch, for reviving gp_from_aimd. Lixin has done the same in hers, so one of us will push it eventually.
Occasionally OTF can be interrupted (either by a bug, or because the user wants to interrupt the training and change the conditions) and needs to restart from the middle. It would be good to have a restart module, or a method inside the otf module.
I have a script that did this while I was training my stanene system. If you guys have written a better wrapped module, that would be great.
We need to refactor the OTF class to interface with
option to freeze certain hyperparameters
different hyperparameters for different species
Flare code can hang in SLURM jobs on Odyssey. This could be related to the memory setup; specifying the memory for SLURM and raising the stack size limit can help:
`#SBATCH --mem-per-cpu=6000`
`ulimit -s unlimited`
But we should look at memory profiling for the code at some point.
The output file in output.py should not be repeatedly opened and closed. We also need to allow multiple output files.
Relates to issue #19 ; generate a file demonstrating how to set up / run / parse VASP files.
A setup.py file contains lots of useful info, among other things a list of all required Python packages. Running it can install everything that is needed, which would be handy for new users.
One such guide to doing so is here:
https://the-hitchhikers-guide-to-packaging.readthedocs.io/en/latest/quickstart.html
We can include this on the wishlist for V 1.0.
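A hypothetical minimal setup.py sketch along the lines of that guide; the version number and the dependency list are placeholders, not flare's confirmed requirements:

```python
# Placeholder setup.py sketch; adjust version and install_requires to match
# the actual dependency list before using.
from setuptools import setup, find_packages

setup(
    name="flare",
    version="0.1.0",
    description="Fast and accurate interatomic potentials from Gaussian processes",
    url="https://mir-group.github.io/flare",
    license="MIT",
    packages=find_packages(),
    install_requires=["numpy", "scipy", "numba"],  # placeholder dependencies
)
```

New users could then run `pip install .` from the repository root to pull in everything at once.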
open a new branch, tasks:
Would be helpful to (short term) provide built-in methods to parse VASP files for model training, and (long term) support a VASP interface for OTF runs.
IMPI does not accept `mpirun exec < input` as OpenMPI does. It should be `mpirun -np 1 exec < input`.
We should make as many output files as possible in a simple column format, which can easily be read by numpy.loadtxt.
@YuuuuXie @jonpvandermause
Could you please put together a list of outputs that can be formatted this way? E.g. hyperparameters, MAE/likelihood at each step, ...
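A stdlib-only sketch of such a column format (the file name and field names are assumptions): a leading `#` comment line is skipped by numpy.loadtxt, and fixed-width scientific notation keeps columns aligned.

```python
def write_history(path, rows):
    """Write (step, mae, likelihood) rows in plain columns that
    numpy.loadtxt can read back directly."""
    with open(path, "w") as f:
        f.write("# step  mae  likelihood\n")  # comment line, skipped by loadtxt
        for step, mae, like in rows:
            f.write(f"{step:8d} {mae:14.6e} {like:14.6e}\n")

write_history("train_history.dat", [(0, 0.12, -35.2), (1, 0.09, -30.1)])
# Read back with: numpy.loadtxt("train_history.dat")
```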
need a module to
It would be helpful to have the option to parallelize efficiently over multiple compute nodes (i.e. with the -npool flag) for large, expensive otf simulations. Should be an easy fix -- just have to give the option in otf.py to use "run_dft_npool" instead of "run_dft_par".
In line 50 of md_run.py, the function call to output.write_header is missing the positional argument std_tolerance.
Would help to make managing the results of runs easier.
This will require some design choices and discussion, but it would help make our abstraction for the OTF and Trajectory Trainer make more sense, and let them share methods where that makes sense (e.g. prediction or std-in-bound methods).
I'm making this for my own development process
NumPy arrays are behaving strangely in the Sphinx documentation when used as type hints; this has something to do with their 'mock import' in the configuration. I will look into whether this can be fixed so that they render correctly in Sphinx.
@YuuuuXie may have already done this, but
Both of these would enhance user pre-processing flexibility.
Related to issue 14: #14
Because the is_std_in_bound() function is a convenience method for OTF or the (in-development) TrajectoryTrainer, it would eliminate redundancy to have it exist outside of the OTF class. If we end up splitting the predict functions into a separate file, I think that this function would be a natural fit there (given that it diagnoses the result of a prediction on a structure).
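A hedged sketch of what the standalone version could look like (the signature and threshold rule here are assumptions, not flare's exact implementation): flag the atoms whose predicted force standard deviation exceeds |std_tolerance| times the noise hyperparameter, so the caller can trigger a DFT call.

```python
def is_std_in_bound(std_tolerance, noise, stds):
    """stds: per-atom [sx, sy, sz] predicted standard deviations.
    Returns (True, []) if all atoms are in bound, else (False, bad_atoms)
    so the caller can add the flagged atoms to the training set."""
    threshold = abs(std_tolerance) * abs(noise)
    bad_atoms = [i for i, s in enumerate(stds) if max(s) > threshold]
    return (len(bad_atoms) == 0, bad_atoms)

ok, targets = is_std_in_bound(1.0, 0.05, [[0.01, 0.02, 0.01], [0.2, 0.1, 0.1]])
assert not ok and targets == [1]
```

Being a free function, it needs no OTF instance, so a TrajectoryTrainer (or anything else that predicts on structures) can call it directly.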
This idea is courtesy of @dmclark17. To keep branches organized and easy to navigate, I propose we adhere to the following convention for naming branches:
type of change/branch owner/description of change
Some examples:
bug/jon/gp-hotfix
feature/jon/cool-new-kernel
docs/jon/env-docstrings
etc.
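The convention above could even be checked mechanically, e.g. in a pre-push hook. A small sketch (the allowed type list is inferred from the examples and is an assumption):

```python
import re

# type-of-change / branch-owner / description-of-change, all lowercase,
# hyphen-separated; the type list here is an assumption from the examples.
BRANCH_RE = re.compile(r"^(bug|feature|docs)/[a-z0-9-]+/[a-z0-9-]+$")

def is_valid_branch(name):
    return BRANCH_RE.fullmatch(name) is not None

assert is_valid_branch("bug/jon/gp-hotfix")
assert not is_valid_branch("my-random-branch")
```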
check both against underlying gp model
Missing code of conduct, issue template, pull request template, etc.
Unit tests for the ASE interface.
We currently use mpirun, but we should allow the user to define their own MPI environment, such as mpirun, mpiexec, or srun.
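One way to support this is a launcher-to-template lookup; a sketch with placeholder names (the flag conventions shown are the common ones for each launcher, and note the IMPI issue above about needing `mpirun -np 1 exec < input`):

```python
def mpi_command(launcher, n_procs, exe, infile):
    """Build the shell command for a user-chosen MPI launcher."""
    templates = {
        "mpirun":  "mpirun -np {n} {exe} < {inp}",
        "mpiexec": "mpiexec -n {n} {exe} < {inp}",
        "srun":    "srun -n {n} {exe} < {inp}",
    }
    if launcher not in templates:
        raise ValueError(f"unknown MPI launcher: {launcher}")
    return templates[launcher].format(n=n_procs, exe=exe, inp=infile)

assert mpi_command("srun", 4, "pw.x", "scf.in") == "srun -n 4 pw.x < scf.in"
```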