maginngroup / cassandra Goto Github PK

View Code? Open in Web Editor NEW

39.0 39.0 20.0 66.74 MB

Cassandra is a Monte Carlo package to conduct atomistic simulations.

Home Page: https://cassandra.nd.edu/

License: GNU General Public License v3.0

Python 14.07% Makefile 0.30% Fortran 75.18% Shell 0.03% C 9.64% CMake 0.78%

atomistic-simulations molecular-simulation monte-carlo-simulation

cassandra's People

Contributors

Stargazers

Watchers

Forkers

mattwthompson justingilmer rwsmith7531 rmatsum836 ryangmullen rsdefever emarinri ejmaginn daniel1991zy zhu-liu etomica kaihangshi hmcezar jeff-wang bradendkelly masrul nathanbsouza colestrickling jpbergsma

cassandra's Issues

Complete documentation conversion to RTD

Checklist

Add docs for force-shifted LJ
Add docs for Widom insertions
Fix author list on RTD-compiled PDF
Add docs for restricted insertions
Confirm equations in theory section
Confirm code blocks in theory section
Confirm that all links work
Confirm that all equation references are correct
Remove "warning" box and link to PDF docs

Check off the items as they are completed. Feel free to add to the checklist if anything appears to be missing!

SPC/E water NPT example README is incorrect

Current behavior

The README in Cassandra/Examples/NPT/water_spce is incorrect. It claims that the simulation is of 600 molecules, but it only has 90 molecules. It also refers to a file, npt.inp.chk, which does not exist, and it claims to start from that checkpoint file despite actually starting from npt.inp.xyz. The README may have been made for a different simulation from that provided in the examples, and may have other inaccuracies not listed here.

Slow cloning of Cassandra

Compress large fragment files in the Cassandra distribution

Create 'How to contribute' documentation

Make config structures unreasonable with low Rcutoff_Low

Current behavior

The make_config option can generate physically unreasonable starting structures if the Rcutoff_Low selection is small. See image below for an example.

Fix factor of 100 in the test suite results

Mixing_Rule custom option usability

@jshahOSU suggested that we might want to modify the ‘custom’ option for the Mixing_Rule section. Currently, the user has to specify all the cross interactions. We could provide a feature that only the parameters for the cross interaction of interest need to be specified. For the other cross interactions, the user-specified or default combining rules will apply.

Create tests for mcfgen.py

Is your feature request related to a problem? Please describe.

The mcfgen script is not currently being automatically tested in any form.

Describe the solution you'd like

It'd be good to include unit tests for several PDB files with species of different topologies (i.e. with rings, united atom, all atom, zeolites). The resulting MCF in these tests should be compared to accepted MCF.

Energy calculation in Cassandra

Dear Developers,

I'm a Ph.D. student working on sorption in biopolymers. I've been doing molecular dynamics modeling on sorption in polymeric systems for past years (mainly with LAMMPS) and recently I've decided to migrate to Cassandra. first of all, I wanted to thank you for your clear user guide and examples which accelerated me in preparing my modeling tool. there are some questions regarding energy calculations in Cassandra that I find unclear and would be very thankful for If I can have your comments on it.

I start with my most important question:

1- I perform a MD + GCMC loop. first I add SPCE water with a specific chemical potential to my system with GCMC in Cassandra. afterward using a bash code I prepare the final product for GROMACS NPT. I repeat this loop to reach equilibrium. Since water molecule atoms move and vibrate during my MD section even when I use LINCS or RIGID algorithms to constrain the bonds. after some repetition, Cassandra finds the mismatch between mcf file and bond lengths in xyz and stops with an error. to solve this problem I can delete the bond and angle information from water spce. However, this results in previously bonded atoms in the same molecule to interact with lj and thus giving me very high energy and also problems with gcmc. To me, I can solve the problem by using intra-scaliing 0 0 0 0 instead of 0 0 0 1 to stop atoms interacting in the same molecule. (is this correct?) does this option neglect interaction within the same water molecule or all the interaction between different water molecules, too?

2-since all the above mentioned problems can happens in the solid structure of my polymer too can I use the same approach for my cellulose solid structure too?

3- strangely I found out that Cassandra gives me different energy values for wrap or unwrapped input files. If this is expected. should I always use unwrapped configurations for my simulations?

Again thanks for all your time and just wanted to add Cassandra is a very helpful and clear package and I'm happy to join your community

Number of k-vectors is not consistent between orthogonal and cubic boxes

Expected behavior

For a simulation with electrostatic interactions, the number of k-vectors in by a cubic box whose box length is X should be the same as those required by an orthogonal box whose box lenghts are all the same and equal to X.

Current behavior

The number of k-vectors is not the same between cubic and orthogonal boxes of same dimensions for the same system.

Steps to reproduce

Steps to reproduce the behavior:

Use the same exact system with the same box dimensions and the same cutoff
Run 1 simulation where the box is defined as cubic
Run 1 simulation where the box is defined as orthogonal
Check the number of k-space vectors for the two simulations

Possible solution (optional)

Additional context

Further information, files or links (optional)

Any additional information here, attach relevant text or image files and URLs to external sites, publications , etc

Cassandra does not write bond tolerances to MCF file

Current behavior

The MCF files written by Cassandra during the fragment library generation do not contain the bond length tolerances that are (optionally) specified in the MCF file. Therefore, fragment library generation can fail for ring structures even if the PDB bond lengths are within the specified tolerance.

Expected behavior

Cassandra should write the bond length tolerances to the fragment MCF files so that it can generate the fragment libraries without error in such cases.

Transform run_examples.py from Python 2 to Python 3

Is your feature request related to a problem? Please describe.

With Python 2 is no longer being supported/updated, can the run_examples.py be converted to Python 3?

Describe the solution you'd like

A conversion of run_examples.py to Python 3

Cloning repo is slow

When cloning, at least without any special flags:

$ git clone https://github.com/MaginnGroup/Cassandra.git

we get the entire git history in the .git hidden directory, which happens to be quite large. This may be due to some binaries being tracked in the history, which can balloon the size quickly. Here is a snapshot of my clone:

I will tag GOMC-WSU/GOMC#154 as the source of my deja vu (strangely enough, the histories are nearly the same size) and @justinGilmer, who noted this in GOMC, and @YounesN, who is probably responsible for fixing it there.

Improve solids setup using mcfgen.py

Is your feature request related to a problem? Please describe.

When simulating rigid solids, a PDB of the solid framework is required to setup a simulation. If CONECT records are found in this file, the script mcfgen.py generates an MCF with connectivity information, including bonds. This might be problematic when running a simulation for rigid solids, as the initial configuration is typically input as an XYZ file and the bonds of the framework atoms might not be equal to the nominal bond length. As a consequence, the function Compute_Molecule_Bond_Energy raises an error when it checks that the XYZ file and MCF bond lengths are consistent within some tolerance.

The documentation specifies that the PDB file for the rigid solid does not list CONECT information, so the mcfgen.py script will not include bond, angle, or dihedral sections in the force field template.

Describe the solution you'd like

Perhaps adding a "--solid" flag to the command line interface of mcfgen to ignore the CONECT section of the PDB file if present. The generated ff template should only include intermolecular interactions and the final MCF should only have non-bonded parameters.

Describe alternatives you've considered

None

Additional context

None

Further information, files or links (optional)

A minimal example of faujasite is provided. Included is a PDB with CONECT keywords. If the mcfgen.py script executed, the connectivity information will need to be input. Also the ring identification functions are executed.

faujasite.zip

Addressing incorrect bond lengths in ring fragments

Ring fragments with incorrect bond lengths in the starting structure (i.e., the physical bond length does not equal the length specified in the MCF file) are problematic. The following commentary would also apply to fixed angles. In V1.2, the (fragment generation, and end-use) simulations run without warning the user that the bond length is different than the specified bond length. In the version on the develop branch, the either simulation immediately exits with an error (broken bond).

I think it would be a useful feature if (either in Cassandra, or library_setup.py), we could "fix" incorrect bond lengths in rings. I'm open to suggestions regarding the best way to accomplish this end. I'm naively envisioning a solution whereby the coordinates are updated such that the provided configuration is updated to the closest configuration lying on the constraint surface. If integrated directly into the Cassandra executable, this could also "fix" initial configurations that have incorrect bond lengths.

Add unit testing

We now have integrated the test suite into our development pipeline, which is a great first step. These are effectively integration tests. As the code continues to become more complicated we should also add unit tests. This will allow us to test the behavior of specific functions and increase our confidence that new code changes do not break existing behavior.

I need to do more research, but one option is pFUnit. An example of another scientific code that uses pFUnit is here. Look at the .pf files to get an idea of how it is implemented.

Please reply with other options below!

Add version info to auxillary scripts

Describe the solution you'd like

The auxillary scripts should also contain version info that is consistent with the main code releases to help with bug/issue tracking. bump2version can be used to manage the versioning.

Add counters for core overlap in Widom insertions.

Count and report the number of Widom insertions with core overlap for each test particle species/box combination. Output the totals to the log file underneath the chemical potentials.

Implement GSD output file for Cassandra

Is your feature request related to a problem? Please describe.

Cassandra currently saves the trajectory output to an xyz file. Though this is a general format, it contains minimal information about the trajectory, and contains no box information.

Describe the solution you'd like

An alternative output format is the general simulation data object or GSD. This is the native output format for HOOMD-blue. Particles and topologies can vary from one frame to the next, it has a Python API, can be read by tools like freud and OVITO, and is a binary format that supports random frame access.

There is a C API, so I think with fortran-C interoperability we could use the format without too much difficulty.

I'd like to get some input from the other developers on this idea.

MCF generated with mcfgen has doubled fragment connection.

Expected behavior

If fragments 1 and 2 are connected, only one connection should appear in the MCF file.

Current behavior

The connection is doubled.

# Fragment_Connectivity
2
1    1    2
2    2    1

Steps to reproduce

Steps to reproduce the behavior:

Run mcfgen.py C4H10.pdb in the Methane_Butane example.

Additional context

The old Python 2 version of mcfgen.py distributed with V1.2 had the correct behavior:

$ diff C4H10.mcf v1.2_C4H10.mcf 
68c68
< 2
---
> 1
70d69
< 2    2    1

Improvement on custom mixing rules

For the custom mixing rule, it would be good to allow for Aij and Bij coefficients, in addition to sigma and epsilon. It will enable purely repulsive, purely attractive, or some balance of the two. I think, then, we should be able to model WCA potential as well.

Move fragment library setup into Cassandra executable

I am curious to hear opinions for/against removing library_setup.py and moving this functionality entirely into the Cassandra executable.

Under this scenario, fragment libraries would be generated before the first step of any MC simulation. The user would be 'blind' to the fact that the fragment libraries are being generated; they would be discarded at the end of a given simulation. In my view, the benefits of this approach are that the user only has to run a single executable to perform a simulation. In my experience, the computational cost of fragment library generation is inconsequential and would thus not notably impact execution time under most use-cases.

I imagine providing options to save the fragment libraries to disk, and read fragment libraries from disk. This would preserve the current workflow for users who were interested and/or if there was some system for which fragment library generation was costly.

I'm interested in hearing opinions from more experienced developers/users as this would represent a substantial overhaul to the code.

library setup fails with space in file path

Current behavior

If library_setup.py is executed from a directory with a space in the path to the current location, it fails and exits with an error. Example where working directory is: /root/simulations/test sim

rm: cannot remove '/root/simulations/test': No such file or directory
rm: cannot remove 'sim/nvt.inp': No such file or directory
mv: target 'sim/nvt.inp' is not a directory

Expected behavior

library_setup.py should work correctly regardless of if there is a space in the directory path.

Limits to integer size - long solubility runs

I have been running a simulation for ~2 billion steps without issue. When I restart to run for beyond 2 billion steps, the simulation says it is complete. This may be an issue with the size of numbers which the program can store.

Attached are the input files needed to check this problem. In R32_bmimpf6_fail.zip the input file results in the issue discussed above. In R32_bmimpf6_success.zip the input file is only set to run for a short amount of time and is successful at restarting.
R32_bmimpf6_fail.zip
R32_bmimpf6_success.zip

MCFGen exits with keyError when the molecule has no fragments (pdb without CONECT)

Expected behavior

Running mcfgen.py with a .pdb without CONECT (for example, a rigid framework of a zeolite) should work.

Current behavior

When running mcfgen.py with the option -t and the pdb without CONECT I get:

[hmcezar@bitz-desktop silicalite_ch4]$ python ~/Sofware/Cassandra/Cassandra/Scripts/MCF_Generation/mcfgen.py silicalite.pdb -t
Reading Modified PDB File...


*********Generation of Topology File*********

Summary
---- -----------------------------------
   0 bonds                                   
   0 rings                                   
Traceback (most recent call last):
  File "/home/hmcezar/Sofware/Cassandra/Cassandra/Scripts/MCF_Generation/mcfgen.py", line 1420, in <module>
    angleID()
  File "/home/hmcezar/Sofware/Cassandra/Cassandra/Scripts/MCF_Generation/mcfgen.py", line 546, in angleID
    nBonds = len(atomConnect[atom])
KeyError: 1

Steps to reproduce

Steps to reproduce the behavior:

Get a pdb of a zeolite without CONECT
Run mcfgen.py -t

Possible solution (optional)

I will push a fix for this bug.

Add the ability to read and operate with pregenerated trajectories.

Enable Cassandra to read pregenerated trajectories from .xyz and .H files, compute properties from them, and perform Widom insertions in them.

Fix time counter for coordinate writing in simulations with lengths given in minutes

Expected behavior

In the nvt, npt, gcmc, and gemc driver subroutines, the time counter for coordinate writing should increment coord_time by ncoord_freq, which is the time interval in minutes between steps with coordinate writing given in the input file after keyword coord_freq when the units given are minutes.

Current behavior

Instead of incrementing by ncoord_freq, coord_time increments by nthermo_freq, which is the time interval in minutes between steps with thermodynamic property writing given in the input file after keyword prop_freq when the units given are minutes.

Possible solution

Replace nthermo_freq in these sections with ncoord_freq

Additional context

Code copied and pasted from this section (it's the same in each of the abovementioned drivers):

write_flag = .FALSE.
IF (.NOT. timed_run) THEN
   IF ( MOD(i_mcstep,ncoord_freq) == 0) write_flag = .TRUE.
ELSE
   now_time = now_time - coord_time
   IF(now_time .GT. ncoord_freq) THEN
      coord_time = coord_time + nthermo_freq
      write_flag = .TRUE.
   END IF
END IF

Allow dashes in filenames?

Should we allow dashes in file names? This restriction occasionally causes confusion. I'm not opposed to disallowing dashes if there is a good reason. Otherwise, I think we should allow them. If we decide to make this change we need to make sure that dashes work with mcfgen.py and library_setup.py in addition to the core code.

@rwsmith7531 I know you have looked into this?

@ryangmullen @emarinri Thoughts?

Describe the solution you'd like

Allow dashes in file names.

Describe alternatives you've considered

Add an explicit warning in documentation. Also add a more descriptive error message to the code.

Further information, files or links (optional)

Discussion spawned by #87.

Overload store_pair_energy array function using the INTERFACE block

This is in order to simplify the signature of the function when using many molecules.

Correct computation of errors in plot.py

Use Frenkel’s method to compute error bars with block averages. For a long simulation, it involves modifying the block size until a plateau appears in the standard deviation

Minimium image distance in a triclinic box

Expected behavior

The subroutine Minimum_Image_Separation should compute the minimum image distance between two atoms.

Current behavior

The subroutine Minimum_Image_Separation computes the distance from a central atom i to the image of another atom j in a cell centered on atom i. For cubic and orthogonal cells, the distance between atom i and the image of j in this cell will be the minimum image distance. However, for triclinic cells the image j in this cell may not be the closest image to i.

Steps to reproduce

In the input file, create a triclinic box with lattice vectors a = (1, 0, 0), b = (1/2, sqrt(3)/2, 0), c = (0, 0, 1). This can be accomplished by adding the following section to a CASSANDRA input file:

# Box_Info
1
cell_matrix
1.000000  0.500000  0.000000
0.000000  0.866025  0.000000
0.000000  0.000000  1.000000

Create a system with two atoms at scaled positions (1/3, 1/3, 0) and (7/8, 7/8, 0). This can be accomplished by creating an xyz
file with the following content:

2
# BOX: 1.000000 1.000000 1.000000 90. 90. 60.
  LJ  0.500000  0.288675  0.000000
  LJ  1.312500  0.757772  0.000000

3a. As input, the vector pointing from atom 1 to atom 2 is (13/24, 13/24, 0) in scaled coordinates and (0.812501, 0.469097, 0) in Cartesian coordinates. The length of this vector is 0.938195.
3b. In a triclinic cell centered on atom 1, atom 2 gets wrapped along lattice vectors a and b. The vector pointing from atom 1 to image 2 is (-11/24, -11/24, 0) in scaled coordinates and (-0.687499, -0.396928, 0) in Cartesian coordinates. The length of this vector is 0.793856. This is the distance that CASSANDRA finds.
3c. If we wrap atom 2 only using lattice vector a, the vector pointing from atom 1 to image 2 is (-11/24, 13/24, 0) in scaled coordinates and (-0.187499, 0.469097, 0) in Cartesian coordinates. The length of this vector is 0.505181. This is the distance that CASSANDRA should find.

Possible solution (optional)

Rather than wrapping atoms such that the scaled distance between atoms is on the range [-0.5, 0.5] along each lattice vector, we might could wrap atoms until the Cartesian component of the distance is on the range [-hbox, hbox] where hbox is half the distance between faces of an orthogonal cell constructed around atom i.

Improve ring identification in mcfgen.py

Is your feature request related to a problem? Please describe.

The current ring fragment identification functionality in mcfgen.py uses a custom algorithm to detect rings. This is not optimal, as there are established undirected graph cycle detection algorithms implemented in open source libraries.

Describe the solution you'd like

Utilize networkx to detect rings.

Describe alternatives you've considered

None

Overhaul mcfgen.py and library_setup.py

A few months back I wrote an overhaul to mcfgen.py and library_setup.py. See numbers #32 and #33 for the code and reference. I have refrained from merging the changes into the main repository because I am concerned that the updated scripts have not yet had adequate testing. I propose we move forward with integrating these into the repository but first add pytest-based unit testing on these two scripts. See #83 for further motivation.

This overhaul will also move us to python3 for both scripts. We have now included a python2 deprecation warning with both scripts for a couple of releases but I nonetheless propose that we continue to publish the python2 compatible versions of the scripts at least for another release or two, at which time they would be removed.

Add description of i-a convention to cell matrix documentation

Add the description of the convention used to define box vectors to Cassandra documentation regarding the cell_matrix method of specifying the simulation box dimensions.

PBC box images based on wrong dimension

Expected behavior

Apply_PBC_Anint in minimum_image_separation.f90 should apply periodic boundary conditions to all three box dimensions, with the box image in each dimension being dependent on the parent coordinate in the corresponding dimension.

Current behavior

Apply_PBC_Anint, when applying PBC to cubic boxes, bases the y and z box images on the x parent coordinate, not the y or z parent coordinates.

Steps to reproduce

Steps to reproduce the behavior:

Load unwrapped xyz configuration (not from Cassandra because Cassandra only keeps wrapped coordinates)
Write the configuration xyz file from Cassandra
See that y and z coordinates have not been wrapped properly by Cassandra (they can be far outside the box and even different from loaded, unwrapped config)

Possible solution (optional)

Correct the dimension labels in the code for Apply_PBC_Anint.

Additional context

This bug was discovered when testing Cassandra's trajectory reader (still in development) on unwrapped trajectories from LAMMPS.

mcfgen.py does not throw error if no charges are listed

Summary

mcfgen.py writes an MCF file without charges if the charge fields of the .ff file are left blank. E.g., if no charges are specified a line of the Atom_Info section of the MCF file appears as follows:

1    O    O    15.999   LJ    78.208    3.166

rather than with charges:

1    O   O    15.999    -0.82    LJ    78.208    3.166

In some cases it appears that "None" is written in place of the charges, e.g.,

1    O   O    15.999    None    LJ    78.208    3.166

Expected behavior

mcfgen.py should throw an error if no charges are provided since the charges are a required field of an MCF file.

Additional context

The Cassandra error with no charges listed is:

Cannot have an atom of vdw type 148.000 in a box of vdw type LJ
This error occurred in subroutine Get_Atom_Info on step 0.
Fatal Error. Stopping program.

Further information, files or links (optional)

Thanks to @ShahResearchGroup for reporting this bug.

Migrate from python 2 to python 3 for auxiliary scripts

Update mcfgen, plot, libgen

Update counters related to Widom insertions to INT64

Cassandra currently has problems with performing a number of Widom insertions exceeding the largest number storable as an INT32, so the counters related to Widom insertions should be updated to INT64.

Problem reading checkpoint

Expected behavior

Read the checkpoint of a NPT simulation and use it as a starting point of a GCMC simulation.

Current behavior

The program crashes without a clear error message in the log file, showing in STDOUT the message below

forrtl: severe (59): list-directed I/O syntax error, unit 100, file /path/to/file.chk

Steps to reproduce

Steps to reproduce the behavior:

Compile master branch version
Run a NPT simulation
Try to run GCMC simulation reading the configuration with checkpoint

Further information, files or links (optional)

By compiling with debug symbols and traceback, I was able to track the crash to line 225 of read_write_checkpoint.f90:

    READ(restartunit,*) initial_mcstep

I´m attaching an example below. The crash happens when I try to run gcmc_co2.inp.
checkpoint_error.zip

Cassandra fragment generation error for cage structure

Current behavior

The fragment generation fails for a cage structure. MCF and PDB file here. I had to modify this MCF file by hand some but I think it is correct.

There are several separate issues raised by this MCF file:

The line length is over 120 characters. This is too long for line_string.
The number of items in the line for fragment 1 is 30; this is more than the maximum of 20 in line_array.
The crankshaft move as implemented in Cassandra does not work for fused rings/ring systems (see #62).

Possible solution (optional)

I think we should allow for longer lines (360 characters ?), and more items per line (60 ?).

Further information, files or links (optional)

Here is what I mean by "cage":

ntrials for identity switch moves is not initialized to zero

Current behavior

The number of trials for identity switch moves is not initialized to zero. This does not affect the behavior of the code as identity switch is not fully implemented yet. However, it may cause the log file to report that some (non-zero) number of identity switch moves were attempted.

Add installation instructions to documentation

move_rotate.f90 unused variables?

In 'move_rotate.f90', the subroutine called 'Rotate_Molecule_Axis' (starting on line 362) allocates and computes one-dimensional arrays called 'dxrot', 'dyrot', and 'dzrot'. These arrays don't seem to be used for anything internal to the 'Rotate_Molecule_Axis' subroutine and are output to the 'dx', 'dy', and 'dz' arguments specified when the 'Rotate_Molecule_Axis' subroutine is called. The 'dx', 'dy', and 'dz' arrays are local to the 'Rotate' subroutine and don't appear to be utilized anywhere. Am I overlooking where these quantities are utilized in some fashion or are these arrays vestiges of earlier versions that can now be removed?

Implement HMA

Is your feature request related to a problem? Please describe.

Harmonically Mapped Averaging (HMA) allows crystal properties (energy, pressure, heat capacity) to be computed more efficiently (more precision in the same time or the same precision in less time). Other benefits include reduced finite-size effects, reduced truncation effects, shorter decorrelation time and faster equilibration.

Describe the solution you'd like

We'll be adding code to compute HMA properties.

Describe alternatives you've considered

HMA properties could be computed by dumping forces and coordinates to files, but the output would be huge.

Further information, files or links (optional)

HMA paper: https://doi.org/10.1103/PhysRevE.92.043303
HMA in LAMMPS paper: https://doi.org/10.1063/1.5129942
HMA for VASP paper: https://doi.org/10.1016/j.cpc.2020.107554
LAMMPS pull request: lammps/lammps#1503

Fragment generation fails for fused rings/ring systems

Current behavior

The fragments generated for fused ring systems violate the specified fixed bond lengths. If the crankshaft move is applied to an atom that is connected to >2 other ring (not including exoring) atoms, it will cause one of the ring bonds to change length. This can be easily demonstrated by generating fragments for naphthalene (files here) and looking at the fragment bond lengths.

Possible solution (optional)

Currently unsure of the correct solution. One possibility that comes to mind is not applying the crankshaft move to ring atoms connected to > 2 other ring atoms. One concern with this approach is whether all the internal DOF would be properly sampled in that case.

Two examples are shown below. Orange and red atoms are RING atoms; the orange atoms would be used for the crankshaft move whereas the red atoms would not.

Related Issues

Also related to #59 as the cage structure in that example suffers from the same problem.

Duplication of fragments in libgen.py

Jindal noticed that the libary_setup.py generates as many fragment libraries as there are fragments in a molecule. This is fine for small molecules; however, for large molecules, it can consume considerable amount of space. Can we, instead, generate fragment libraries only for unique fragments?

Correct examples to only use seeds that can be stored as INT32

PR #106 updates String_To_Int to be read to INT64 instead of INT32. The current behavior without this change is to overflow to negatives if the number in the input string is too large to be represented by INT32, and this occurs in some of the examples because they provide random number seeds that are large enough to cause this overflow. Until recently, this hasn't been very problematic because the overflow was consistent, but now it causes failures in the test suite for PR #106 , where the overflow was removed, causing the seeds provided in the input file to be the seeds that are actually effectively used, which is not the case where the overflow is present.

The examples in the test suite need to be corrected to provide smaller seeds that do not cause overflow so that they don't cause test suite failures when the update to String_To_Int in PR #106 is made.

Support for NpT GEMC with single box volume fluctuations

Describe the solution you'd like

The ability to run NpT-GEMC with no box fluctuations for a single simulation box; e.g., one box is vapor (with box fluctuations) and another is a solid or pore structure (no box fluctuations).

Further information, files or links (optional)

The solution will likely require modification to input routines to allow for zero box fluctuations in one box and ensuring there are no other cascading changes. The box without box fluctuations should support cubic or non-cubic boxes. The solution should also include a unit test demonstrating the functionality.

Checkpoint files with dashes result in error

Is your feature request related to a problem? Please describe.

I ran into an issue where I was trying to restart a simulation from a checkpoint file named equil-rst.out.chk. I found the same issue addressed here: https://cassandra.nd.edu/index.php/forum/cassandra-software/300-problem-restarting-simulation.

Describe the solution you'd like

I think it's okay to not allow dashes in the file names, but I think the default name for restart simulations (equil-rst) should be changed to something like equil_rst to fit this naming convention. In addition, improve the documentation to make it clear that dashes are not allowed.

Describe alternatives you've considered

Alternatively, the code could be refactored to allow dashes in the file names, but this may be a much more complicated solution.

Additional context

Cassandra 1.2.5

Further information, files or links (optional)

Any additional information here, attach relevant text or image files and URLs to external sites, publications , etc

Improve documentation on production simulation arguments

The documentation is not clear on the number of parameters a production simulation required for an NPT simulation . To clarify, it requires two arguments that control the acceptance output frequency to the log file of thermal and volume moves, respectively