https://gforge.accre.vanderbilt.edu/plugins/scmsvn/viewcvs.php/?root=lammpstools
Using the dump files reader I posted, I wrote a script that transfers that data into an hdf5 database. Download the two py files to the same directory. Again the dump file names should be sequenced like: 1.dump 2.dump 5.dump…
The simplest way to use it is to run it from the directory with the dump files: python dumps2hdf5.py. A temporary database will be generated to allow for resuming if the transfer process is interrupted. The final db name defaults to atoms.hdf5. Optionally, you can delete the dump files, change the source directory and the destination db file, change the dump file match pattern (from *.dump), and the number of timesteps read at one time.
You will need to have h5py installed (which requires numpy installed). My testing has been on python 2.6.
Performance is a barely acceptable ~1MB/sec. The limitation is coming from the text file processing but I don’t know how to make it faster and I don’t want to spend more time on it. I’ve already made an effort to optimize it. The db file size I got was about half the text file(s) size.
python dumps2hdf5.py --help for options.
Feedback is appreciated.