I have a workflow that does the following:
- OptimizationFW with custodian job type full_optimization
- OptimizationFW with custodian job type normal
- OptimizationFW with custodian job type normal
Sometimes the first Firework reaches state COMPLETED, yet defuses the next Firework, and I’m not sure why. I reran the same structure and reproduced the behavior, but a different structure did not trigger it.
I thought this might be related to the number of optimizations run by custodian, but even with more relaxations in the second structure I tried, I cannot reproduce it.
The main difference, it seems, besides the structures themselves, is the amount of time the first Firework takes (~5 hours) compared to ~30 minutes in the second test case.
I’m not running into any walltime issues that I’m aware of.
This is partly a question of how to debug this.
Below is the -d more output for the full optimization Firework that is COMPLETED, but for some reason defused the next one.
Is my understanding correct that what caused this was the PassCalcLocs Firetask (the fourth task in my Firework)? It is right after the RunVaspCustodian task.
What next steps can I take to find out what’s going on?
{
  "name": "MgCu-structure optimization",
  "launches": [
    {
      "fworker": {
        "category": "",
        "query": "{}",
        "name": "ACI",
        "env": {
          "scratch_dir": "/storage/home/bjb54/work/atomate-scratch",
          "vasp_cmd": "mpirun vasp_std",
          "db_file": "/storage/home/bjb54/work/atomate/config/db.json",
          "incar_update": {
            "ncore": 4
          }
        }
      },
      "trackers": [],
      "ip": "10.102.101.223",
      "fw_id": 7,
      "state": "COMPLETED",
      "host": "comp-bc-0223.acib.production.int.aci.ics.psu.edu",
      "launch_dir": "/storage/work/bjb54/test-full-opt/launcher_2017-10-12-16-29-43-799766",
      "action": {
        "defuse_workflow": false,
        "update_spec": {},
        "mod_spec": [
          {
            "_push_all": {
              "calc_locs": [
                {
                  "path": "/storage/work/bjb54/test-full-opt/launcher_2017-10-12-16-29-43-799766",
                  "name": "structure optimization",
                  "filesystem": null
                }
              ]
            }
          }
        ],
        "stored_data": {
          "task_id": 237
        },
        "exit": false,
        "detours": [],
        "additions": [],
        "defuse_children": true
      },
      "launch_id": 3,
      "state_history": [
        {
          "checkpoint": {
            "_task_n": 4,
            "_all_update_spec": {},
            "_all_mod_spec": [
              {
                "_push_all": {
                  "calc_locs": [
                    {
                      "path": "/storage/work/bjb54/test-full-opt/launcher_2017-10-12-16-29-43-799766",
                      "name": "structure optimization",
                      "filesystem": null
                    }
                  ]
                }
              }
            ],
            "_all_stored_data": {}
          },
          "updated_on": "2017-10-12T20:23:02.070220",
          "state": "RUNNING",
          "created_on": "2017-10-12T16:29:44.013708"
        },
        {
          "state": "COMPLETED",
          "created_on": "2017-10-12T20:23:02.138567"
        }
      ]
    }
  ],
  "fw_id": 7,
  "state": "COMPLETED",
  "created_on": "2017-10-12T16:27:26.194861",
  "updated_on": "2017-10-12T20:23:02.415661"
}
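
For anyone debugging something similar: the dump above has "defuse_children": true in the launch's action while "defuse_workflow" is false. A rough sketch of how one might scan such a dump for launches that defuse anything is below. It only uses the standard library and the keys visible in the dump; the function name find_defusing_launches and the trimmed sample dict are my own illustration, not part of FireWorks or atomate. It does not tell you which Firetask set the flag, only which launch carries it and the last "_task_n" recorded in a checkpoint.

```python
def find_defusing_launches(fw_dict):
    """Return a summary for every launch whose action defuses children or the workflow.

    fw_dict is assumed to be the parsed JSON of a Firework dump with the same
    keys as the output shown above (hypothetical helper, not a FireWorks API).
    """
    hits = []
    for launch in fw_dict.get("launches", []):
        action = launch.get("action") or {}
        defuse_children = bool(action.get("defuse_children"))
        defuse_workflow = bool(action.get("defuse_workflow"))
        if not (defuse_children or defuse_workflow):
            continue

        # Collect the _task_n values stored in checkpoints, in history order.
        task_ns = [
            entry["checkpoint"]["_task_n"]
            for entry in launch.get("state_history", [])
            if "_task_n" in entry.get("checkpoint", {})
        ]
        hits.append({
            "launch_id": launch.get("launch_id"),
            "state": launch.get("state"),
            "defuse_children": defuse_children,
            "defuse_workflow": defuse_workflow,
            "last_checkpoint_task_n": task_ns[-1] if task_ns else None,
        })
    return hits


# Trimmed copy of the dump above, keeping only the keys the helper reads.
sample = {
    "fw_id": 7,
    "state": "COMPLETED",
    "launches": [
        {
            "launch_id": 3,
            "state": "COMPLETED",
            "action": {"defuse_workflow": False, "defuse_children": True},
            "state_history": [
                {"state": "RUNNING", "checkpoint": {"_task_n": 4}},
                {"state": "COMPLETED"},
            ],
        }
    ],
}

hits = find_defusing_launches(sample)
for hit in hits:
    print(hit)
```

To run it on a real dump, parse the saved output with json.load first; that requires the file to contain straight double quotes rather than the curly quotes a forum may substitute.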