I have several hundred .yaml files that look like this:
name: mead
category:
query: ‘{}’
fws:
-
fw_id: 1
spec:
_tasks:
-
_fw_name: PyTask
func: update_cubit.main
stored_data_varname: “cubit_metadata”
-
kwargs: {“cubitfile”: “/home/Synclesis/Projects/STTR_A15A-T004/Sandboxes/dshaw/fireflow/xuchen/sloped-box.py”, “var_names”: {“1”: “aperture2_width”, “0”: “aperture1_height”}, “errfile”: “/home/Synclesis/Projects/STTR_A15A-T004/Sandboxes/dshaw/fireflow/xuchen/cubit.err”, “var_values”: " 0.4 0.6\n", “exofile”: “/home/Synclesis/Projects/STTR_A15A-T004/Sandboxes/dshaw/fireflow/xuchen/results/000000_output/sloped-box.exo”}
.
.
.
- fw_id: 22
spec:
_tasks:
- _fw_name: PyTask
func: movedir.main
stored_data_varname: “movedir_metadata”
kwargs: {“targetdir”: “/home/Synclesis/Projects/STTR_A15A-T004/Sandboxes/dshaw/fireflow/xuchen/results/000000_output/”, “FW”: 21, “filename”: “/home/Synclesis/Projects/STTR_A15A-T004/Sandboxes/dshaw/fireflow/transition/paths.txt”}
links:
1:
- 2
2:
- 3
3:
- 4
4:
- 5
5:
- 6
6:
- 7
7:
- 8
8:
- 9
9:
- 10
10:
- 11
11:
- 12
12:
- 13
13:
- 14
14:
- 15
15:
- 16
16:
- 17
17:
- 18
18:
- 19
19:
- 20
20:
- 21
21:
- 22
metadata: {}
``
I am trying to use qlaunch (with SLURM) to launch ~10 of these yaml tasks at a time. Every combination of qadapter.yaml and my 'lpad add’s and ‘qlaunch’ submissions (the latter two items being controlled by a python script) launch the first .yaml task (multi_task_0.yaml). Each of the fireworks in the multi_task_0.yaml file are executed sequentially, as they should be, but the fireworks in the following ‘multi_task_#.yaml’ get hung up behind those fireworks instead of being launched parallel. That is, I’m trying to run ‘multi_task_1.yaml’ through ‘multi_task_10.yaml’, simultaneously, with the rest of the ‘multi_task_#.yaml’ files queueing up behind them.
I’ve attached the fworker, launchpad, and qadapter .yaml files.
The python trying to launch the FWs currently looks like this:
for irun in range(int(nruns)):
command = “sed -i 's/task_[0-9]+/task_”+str(irun)+"/g’ qadapter.yaml"
os.system(command)
``
command = 'qlaunch -l my_launchpad.yaml -w my_fworker.yaml -q qadapter.yaml rapidfire --nlaunches infinite'
os.system(command)
``
``
Thanks in advance,
Dan
my_fworker.yaml (36 Bytes)
my_launchpad.yaml (102 Bytes)
qadapter.yaml (730 Bytes)