Pelegant freezes up sometimes during touschek_scatter
Posted: 02 Oct 2018, 17:40
I am having occasional trouble running simulations of particle losses due to Touschek scattering. At some point Pelegant stops doing new calculations without printing any error message; it just hangs. It seems to happen whenever the number of "particles selected for tracking" is smaller than the number of processes. Since the number of selected particles is random, I sometimes run into this issue even with 1M particles if I am using a lot of processes.
I have attached files for a very artificial run with n_simulated=1000, including the output from a run using 4 processes. It hangs indefinitely at the 31st TSCATTER element, where there is only 1 particle to be tracked. In previous attempts I observed Pelegant freezing even when the number of particles was several times larger than the number of processes, but that was with a different input file that used WATCH elements and other features.
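To make the 4-process case concrete, here is a minimal Python sketch of how many particles each process would receive if the selected particles were split as evenly as possible. The even-split scheme and the function names are assumptions of mine for illustration only; I have not checked how Pelegant actually distributes particles, and this does not show that idle processes are the cause of the hang.

```python
def particles_per_rank(n_particles, n_ranks):
    """Split n_particles as evenly as possible across n_ranks.

    ASSUMPTION: an even block split; Pelegant's real scheme may differ.
    """
    base, extra = divmod(n_particles, n_ranks)
    # The first `extra` ranks each take one additional particle.
    return [base + (1 if rank < extra else 0) for rank in range(n_ranks)]


def idle_ranks(n_particles, n_ranks):
    """Return the ranks that would receive no particles under the assumed split."""
    counts = particles_per_rank(n_particles, n_ranks)
    return [rank for rank, count in enumerate(counts) if count == 0]


if __name__ == "__main__":
    # The failing case from the attached run: 1 selected particle, 4 processes.
    print(particles_per_rank(1, 4))
    print(idle_ranks(1, 4))
```

Under this assumed split, any run with fewer selected particles than processes leaves at least one process with nothing to track, which matches the condition under which I see the hang.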
I would appreciate any ideas for a workaround. I can reduce the number of processes, but there is no guarantee that the program won't hang anyway. I am also going to try continuing the simulation after a hang by setting 'overwrite_files=0' and running on a single process, but that will probably be too slow to do on a regular basis.
Cheers,
Gregg