CorCornelisse
02-26-2008, 11:56 AM
I'm in the process of evaluating the TotalView debugger and am having some difficulty attaching to the processes I want to debug.
I'm using MPICH2-MX (an MPICH2 version patched by Myrinet). Here is the output of mpich2version:
============
MPICH2 Version: 1.0.6
MPICH2 Release date: Unknown, built on Tue Feb 26 15:13:57 CET 2008
MPICH2 Device: ch_mx
MPICH2 configure: --prefix=/home/ccorneli/local/mpich2 --with-device=ch_mx --with-mx=/usr/local/Cluster-Apps/mx/mx-1.2.0j-2.6.18-clustervision-136.1_cvos-no-compat-syms --with-pm=smpd --with-pmi=smpd --with-mpe --with-romio --enable-compiler-optimizations --enable-cxx --enable-debuginfo --enable-fast --enable-g=dbg --enable-sharedlibs=gcc --enable-totalview
MPICH2 CC: pathcc -march=opteron -m64 -fno-second-underscore -g -O2
MPICH2 CXX: pathCC -march=opteron -m64 -fno-second-underscore -O2
MPICH2 F77: pathf95 -march=opteron -m64 -fno-second-underscore -O2
MPICH2 F90: pathf95 -march=opteron -m64 -fno-second-underscore -O2
MPICH2 Patch level: none
============
I use the SMPD process manager instead of MPD because of its SGE integration. The problem is that attaching to one of my processes works, but TotalView then fails to find the slave processes. My job runs on 29 nodes with 2 processes per node, for a total of 58 processes. I tried attaching to the parent SMPD process as well as to the individual processes on the start node (the node on which I issue the mpiexec command). The mpiexec version I have does not appear to have the tvsu or tv command-line options.
Is attaching TotalView to processes not supported this way (using SMPD)? Should I switch to MPD, or could it be something else?