[Users] How to generate a profiling file using mpiP

Erik Schnetter schnetter at cct.lsu.edu
Tue Apr 1 15:49:21 CST 2008


Hee Il,

below you only showed the total simulation time for each process.  I  
rather meant that the time spent e.g. in the evolution equations and  
in the boundary conditions could be different between processes.  The  
total time will always be very similar since it is essentially  
synchronised when the termination condition is communicated.

-erik

On Apr 1, 2008, at 16:11:43, Hee Il Kim wrote:
> Thanks all,
>
>
>
> 2008/4/1, Erik Schnetter <schnetter at cct.lsu.edu>:
> how large is the cluster?  A factor of 2 is not really bad, since you
> still get a factor of 32 speedup compared to running on a single CPU.
> What is the slowdown when you go from using 1 to using 2 full nodes?
>
>
> I can use max 80 cpus. If the factor of 2 is not really bad and  
> unavoidable to a low performace network cluster, I think I'd better  
> stop here. I spent too much time on this ^^
>
> Anyway I got many commnets and suggestions from mpich-discuss forum.  
> I found our switch has a good bandwidth value even though its  
> latency is not good as high performace hardwares. Also it equips  
> with the ability to use Open-MX. I will test Open-MX to reduce the  
> latency.
>
> The numbers below taken from mpich2. Note the iteration number was  
> taken 32, a half of the previous one.
>
> Thanks!
>
> Hee Il
>
>
> ==========================
>
> # 1 node = 8 cpus
>
> ./CCTK_Proc0.out:                | Total time for  
> simulation               |        463.46191400 |     423.44646400
> ./CCTK_Proc1.out:                | Total time for  
> simulation               |        463.41564800 |     443.49971700
> ./CCTK_Proc2.out:                | Total time for  
> simulation               |        463.41577600 |     441.01956200
> ./CCTK_Proc3.out:                | Total time for  
> simulation               |        463.41576900 |     415.26195200
> ./CCTK_Proc4.out:                | Total time for  
> simulation               |        463.41564200 |     444.48777900
> ./CCTK_Proc5.out:                | Total time for  
> simulation               |        463.41567800 |     421.04631400
> ./CCTK_Proc6.out:                | Total time for  
> simulation               |        463.41577200 |     421.11031800
> ./CCTK_Proc7.out:                | Total time for  
> simulation               |        463.44232100 |     411.46171500
>
>
> # 2 node = 16 cpus
>
> ./CCTK_Proc0.out:                | Total time for  
> simulation               |        481.08626400 |     439.36345800
> ./CCTK_Proc10.out:                | Total time for  
> simulation               |        481.05228700 |     449.02006200
> ./CCTK_Proc11.out:                | Total time for  
> simulation               |        481.05252200 |     423.33445700
> ./CCTK_Proc12.out:                | Total time for  
> simulation               |        481.05242800 |     444.17976000
> ./CCTK_Proc13.out:                | Total time for  
> simulation               |        481.05249500 |     415.08594100
> ./CCTK_Proc14.out:                | Total time for  
> simulation               |        481.05234400 |     413.60184900
> ./CCTK_Proc15.out:                | Total time for  
> simulation               |        481.05244200 |     407.84548900
> ./CCTK_Proc1.out:                | Total time for  
> simulation               |        481.05222500 |     415.46996500
> ./CCTK_Proc2.out:                | Total time for  
> simulation               |        481.05224300 |     415.90599200
> ./CCTK_Proc3.out:                | Total time for  
> simulation               |        481.05421800 |     404.89330400
> ./CCTK_Proc4.out:                | Total time for  
> simulation               |        481.05222600 |     446.89592900
> ./CCTK_Proc5.out:                | Total time for  
> simulation               |        481.05237600 |     419.93424400
> ./CCTK_Proc6.out:                | Total time for  
> simulation               |        481.04626200 |     423.71448100
> ./CCTK_Proc7.out:                | Total time for  
> simulation               |        481.09418200 |     419.81023700
> ./CCTK_Proc8.out:                | Total time for  
> simulation               |        481.05225000 |     430.81092400
> ./CCTK_Proc9.out:                | Total time for  
> simulation               |        481.05227900 |     448.58403400
>
>
> # 4 nodes = 32 cpus
>
> ./CCTK_Proc0.out:                | Total time for  
> simulation               |        688.29916500 |     415.56597200
>                                                                                                                               max 
>   460.74879500
>
> # 8 nodes = 64 cpus
>
> ./CCTK_Proc0.out:                | Total time for  
> simulation               |        794.68444700 |     428.21476200
>                                                                                                                               max 
>   470.84142600
>
>
>
>
>
>
>
>
>
> _______________________________________________
> Users mailing list
> Users at cactuscode.org
> http://www.cactuscode.org/mailman/listinfo/users


-- 
Erik Schnetter <schnetter at cct.lsu.edu>   http://www.cct.lsu.edu/~eschnett/

My email is as private as my paper mail.  I therefore support encrypting
and signing email messages.  Get my PGP key from www.keyserver.net.



-------------- next part --------------
A non-text attachment was scrubbed...
Name: PGP.sig
Type: application/pgp-signature
Size: 194 bytes
Desc: This is a digitally signed message part
Url : http://www.cactuscode.org/pipermail/users/attachments/20080401/ad7de6dd/attachment.bin 


More information about the Users mailing list