← Back to team overview

yade-dev team mailing list archive

Re: parallel collider - testing needed

 

hi guys,

i have also some benchmark results:

for 1 thread

200801
number of bodies 200813

Elapsed  41.6678731441  sec
Performance  4.79986101782  iter/sec
Extrapolation on 1e5 iters  5.78720460335  hours
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
Name Count                 Time            Rel. time
-------------------------------------------------------------------------------------------------------
ForceResetter 200 224925us 0.54% InsertionSortCollider 7 21314382us 51.34% InteractionLoop 200 14890015us 35.87% NewtonIntegrator 200 5084295us 12.25%
TOTAL 41513619us              100.00%

Common time  1013.57112694 s


5037  spheres, velocity= 140.463364272 +- 1.28620387158 %
25103  spheres, velocity= 41.138472944 +- 2.34750742651 %
50250  spheres, velocity= 24.1614197693 +- 0.709212706826 %
100467  spheres, velocity= 11.7041352478 +- 0.681390348657 %
200813  spheres, velocity= 5.20881044621 +- 5.57298683259 %


SCORE: 7993
Number of threads  1

--------------------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------
for 4 threads
200801
number of bodies 200813

Elapsed  18.8133409023  sec
Performance  10.6307540505  iter/sec
Extrapolation on 1e5 iters  2.61296401421  hours
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
Name Count                 Time            Rel. time
-------------------------------------------------------------------------------------------------------
ForceResetter 200 631347us 3.36% InsertionSortCollider 7 8374089us 44.57% InteractionLoop 200 6866564us 36.55% NewtonIntegrator 200 2915176us 15.52%
TOTAL 18787178us              100.00%

Common time  443.513967991 s


5037  spheres, velocity= 404.919400864 +- 0.912571165941 %
25103  spheres, velocity= 105.118936499 +- 2.36368208547 %
50250  spheres, velocity= 61.4143580936 +- 1.40115209383 %
100467  spheres, velocity= 25.7654736657 +- 2.93262637568 %
200813  spheres, velocity= 12.2452664182 +- 9.39816092272 %


SCORE: 19832
Number of threads  4

--------------------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------
for 8 threads
200801
number of bodies 200813

Elapsed  19.0994348526  sec
Performance  10.4715140287  iter/sec
Extrapolation on 1e5 iters  2.65269928508  hours
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
Name Count                 Time            Rel. time
-------------------------------------------------------------------------------------------------------
ForceResetter 200 1380352us 7.24% InsertionSortCollider 7 7577257us 39.74% InteractionLoop 200 6923126us 36.31% NewtonIntegrator 200 3186823us 16.71%
TOTAL 19067561us              100.00%

Common time  479.59920001 s


5037  spheres, velocity= 355.829004066 +- 2.37547928463 %
25103  spheres, velocity= 87.4558634849 +- 2.63148596504 %
50250  spheres, velocity= 56.1805332982 +- 2.18028212667 %
100467  spheres, velocity= 26.26403263 +- 9.82416513972 %
200813  spheres, velocity= 11.736613584 +- 8.6342992153 %


SCORE: 18265
Number of threads  8


--------------------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------
4 threads without virtualization
200801
number of bodies 200813

Elapsed  23.8045229912  sec
Performance  8.40176465935  iter/sec
Extrapolation on 1e5 iters  3.30618374878  hours
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
Name Count                 Time            Rel. time
-------------------------------------------------------------------------------------------------------
ForceResetter 200 523769us 2.20% InsertionSortCollider 7 15676590us 65.87% InteractionLoop 200 5077634us 21.33% NewtonIntegrator 200 2522054us 10.60%
TOTAL 23800048us              100.00%

Common time  437.141875982 s


5037  spheres, velocity= 611.163145541 +- 0.257590873987 %
25103  spheres, velocity= 107.643688809 +- 1.64393771392 %
50250  spheres, velocity= 61.8440291363 +- 1.45262072027 %
100467  spheres, velocity= 25.7847996322 +- 0.560241055497 %
200813  spheres, velocity= 8.39799958364 +- 0.0767470749617 %


SCORE: 19900
Number of threads  4



executed on core i7 3770 (in a virtual box because build yade on my native ubuntus fails) so between -j4 and -j8 is no significant difference. also no difference runned in a virtual box or direct on the machine.

so there is one step forward because i had really bad results with 8 threads without a parallel collider.

Matthias





Follow ups

References