yade-dev team mailing list archive
Message #10520
Re: parallel collider - testing needed
Hi guys,
I also have some benchmark results:
for 1 thread
200801
number of bodies 200813
Elapsed 41.6678731441 sec
Performance 4.79986101782 iter/sec
Extrapolation on 1e5 iters 5.78720460335 hours
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
Name                      Count            Time   Rel. time
-----------------------------------------------------------
ForceResetter               200        224925us       0.54%
InsertionSortCollider         7      21314382us      51.34%
InteractionLoop             200      14890015us      35.87%
NewtonIntegrator            200       5084295us      12.25%
TOTAL                                41513619us     100.00%
Common time 1013.57112694 s
5037 spheres, velocity= 140.463364272 +- 1.28620387158 %
25103 spheres, velocity= 41.138472944 +- 2.34750742651 %
50250 spheres, velocity= 24.1614197693 +- 0.709212706826 %
100467 spheres, velocity= 11.7041352478 +- 0.681390348657 %
200813 spheres, velocity= 5.20881044621 +- 5.57298683259 %
SCORE: 7993
Number of threads 1
--------------------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------
for 4 threads
200801
number of bodies 200813
Elapsed 18.8133409023 sec
Performance 10.6307540505 iter/sec
Extrapolation on 1e5 iters 2.61296401421 hours
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
Name                      Count            Time   Rel. time
-----------------------------------------------------------
ForceResetter               200        631347us       3.36%
InsertionSortCollider         7       8374089us      44.57%
InteractionLoop             200       6866564us      36.55%
NewtonIntegrator            200       2915176us      15.52%
TOTAL                                18787178us     100.00%
Common time 443.513967991 s
5037 spheres, velocity= 404.919400864 +- 0.912571165941 %
25103 spheres, velocity= 105.118936499 +- 2.36368208547 %
50250 spheres, velocity= 61.4143580936 +- 1.40115209383 %
100467 spheres, velocity= 25.7654736657 +- 2.93262637568 %
200813 spheres, velocity= 12.2452664182 +- 9.39816092272 %
SCORE: 19832
Number of threads 4
--------------------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------
for 8 threads
200801
number of bodies 200813
Elapsed 19.0994348526 sec
Performance 10.4715140287 iter/sec
Extrapolation on 1e5 iters 2.65269928508 hours
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
Name                      Count            Time   Rel. time
-----------------------------------------------------------
ForceResetter               200       1380352us       7.24%
InsertionSortCollider         7       7577257us      39.74%
InteractionLoop             200       6923126us      36.31%
NewtonIntegrator            200       3186823us      16.71%
TOTAL                                19067561us     100.00%
Common time 479.59920001 s
5037 spheres, velocity= 355.829004066 +- 2.37547928463 %
25103 spheres, velocity= 87.4558634849 +- 2.63148596504 %
50250 spheres, velocity= 56.1805332982 +- 2.18028212667 %
100467 spheres, velocity= 26.26403263 +- 9.82416513972 %
200813 spheres, velocity= 11.736613584 +- 8.6342992153 %
SCORE: 18265
Number of threads 8
--------------------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------
for 4 threads without virtualization
200801
number of bodies 200813
Elapsed 23.8045229912 sec
Performance 8.40176465935 iter/sec
Extrapolation on 1e5 iters 3.30618374878 hours
=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
Name                      Count            Time   Rel. time
-----------------------------------------------------------
ForceResetter               200        523769us       2.20%
InsertionSortCollider         7      15676590us      65.87%
InteractionLoop             200       5077634us      21.33%
NewtonIntegrator            200       2522054us      10.60%
TOTAL                                23800048us     100.00%
Common time 437.141875982 s
5037 spheres, velocity= 611.163145541 +- 0.257590873987 %
25103 spheres, velocity= 107.643688809 +- 1.64393771392 %
50250 spheres, velocity= 61.8440291363 +- 1.45262072027 %
100467 spheres, velocity= 25.7847996322 +- 0.560241055497 %
200813 spheres, velocity= 8.39799958364 +- 0.0767470749617 %
SCORE: 19900
Number of threads 4
Executed on a Core i7 3770 (in a virtual box, because building yade on my
native Ubuntu installs fails).
So there is no significant difference between -j4 and -j8 (SCORE 19832
vs. 18265). There is also no difference between running in a virtual box
and running directly on the machine (SCORE 19832 vs. 19900 with 4 threads).
So this is a step forward, because I had really bad results with 8
threads without a parallel collider.
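
To put numbers on the comparison, here is a minimal Python sketch that computes speedup and parallel efficiency from the "Elapsed" values of the three virtualized runs, and re-derives the "Extrapolation on 1e5 iters" figure. The dictionary layout and the function names (speedup_and_efficiency, extrapolate_hours) are made up for illustration and are not part of yade; the formula hours = iters / (iter/sec) / 3600 is an assumption that happens to reproduce the printed extrapolation.

```python
# Figures copied from the benchmark output (200813 bodies, 200 iterations).
# The dict layout and function names are illustrative, not part of yade.
runs = {
    "1 thread": {"threads": 1, "elapsed_s": 41.6678731441},
    "4 threads": {"threads": 4, "elapsed_s": 18.8133409023},
    "8 threads": {"threads": 8, "elapsed_s": 19.0994348526},
}


def speedup_and_efficiency(runs, baseline="1 thread"):
    """Return {name: (speedup, efficiency)} relative to the baseline run."""
    base = runs[baseline]["elapsed_s"]
    result = {}
    for name, run in runs.items():
        speedup = base / run["elapsed_s"]
        # Efficiency is the speedup divided by the number of threads used.
        result[name] = (speedup, speedup / run["threads"])
    return result


def extrapolate_hours(iter_per_sec, iters=1e5):
    """Wall-clock hours for `iters` iterations at a constant iteration rate."""
    return iters / iter_per_sec / 3600.0


results = speedup_and_efficiency(runs)
for name, (speedup, efficiency) in results.items():
    print(f"{name}: speedup {speedup:.2f}x, efficiency {efficiency:.0%}")

# Cross-check against the single-thread "Extrapolation on 1e5 iters" line.
print(f"1 thread, 1e5 iters: {extrapolate_hours(4.79986101782):.4f} hours")
```

Both multi-threaded runs come out at a bit over 2x the single-thread run, which is another way to see that going from -j4 to -j8 gains nothing on this machine.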
Matthias