← Back to team overview

dolfin team mailing list archive

Re: Assembly timings

 

Level 2 blas might not improve run-time (should improve compile time), depending on how many zeros FFC finds. Calling N level 2 BLAS operations is *much* slower than one level 3 BLAS with N columns.

But it's a start.


We should definitely do this. I'll probably do level 2 BLAS first,
since then I only have to modify the output from FFC to use BLAS
calls. This will be an immediate benefit for Johan Hoffman with 3D
Navier-Stokes since compiling the code generated by FFC (with gcc) is
currently the bottle-neck. Hopefully, I can implement this by the end
of this week.

/Anders

_______________________________________________
DOLFIN-dev mailing list
DOLFIN-dev@xxxxxxxxxx
http://www.fenics.org/cgi-bin/mailman/listinfo/dolfin-dev


Rob Kirby

"Mathematical software should be mathematical."




Follow ups

References