dolfin team mailing list archive

Thread
Date

Re: Generic meta-programming, faster than form compiler?

To: A Navaei <axnavaei@xxxxxxxxxxxxxx>
From: "Garth N. Wells" <gnw20@xxxxxxxxx>
Date: Mon, 23 Mar 2009 21:12:41 +0900
Cc: dolfin-dev <dolfin-dev@xxxxxxxxxx>
Delivered-to: dolfin-dev@xxxxxxxxxx
In-reply-to: <866cf96a0903230359q1a19d02bw8f057745bf66a99@mail.gmail.com>
User-agent: Thunderbird 2.0.0.21 (X11/20090318)



A Navaei wrote:

2009/3/23 Kent Andre <kent-and@xxxxxxxxx>:

The code that FFC produces is about as fast as light. It has been
documented in a number of papers.


Is there any data available comparing the FFC performance to the hardware peak?

FFC does not operate in isolation, so it is not possible to make acomparison to max flops of a CPU. Furthermore, in a typical simulationwith code generated by FFC, other parts of the solution process dominate(such as insertion as mentioned by Kent) and the linear solve, sowhether or not FFC generated code is optimal in terms of peak flops of amachine is not relevant to runtime performance.

I don't think you should try to beat FFC with generic meta-programming.
Or you could do it but, but don't have to high expectations...

Insertion into the matrix is currently the bottleneck. But FFC does
not have anything to do with this.


While FFC doesn't have anything to do with this, dolfin does. In the
case of the MTL4 backend wrapper, it is implemented badly by ignoring

the meta-programming potentials.


This is not a constructive comment. Patches are welcome.

For instance, sparse matrix insertion

is done by forming a sparsity pattern outside of MTL4 and then
assigning the pointers to MTL4 API, while loop unrolling could have
been used here.

If you look at the code, the FFC backend does not use the sparsitypattern. The MTL4 inserter does have some options which we have not yetbeen taken advantage of, so again patches are welcome.


Garth


-Ali

Kent


On ma., 2009-03-23 at 10:11 +0000, A Navaei wrote:

The success of MTL4 based on generic meta-programming, arises the
question about re-visiting the efficiency of code-generation
approaches, including FFC. Given that FEM can particularly benefit
from major meta-programming characteristics, namely static
polymorphism and loop unrolling, MTL4 demonstrates that the
code-generation part can be much more efficiently replaced by inlining
performed at compile-time.

Without having a concrete meta-programming implementation, it may be
impossible to predict how much performance one would gain compared to
FFC. However, MTL4 has been reported to be many times faster than
code-generation means such as ATLAS.

Based on this, are there any specific benefits in FFC code-generation
which may not be covered by meta-programming?


-Ali
_______________________________________________
DOLFIN-dev mailing list
DOLFIN-dev@xxxxxxxxxx
http://www.fenics.org/mailman/listinfo/dolfin-dev

_______________________________________________
DOLFIN-dev mailing list
DOLFIN-dev@xxxxxxxxxx
http://www.fenics.org/mailman/listinfo/dolfin-dev

Follow ups

Re: Generic meta-programming, faster than form compiler?
From: Robert Kirby, 2009-03-23

References

Generic meta-programming, faster than form compiler?
From: A Navaei, 2009-03-23
Re: Generic meta-programming, faster than form compiler?
From: Kent Andre, 2009-03-23
Re: Generic meta-programming, faster than form compiler?
From: A Navaei, 2009-03-23