On Wed, Jul 16, 2008 at 09:48:16PM +0100, Garth N. Wells wrote:
Anders Logg wrote:
Very nice!
Some comments:
1. Beating uBLAS by a factor of 3 is not that hard.
A factor of 3 is quite something if the matrix is in compressed row format.
DOLFIN assembly times into uBLAS, PETSc and Trilinos matrices are all
very close to each other.
Didem Unat (PhD
student at UCSD/Simula) and Ilmar have been looking at the assembly in
DOLFIN recently. We've done some initial benchmarks and have started
investigating how to speed up the assembly. Take a look at what happens
when we assemble into uBLAS:
(i) Compute sparsity pattern
(ii) Reset tensor
(iii) Assemble
For uBLAS, each of these steps costs roughly as much as the assembly itself.
I don't remember the exact numbers, but by just using an
std::vector<std::map<int, double> > instead of a uBLAS matrix, one may
skip (i) and (ii) and get a speedup.
You can do this with uBLAS too by using the uBLAS mapped_matrix (uses
std::map internally) instead of compressed_matrix. The problem is that
it is dead slow for matrix-vector multiplications. Most uBLAS sparse
matrix types are faster to assemble than the compressed_matrix, but are
slower to traverse.
Before the computation of the sparsity pattern was implemented, DOLFIN
assembled into a uBLAS vector of compressed vectors, since it is quite
fast to assemble into without preallocation and can be converted quite
quickly to a compressed row matrix. This approach may still have merit
for some problems.
Garth
I think eventually we should assemble into a special-purpose data
structure that is fast for assembly and then convert row-wise (which is
fast) into something suitable for linear algebra.