dolfin team mailing list archive

Thread
Date

Re: profiling an assembly

To: dolfin-dev@xxxxxxxxxx
From: Anders Logg <logg@xxxxxxxxx>
Date: Mon, 19 May 2008 15:34:25 +0200
Delivered-to: dolfin-dev@xxxxxxxxxx
In-reply-to: <a9f269830805190630h4eb48f7focd0f842faa5e5b7f@mail.gmail.com>
Mail-followup-to: dolfin-dev@xxxxxxxxxx
User-agent: Mutt/1.5.17+20080114 (2008-01-14)

On Mon, May 19, 2008 at 08:30:20AM -0500, Matthew Knepley wrote:
> On Mon, May 19, 2008 at 8:20 AM, Anders Logg <logg@xxxxxxxxx> wrote:
> > It looks to me like the storage needed is indeed n^2*num_cells. I'm
> > not fluent in Fortran, but that's how I interpret this line:
> >
> >  atw(idxatw(el,li,lj)) = atw(idxatw(el,li,lj)) + Atmp(li,lj)
> >
> > This looks expensive (in terms of memory), but maybe not that
> > expensive?
> 
> I think I should make the aggregation point again. The above line executes
> a function call for insertion of every value. This is a lot of
> overhead,

No, I think the above code would be very much faster than PETSc, but
use more memory. The way I interpret it, atw is an array and idxatw is
a *dense* rank 3 tensor so there's no searching, only lookup.

> not only
> for the call, but setting up loop bounds etc. That is why MatSetValues takes
> logical blocks, exactly what you get from FEM, I believe this could be the
> difference between our timing results.

No, the above code is not what we use in DOLFIN. We use MatSetValues
with blocks. The above code is femLego Fortran code.

-- 
Anders

Follow ups

Re: profiling an assembly
From: Matthew Knepley, 2008-05-19

References

Re: profiling an assembly
From: Murtazo Nazarov, 2008-05-18
Re: profiling an assembly
From: Anders Logg, 2008-05-19
Re: profiling an assembly
From: Matthew Knepley, 2008-05-19