I've done some rough timings for assembly of the elasticity demo with
the mesh refined once. For assembly, things are much faster (almost a
factor 3), but the application of Dirichlet boundary conditions has
become very expensive. Both the creation of sub-domains and the
application of the boundary conditions appears to be quite costly. Of
the time required for mesh initialisation, refinement, assembly and
application of boundary conditions, more the 50% of the runtime involves
the application of the boundary conditions. In version 0.64, the
application of boundary conditions involves almost zero overhead.