Re: [OPM] Support for more linear solvers: status

Bård Skaflestad Fri, 14 Nov 2014 11:11:29 -0800

On 2014-11-14 16:44, Jørgen Kvalsvik wrote:

On 14. nov. 2014 16:33, Bård Skaflestad wrote:


As a general rule I usually distrust std::map<> when it comes to
handling anything that has to do with computational performance
and inner loops.


[ Reason elided ]

I changed it to std::map as a test, and with what I had it
out-performed the previous std::vector-of-triples solution.

I don't doubt that std::map<> was faster than the original version thatstd::sort()ed a std::vector<> of triplets. If I'm guessing, however,I'd attribute the speedup more to a non-optimal choice of initial datastructure than std::map<> being particularly good.

If I were implementing the SparseMatrixBuilder<> from scratch I'd usesomething a lot more targeted to the actual application (conversion fromcoordinate to CSR form). Like I said, you don't actually need to use ageneral sorting procedure at all (complexity O(N log N) in number ofnon-zero elements N). You can achieve a linear bucket sort directly bytransposing the structure twice--albeit at the cost of temporarilyduplicating the storage arrays. This is what all commercial packages do(or at least some variations of the basic approach). As a side benefitthis scheme allows easy compression of duplicate entries using askipping/summing rule.

There are further refinements possible too, such as O(1) rather thanO(log (number of row elements)) insertion if you guarantee that youalways visit the matrix elements in the same order every time you formthe coefficient matrix.

If you want to, I can outline this in code but you should really have alook at, say, Timothy Davis' "Direct Methods for Sparse Linear Systems"for some background material.

Is Dune::BCRSMatrix internal storage dependent on construction type?

It certainly used to be. I last checked the implementation in Dune2.2.x. At that time the row-creation iterator allocated a separatevalue container for each row. If you knew the total number of non-zeroelements at object construction time it used a single array for thecolumn indices though. On the other hand the random mode, which doesput quite a bit of responsibility on the user, ends up using just athree contiguous arrays internally--one for the row pointers, one forthe column indices and one for the values themselves (an array of blocktypes).



Bård

_______________________________________________
Opm mailing list
Opm@opm-project.org
http://www.opm-project.org/mailman/listinfo/opm

Re: [OPM] Support for more linear solvers: status

Reply via email to