On 18 October 2024 at 09:39, Robin Liu wrote:
| Dear all,
|
| Computing the Cholesky factor of a matrix is on the order of n^3 while
| inverting a triangular system is n^2. However, it appears that these two
| operations take roughly the same time using armadillo.
|
| Unit: milliseconds
| ticker mean sd min max neval
| chol 80.81 3.001 75.84 88.11 20
| inv-chol 84.89 4.449 80.78 97.30 20
| inv-direct 274.39 10.787 260.85 303.20 20
|
| Inverting the triangular Cholesky factor is fast relative to direct inversion,
| but is of the same order as computing the factor itself. Is this to be
| expected? I am running with
| BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
| LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.20.so;
| LAPACK version 3.10.0
| along with OPENBLAS_NUM_THREADS=1.
The point of OpenBLAS is to be multithreaded. Why do you constrain it to one
thread?
| I admit my question is not specific to Rcpp or RcppArmadillo, but I thought
| someone on this list could advise.
What is the reference point? "Slow" is relative to what here? Did you try to
vary the matrix dimension too and see if the change in time is aligned with
your expectations?
In any event, Rcpp "is a messenger here" as is RcppArmadillo as is Armadillo
as the actual operations are plain BLAS operations and this list may no be
the most focussed forum to discuss the topic.
Dirk
| My code follows:
| ```
| library(Rcpp)
| library(RcppArmadillo)
|
| src <-
| r"(#include <RcppArmadillo.h>
| #include "/usr/local/lib/R/site-library/RcppClock/include/RcppClock.h"
| using namespace arma;
|
| // [[Rcpp:depends(RcppClock)]]
| // [[Rcpp::depends(RcppArmadillo)]]
| // [[Rcpp::export]]
| void chol_and_inv(const arma::mat &V) {
| Rcpp::Clock clock;
| mat R, Rinv, Vinv;
| for (int i = 0; i < 20; i++) {
| clock.tick("chol");
| R = chol(V);
| clock.tock("chol");
| clock.tick("inv-chol");
| Rinv = inv(trimatu(R));
| clock.tock("inv-chol");
| clock.tick("inv-direct");
| Vinv = inv(V);
| clock.tock("inv-direct");
| }
| clock.stop("timer");
| return;
| })"
|
| # Create large PD matrix
| size <- 2000
| set.seed(123)
| m <- matrix(rnorm(size^2), size, size)
| m <- tcrossprod(m)
| diag(m) <- rowSums(abs(m)) + 2
|
| sourceCpp(code = src)
| result <- chol_and_inv(m)
| library(RcppClock)
| timer
| ```
| Best wishes,
| _______________________________________________
| Rcpp-devel mailing list
| [email protected]
| https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/rcpp-devel
--
dirk.eddelbuettel.com | @eddelbuettel | [email protected]
_______________________________________________
Rcpp-devel mailing list
[email protected]
https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/rcpp-devel