Title: | Model-Based Clustering via Matrix-Variate Mixture Models |
---|---|
Description: | Implements finite mixtures of matrix-variate contaminated normal distributions via expectation conditional-maximization algorithm for model-based clustering, as described in Tomarchio et al.(2020) <arXiv:2005.03861>. One key advantage of this model is the ability to automatically detect potential outlying matrices by computing their a posteriori probability of being typical or atypical points. Finite mixtures of matrix-variate t and matrix-variate normal distributions are also implemented by using expectation-maximization algorithms. |
Authors: | Salvatore D. Tomarchio [aut], Michael P.B. Gallaugher [aut, cre], Antonio Punzo [aut], Paul D. McNicholas [aut] |
Maintainer: | Michael P.B. Gallaugher <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.0.0 |
Built: | 2025-02-12 05:41:10 UTC |
Source: | https://github.com/cran/MatrixMixtures |
Fits, by using expectation-maximization algorithms, mixtures of matrix-variate distributions (normal, t, contaminated normal) to the given data. Can be run in parallel. The Bayesian information criterion (BIC) is used to select the number of groups.
MatrixMixt( X, G = 1:3, mod, tol = 1e-05, maxiter = 10000, ncores = 1, verbose = TRUE )
MatrixMixt( X, G = 1:3, mod, tol = 1e-05, maxiter = 10000, ncores = 1, verbose = TRUE )
X |
A list of dimension |
G |
A vector containing the numbers of groups to be tried. |
mod |
The matrix-variate distribution to be used for the mixture model. Possible
values are: |
tol |
Threshold for Aitken's acceleration procedure. Default value is |
maxiter |
Maximum number of iterations of the algorithms. Default value is |
ncores |
A positive integer indicating the number of cores used for running in parallel.
Default value is |
verbose |
Logical indicating whether the running output should be displayed. |
A list with the following elements:
flag |
Convergence flag (TRUE - success, FALSE - failure). |
pig |
Vector of the estimated mixing proportions (length G). |
nu |
Vector of the estimated degree of freedoms (length G). Only for "MVT". |
alpha |
Vector of the estimated inliers proportions (length G). Only for "MVCN". |
eta |
Vector of the estimated inflation parameters (length G). Only for "MVCN". |
M |
Array of the mean matrices (p x r x G). |
Sigma |
Array of the estimated row covariance matrices (p x p x G). |
Psi |
Array of the estimated column covariance matrices (r x r x G). |
class |
Vector of estimated data classification. |
z |
Matrix of estimated posterior probabilities (N x G). |
v |
Matrix of estimated inlier probabilities (N x G). Only for "MVCN". |
lik |
Estimated log-likelihood. |
BIC |
Estimated BIC. |
data(SimX) res <- MatrixMixt(X = SimX, G = 2, mod = "MVCN")
data(SimX) res <- MatrixMixt(X = SimX, G = 2, mod = "MVCN")
A simulated dataset with 2 groups and 80 observations. Each group consists of 40 observations, 5 of which are outliers.
data(SimX)
data(SimX)
An object of class list
of length 80.