Exploratory tools for outlier detection in compositional data with structural zeros

Loading...
Thumbnail Image
Author (Corporation)
Publication date
2017
Typ of student thesis
Course of study
Type
01A - Journal article
Editors
Editor (Corporation)
Supervisor
Parent work
Journal of Applied Statistics
Special issue
DOI of the original publication
Link
Series
Series number
Volume
44
Issue / Number
4
Pages / Duration
734-752
Patent number
Publisher / Publishing institution
Taylor & Francis
Place of publication / Event location
Edition
Version
Programming language
Assignee
Practice partner / Client
Abstract
The analysis of compositional data using the log-ratio approach is based on ratios between the compositional parts. Zeros in the parts thus cause serious difficulties for the analysis. This is a particular prob- lem in case of structural zeros, which cannot be simply replaced by a non-zero value as it is done, e.g. for values below detection limit or missing values. Instead, zeros to be incorporated into further sta- tistical processing. The focus is on exploratory tools for identifying outliers in compositional data sets with structural zeros. For this pur- pose, Mahalanobis distances are estimated, computed either directly for subcompositions determined by their zero patterns, or by using imputation to improve the efficiency of the estimates, and then pro- ceed to the subcompositional and subgroup level. For this approach, new theory is formulated that allows to estimate covariances for imputed compositional data and to apply estimations on subgroups using parts of this covariance matrix. Moreover, the zero pattern structure is analyzed using principal component analysis for binary data to achieve a comprehensive view of the overall multivariate data structure. The proposed tools are applied to larger compositional data sets from official statistics, where the need for an appropriate treatment of zeros is obvious.
Keywords
Project
Event
Exhibition start date
Exhibition end date
Conference start date
Conference end date
Date of the last check
ISBN
ISSN
1360-0532
0266-4763
Language
English
Created during FHNW affiliation
No
Strategic action fields FHNW
Publication status
Published
Review
Peer review of the complete publication
Open access category
Closed
License
Citation
Templ, M., Hron, K., & Filzmoser, P. (2017). Exploratory tools for outlier detection in compositional data with structural zeros. Journal of Applied Statistics, 44(4), 734–752. https://doi.org/10.1080/02664763.2016.1182135