Evaluating interactive visualization of multidimensional data projection with feature transformation

Xu, Kai (ORCID: https://orcid.org/0000-0003-2242-5440), Zhang, Leishi (ORCID: https://orcid.org/0000-0002-3158-2328), Pérez, Daniel, Nguyen, Phong H. and Ogilvie-Smith, Adam (2017) Evaluating interactive visualization of multidimensional data projection with feature transformation. Multimodal Technologies and Interaction, 1 (3), 13. ISSN 2414-4088 [Article] (doi:10.3390/mti1030013)

PDF - Published version (with publisher's formatting)
Available under License Creative Commons Attribution 4.0.

PDF - Final accepted version (with author's formatting)


There has been extensive research on dimensionality reduction techniques. While these make it possible to visually present high-dimensional data in 2D or 3D, it remains a challenge for users to make sense of such projected data. Recently, interactive techniques, such as Feature Transformation, have been introduced to address this. This paper describes a user study designed to understand how feature transformation techniques affect users' understanding of multi-dimensional data visualisation. Feature transformation was compared with traditional dimension reduction techniques, both unsupervised (PCA) and supervised (MCML). Thirty-one participants were recruited to visually detect clusters and outliers in visualisations produced by these techniques. Six datasets covering a range of dimensionalities and data sizes were used in the experiment. Five of these are benchmark datasets, which makes it possible to compare with other studies using the same datasets. Both task accuracy and completion time were recorded for comparison. The results show that there is a strong case for the feature transformation technique. Participants performed best with the visualisations produced with high-level feature transformation, in terms of both accuracy and completion time. The improvements over other techniques are substantial, particularly in the case of the accuracy of the clustering task. However, visualising data with very high dimensionality (i.e., greater than 100 dimensions) remains a challenge.
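For readers unfamiliar with the unsupervised baseline named in the abstract, the following is a minimal sketch of how PCA projects multi-dimensional points onto 2D. It is not the study's implementation: the function names and the sample data are hypothetical, and it assumes a plain power-iteration-with-deflation approach so that it needs only the Python standard library.

```python
import math
import random


def center(rows):
    """Subtract the per-column mean from every row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def covariance(rows):
    """Sample covariance matrix of already-centred rows (needs at least 2 rows)."""
    n, d = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(d)]
            for i in range(d)]


def top_eigenvector(matrix, iterations=500, seed=0):
    """Power iteration: dominant unit eigenvector and eigenvalue of a symmetric matrix."""
    rng = random.Random(seed)
    d = len(matrix)
    v = [rng.random() + 0.1 for _ in range(d)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:  # matrix maps v to zero; nothing left to extract
            break
        v = [x / norm for x in w]
    # Rayleigh quotient v^T M v gives the eigenvalue for a unit vector v.
    eigenvalue = sum(v[i] * sum(matrix[i][j] * v[j] for j in range(d))
                     for i in range(d))
    return v, eigenvalue


def pca_project(rows, k=2):
    """Project rows onto their first k principal components."""
    centred = center(rows)
    cov = covariance(centred)
    d = len(cov)
    components = []
    for _ in range(k):
        v, lam = top_eigenvector(cov)
        components.append(v)
        # Deflation: remove the component just found so the next
        # power iteration converges to the following eigenvector.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)]
               for i in range(d)]
    return [[sum(r[j] * c[j] for j in range(d)) for c in components]
            for r in centred]


# Hypothetical 4-dimensional sample with two well-separated groups.
sample = [
    [0.0, 0.1, 0.0, 0.2],
    [0.2, 0.0, 0.1, 0.0],
    [0.1, 0.2, 0.2, 0.1],
    [5.0, 5.1, 4.9, 5.2],
    [5.2, 4.9, 5.1, 5.0],
    [4.9, 5.0, 5.2, 5.1],
]
projected = pca_project(sample, k=2)
for point in projected:
    print(point)
```

In this toy sample the two groups end up on opposite sides of the first projected axis, which is the kind of cluster structure participants were asked to detect visually. The sign of each axis is arbitrary, so only relative positions are meaningful.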

Item Type: Article
Research Areas: A. > School of Science and Technology > Computer Science
Item ID: 22196
Depositing User: Leishi Zhang
Date Deposited: 05 Jul 2017 14:14
Last Modified: 29 Nov 2022 20:46
URI: https://eprints.mdx.ac.uk/id/eprint/22196
