A fused deep learning architecture for viewpoint classification of echocardiography

Gao, Xiaohong W. ORCID: https://orcid.org/0000-0002-8103-6624, Li, Wei, Loomes, Martin J. and Wang, Lianyi (2017) A fused deep learning architecture for viewpoint classification of echocardiography. Information Fusion, 36 . pp. 103-113. ISSN 1566-2535 [Article] (doi:10.1016/j.inffus.2016.11.007)

PDF - Final accepted version (with author's formatting)
Available under License Creative Commons Attribution-NonCommercial-NoDerivatives 4.0.

Download (7MB) | Preview


This study extends the state of the art of deep learning convolutional neural network (CNN) to the classification of video images of echocardiography, aiming at assisting clinicians in diagnosis of heart diseases. Specifically, the architecture of neural networks is established by embracing hand-crafted features within a data-driven learning framework, incorporating both spatial and temporal information sustained by the video images of the moving heart and giving rise to two strands of two-dimensional convolutional neural network (CNN). In particular, the acceleration measurement along the time direction at each point is calculated using dense optical flow technique to represent temporal motion information. Subsequently, the fusion of both networks is conducted via linear integrations of the vectors of class scores obtained from each of the two networks. As a result, this architecture maintains the best classification results for eight viewpoint categories of echo videos with 92.1% accuracy rate whereas 89.5% is achieved using only single spatial CNN network. When concerning only three primary locations, 98% of accuracy rate is realised. In addition, comparisons with a number of well-known hand-engineered approaches are also performed, including 2D KAZE, 2D KAZE with Optical Flow, 3D KAZA, Optical Flow, 2D SIFT and 3D SIFT, which delivers accuracy rate of 89.4%, 84.3%, 87.9%, 79.4%, 83.8% and 73.8% respectively.

Item Type: Article
Research Areas: A. > School of Science and Technology > Computer Science
Item ID: 20927
Notes on copyright: © 2016. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
Useful Links:
Depositing User: Xiaohong Gao
Date Deposited: 10 Nov 2016 11:01
Last Modified: 10 Jun 2021 02:51
URI: https://eprints.mdx.ac.uk/id/eprint/20927

Actions (login required)

View Item View Item


Activity Overview

Additional statistics are available via IRStats2.