(Circulation. 2001;103:2711.)
© 2001 American Heart Association, Inc.
Clinical Investigation and Reports |
From the University of Colorado Health Sciences Center, the Childrens Hospital, Denver (C.G.D., R.S., L.V-C.), and the Department of Mechanical Engineering, University of Colorado, Boulder (S.B., J.H., R.S., R.L.M.), Colo.
Correspondence to Curt G. DeGroff, MD, University of Colorado Health Science Center, the Childrens Hospital, 1056 E 19th Ave, B-100 Denver, CO 80218. E-mail degroff.curt{at}tchden.org
| Abstract |
|---|
|
|
|---|
Methods and ResultsUsing an electronic stethoscope, heart sounds were recorded from 69 patients (37 pathological and 32 innocent murmurs). Sound samples were processed using digital signal analysis and fed into a custom ANN. With optimal settings, sensitivities and specificities of 100% were obtained on the data collected with the ANN classification system developed. For future unknowns, our results suggest the generalization would improve with better representation of all classes in the training data.
ConclusionWe demonstrated that ANNs show significant potential in their use as an accurate diagnostic tool for the classification of heart sound data into innocent and pathological classes. This technology offers great promise for the development of a device for high-volume screening of children for heart disease.
Key Words: heart murmurs neural networks (computer) child heart defects, congenital
| Introduction |
|---|
|
|
|---|
There has been much excitement in the scientific literature
in recent years regarding artificial neural networks
(ANNs)8 9 in
medicine10 and,
specifically, in cardiology
applications.7 8 11 12 13 14 15 16 17 18 19 20
ANNs are valuable tools used in complex pattern recognition and
classification tasks. They learn complex interactions among inputs and
identify relations in input data that may not be apparent to human
analysis.14 The most
common type of ANN consists of 3 layers of processing units: the input
layer, the hidden layer, and the output layer connected in sequence
(Figure 1
), details of which can be found in Appendix
.
|
Studies of ANNs in cardiology have been mainly concerned with the evaluation of ECG signals.14 15 16 18 19 20 There have been several studies on the use of ANNs on heart sounds with limited results and applicability.7 11 12 21 The aim of the present study was to train an ANN to distinguish between innocent and pathological murmurs effectively.
| Methods |
|---|
|
|
|---|
Recordings
Using an electronic stethoscope (Cambridge Heart
Sound Microphone; frequency range, 40 to 600 Hz) and a personal
computer (sampling at 44 kHz), heart sounds were recorded at the
left midsternal border in the supine position in a standard examination
room for all patients. Younger patients were placed in a darkened
examination room with minimal stimuli to encourage cooperation. No
sedation was used. For each patient, 2 separate heart sound
recordings were acquired. One recording location was
chosen with the expectation that murmurs generally not heard from this
location (using standard stethoscopic methods) would be detected by the
electronic microphone used due to its sensitivity.
Two heart sound recordings for each patient of
8
heart cycles in length were recorded. Subsequently, one sound
sample of 3 repre-sentative consecutive heart
cycles was chosen from these 2 recordings by one of the
investigators (C.G.D.) for all 69 patients. The investigator chose the
optimal sound sample by listening to the entirety of the 2
recordings and qualitatively determining the sound sample (of 3
consecutive heart cycles) with the least amount of extraneous noise,
such as breathing, talking, motion artifact, and room noises. This
optimal sound sample was then processed using digital signal
analysis
(Figure 2
).
|
Signal Analysis
A normalized energy spectrum of the sound data was
obtained by applying a Fast Fourier
Transform22 to the heart
sounds. Various spectral resolutions (1, 3, and 5 Hz) and frequency
ranges (0 to 90, 0 to 150, 0 to 210, 0 to 255, and 0 to 300 Hz) were
used as input into the ANN to optimize these parameters to
obtain the most favorable results.
Artificial Neural Network
We used customized ANN software developed in our
laboratory (CUANN; see Appendix B). ANNs were trained to discriminate
between normal and pathological examples. Because of the size of the
available data, the Jack-Knifing
method23 was used. The
Jack-Knifing method is an iterative process in which one observation is
recruited each time for validation. A classifier is trained using the
remaining data and validated on the single, left-out validation point.
This ensures that the validation is unbiased, because the classifier
does not see the validation point during its training. One by one, each
available example is recruited for validation. The approach measures
the power of the classification approach rather than of one specific
classifier.
Each of the 69 data sets was recruited for validation, one
at a time, creating 69 separately trained ANNs. For each spectral
resolution (1, 3, and 5 Hz) and frequency range considered (0 to 90, 0
to 150, and 0 to 210 Hz), a set of 69 neural networks with identical
numbers of input, hidden, and output neurons was created
(Table 1
).
|
Statistical Analysis
Standard equations for sensitivity and specificity
were
used.24
| Results |
|---|
|
|
|---|
|
Recordings
For the majority of patients, the entire
recording procedure took <5 minutes (maximum, 10 minutes). All
of the patient recordings were deemed of acceptable quality by
one investigator (C.G.D.).
ANN Predictions
Table 2
shows the results obtained with a spectral
resolution of 3 Hz and various frequency ranges. Sensitivities and
specificities >90% were obtained.
Figure 4A
shows the predictions of the 69 ANNs (with a
spectral range of 0 to 150 Hz and a spectral resolution of 3 Hz)
constituting a feature set of 50 constituent elements. ANN prediction
is plotted on the y axis versus
the corresponding validation example number on the
x axis. The data were arranged
in a specific, constant order. The first 37 examples (cases 1 through
37) were pathological murmurs, and the next 32 (cases 38 through 69)
were innocent murmurs. The horizontal dashed line represents
the decision threshold. Setting the threshold to either extremity (0 or
1) results in trivial results so that all examples are classified as
normal or pathological. With a threshold of 0.8, a sensitivity of 100%
and a specificity of 92% were obtained. Observe the erroneous
classifications with respect to this threshold marked by bulls-eyes.
Interestingly, increasing the spectral range past 0 to 150 Hz while
keeping the spectral resolution to 3 Hz produced worsening results
(Table 2
), with the ANN predictions not as tightly
distributed
(Figure 4B
).
|
|
To further improve the performance, the spectral
resolution was increased to 1 Hz. In the frequency range of 0 to 90 Hz,
the performance was inadequate. However, it improved as the
spectrum was expanded
(Table 2
). With the upper limit of the frequency range
increased from 90 to 150 Hz, the accuracy of the classifier system
improved to 100% sensitivity and specificity. Increasing the upper
limit from 150 to 210 Hz continued to improve the classifier system
(Table 2
and
Figure 4C
). However, when the upper limit was increased
still further to 255 Hz, the trend was reversed. Although 100%
sensitivity and specificity were maintained, the ANN predictions were
not as tightly distributed (data not shown), in a manner similar to
what occurred with the spectral resolution of 3
Hz.
| Discussion |
|---|
|
|
|---|
In this study, 100% sensitivity and specificity were obtained for the ANN distinguishing between the 69 innocent and pathological heart murmurs presented to it using a spectral resolution of 1 Hz and a spectrum of 0 to 210 Hz. In general, a finer resolution and larger spectrum yielded improved results. The trend in improvement of classifier accuracy with wider energy spectrum may be explained as follows. As the energy spectrum is expanded, the ANN classifier has more information to work with, and its accuracy tends to increase. However, the increased computational load on the ANN may offset this benefit, and it is seen that at a certain point, the trend is reversed. Also, it is common knowledge that a pattern recognition that can handle a larger feature set is better unless the larger feature set contains redundant information. The trend in improved classification performance with higher Fast Fourier Transform resolution was confirmed by decreasing the resolution to 5 Hz (data not shown). Accuracy comparable to 1 Hz or 3 Hz levels was never achieved at this resolution.
Although we deployed all 69 examples for validation, only one example served for validation each time, thus creating 69 separately trained ANNs. The question arose as to whether a classifier system comprising a set of ANNs accruing from the Jack-Knifing approach could be field-deployed. Under such a scheme, the individual ANNs would vote in the classification of a previously unseen observation. To investigate this matter and having no additional data, the data were divided into 2 sets. (1) A set of 54 examples was earmarked as training data for the development of a system of 54 ANNs following the Jack-Knifing scheme, and (2) a set of 15 examples was reserved as field-testing data exclusively to validate the 54 ANNs as multiple experts, with the decision effected by a rule of simple majority. Using the validation set, the system of ANNs was able to classify 7 of 9 pathological examples and 5 of 6 innocent examples correctly. Both of the misclassified pathological examples, a case with pulmonary stenosis and a case with an atrial septal defect, belonged to pathological classes that were grossly under-represented in the training data. When these misclassified pathological examples and the misclassified normal example were reintroduced into the training data set, a threshold could be established with respect to which the resulting classifier system of 57 ANNs was fully accurate when field-tested on the 12 validation examples. These results suggest that ANN generalization would improve with better representation of all classes in the training data, for which more data would have to be collected.
Finally, the current results include training and testing of the classifier system with only one class of aortic stenosis (moderate) and only one class of pulmonary stenosis (mild). Clearly, a wide spectrum of murmurs for each type of pathological case has not yet been presented to the network. However, the classifier system did perform well with a wide range of sound "signatures" found to be present in the patients with murmurs from ventricular septal defects (Appendix C).
Overall, we propose that as additional data are collected, a point will be reached when the important pathological types will each be adequately represented and the generalization capability of the ANN classifier system developed by Jack-Knifing will stabilize at some high accuracy. At that point, misclassifications will be rare and field deployment will be possible. During the stabilizing process, a particular frequency range, spectral resolution, and threshold will be selected to optimize results. Also, after this point, whether the classifier system will be designed so that it continues to "learn" is beyond the scope of this discussion.
Limitations and Future Work
The process of selecting the optimal 3 consecutive
heart cycles required an intelligent selection process. In developing a
screening device, automating this selection
process33 will need to be
investigated. The performance of the ANN with the addition of
noise processes (eg, from an uncooperative patient) to the input signal
will need to be investigated. All of our 69 patients were cooperative
enough for us to obtain adequate heart sound recordings. An
intelligent automated selection algorithm will also need to determine
if an adequate sample exists at all.
The most pressing need at present is to collect more data., In future efforts, it would be impractical to consider that the classifier system could be introduced to all possible heart murmur "signatures." Therefore, future efforts will focus on presenting the ANN with a wide spectrum of each of the common pathological cases studied here, both for training and testing purposes. However, our results here are quite promising with the limited data set available.
If, after additional training data are presented to the ANN classifier system, the system is unable to maintain the accuracy that has been shown here, recordings from the standard 5 cardiac auscultation sites will be considered as inputs. This may be necessary for murmurs not generally heard well at the one recording location chosen (left midsternal border). However, the sensitivity of the electronic microphone used will potentially grant us the economy of one recording location.
Finally, there will most likely be classes of patients with heart lesions for which a screening device using these algorithms may be expected to produce false negatives. Therefore, in patients for whom such a screening device determines that a pathological murmur does not exist, it would simply declare: "no abnormal cardiac murmurs detected." Two examples are newborns with cardiac shunt lesions in whom murmurs typically appear only after pulmonary pressures begin to drop and those with heart lesions whose murmurs may appear only during examination with postural changes.
| Conclusions |
|---|
|
|
|---|
| Appendix A |
|---|
|
|
|---|
![]() | (1) |
![]() | (2) |
To train an ANN for developing input-output mapping, data are required that are representative of the mapping. The first step of training is the forward pass, which consists of calculating the output vector by running the input vector through the ANN. This is followed by a backward pass, where the error derivatives are calculated for each weight. The error derivatives for a weight are summed until all the data points have been run through the network once. This constitutes an epoch. The weights are updated after each epoch such that the ANN error decreases. For a classification task, the output neurons of the ANN represent individual classes where, typically, a binary classification scheme is used.
| Appendix B |
|---|
|
|
|---|
| Appendix C |
|---|
|
|
|---|
![]() | (3) |
The same metric, applied to just one class, provides a
measure of intraclass similarity. In this case, the
metric is evaluated as follows:
![]() | (4) |
We compared the set of examples representing ventricular septal defect murmurs with the set of examples with innocent murmurs. The intraclass correlation metric was 78% for the defects and 77% for the innocent murmurs. Because it was known that the 32 instances of innocent murmurs in our data set included disparate examples, we concluded that the set of ventricular septal defect examples was also heterogeneous.
| Acknowledgments |
|---|
Received December 31, 2000; revision received March 19, 2001; accepted March 21, 2001.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. P. Finley, A. E. Warren, G. P. Sharratt, and M. Amit Assessing Children's Heart Sounds at a Distance With Digital Recordings Pediatrics, December 1, 2006; 118(6): 2322 - 2325. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. B. Mahnke, A. Nowalk, D. Hofkosh, J. R. Zuberbuhler, and Y. M. Law Comparison of Two Educational Interventions on Pediatric Resident Auscultation Skills Pediatrics, May 1, 2004; 113(5): 1331 - 1335. [Abstract] [Full Text] [PDF] |
||||
![]() |
P Ramnarayan and J Britto Paediatric clinical decision support systems Arch. Dis. Child., November 1, 2002; 87(5): 361 - 362. [Full Text] [PDF] |
||||
![]() |
L B Dahl, P Hasvold, E Arild, T Hasvold, C Wren, L B Dahl, P Hasvold, E Arild, and T Hasvold Heart murmurs recorded by a sensor based electronic stethoscope and e-mailed for remote assessment Arch. Dis. Child., October 1, 2002; 87(4): 297 - 301. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Circulation Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2001 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |