Email updates

Keep up to date with the latest news and content from BioMedical Engineering OnLine and BioMed Central.

Open Access Research

Robust Peak Recognition in Intracranial Pressure Signals

Fabien Scalzo*, Shadnaz Asgari, Sunghan Kim, Marvin Bergsneider and Xiao Hu*

Author Affiliations

Department of Neurosurgery, Geffen School of Medicine, Neural Systems and Dynamic Lab (NSDL), University of California, Los Angeles, CA, USA

For all author emails, please log on.

BioMedical Engineering OnLine 2010, 9:61  doi:10.1186/1475-925X-9-61

The electronic version of this article is the complete one and can be found online at: http://www.biomedical-engineering-online.com/content/9/1/61


Received:13 April 2010
Accepted:19 October 2010
Published:19 October 2010

© 2010 Scalzo et al; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

The waveform morphology of intracranial pressure pulses (ICP) is an essential indicator for monitoring, and forecasting critical intracranial and cerebrovascular pathophysiological variations. While current ICP pulse analysis frameworks offer satisfying results on most of the pulses, we observed that the performance of several of them deteriorates significantly on abnormal, or simply more challenging pulses.

Methods

This paper provides two contributions to this problem. First, it introduces MOCAIP++, a generic ICP pulse processing framework that generalizes MOCAIP (Morphological Clustering and Analysis of ICP Pulse). Its strength is to integrate several peak recognition methods to describe ICP morphology, and to exploit different ICP features to improve peak recognition. Second, it investigates the effect of incorporating, automatically identified, challenging pulses into the training set of peak recognition models.

Results

Experiments on a large dataset of ICP signals, as well as on a representative collection of sampled challenging ICP pulses, demonstrate that both contributions are complementary and significantly improve peak recognition performance in clinical conditions.

Conclusion

The proposed framework allows to extract more reliable statistics about the ICP waveform morphology on challenging pulses to investigate the predictive power of these pulses on the condition of the patient.

1 Background

Traumatic Brain Injuries (TBI) affect more than 2 million people annually in the United States, and their incidence in the world keeps increasing [1]. The treatment of TBI patients in critical care units, as well as other neurological disorders, relies on the continuous measurement of intracranial pressure (ICP) (i.e. the sum of the pressures exerted within the craniospinal axis system). It is known that the management of ICP can attenuate secondary brain injuries and improve chances of recovery. Interestingly, the morphology of ICP waveform holds essential informations about the intracranial adaptive capacity (elastance), and even the outcome of head injured patients [2,3]. For example, it has been shown that variations of the ICP pulse morphology are linked to the development of intracranial hypertension [4-6], cerebral vasospasm [7], changes in the cerebral blood carbon dioxide (CO2) levels [8,9], decreased cerebral bloob flow (CBF) [10], and changes in the craniospinal compliance [11].

The extraction of morphological features is essential to monitor and to understand ICP in an automatic fashion with the ultimate goal of improving the treatment of pathophysiological intracranial and cerebrovascular conditions. Although ICP pulses are typically triphasic [8] (i.e. three peaks), their shape can exhibit irregular variations such that some peaks may be missing. The recognition of these top peaks is a challenging task that has recently drawn special attention from different research groups. Several algorithms have been developed to detect the first peak [12], and to recognize the three peaks of ICP pulses [13-17]. Existing methods can be divided into two categories depending if they work offline, like Morphologram [14], or online, like MOCAIP [13,15] (

    M
orphological
    C
lustering and
    A
nalysis of
    I
CP
    P
ulse). These techniques offer a satisfactory accuracy to recognize the peaks in general cases. However our recent observations show that their performance deteriorates significantly when the pulses exhibit abnormalities or are simply more challenging (a pulse is considered to be challenging if any of its peaks fails to be correctly designated by the baseline MOCAIP algorithm [15], see Figure 1). Such ICP pulses are of particular interest because we suspect that they might hold essential predictive information about the patient condition.

This paper investigates how to improve peak recognition accuracy on challenging ICP pulses. The contribution is two-fold. First, to conduct this study, MOCAIP++, a robust ICP pulse processing framework that generalizes MOCAIP, is introduced. The strength of MOCAIP++ relies on its capacity to integrate different peak recognition methods, and to exploit additional features based on the derivatives of the ICP signal. Our experiments evaluate these characteristics by providing a comparative analysis of three different state-of-the-art peak recognition techniques based on Gaussian Models, Gaussian Mixture Model (GMM), and Spectral Regression Analysis (SR), and by evaluating the impact of ICP features, such as curvature, first and second derivatives on the recognition performance. Second, this paper investigates the effect of incorporating challenging pulses into the training set of peak recognition methods learned in a supervised way. A method is proposed to sample automatically a representative challenging dataset of ICP pulses from a large database of ICP signals collected from 128 neurosurgical patients. The original, and the challenging datasets allow to study how the performance of peak recognition methods, essential to extract morphological features, can be improved.

thumbnailFigure 1. Illustration of two ICP pulses (the actual position of the peak is depicted in green, MOCAIP prediction in black). On the left, an ICP dominant pulse is correctly annotated with the position of the three peaks. On the right, the automatic annotation failed to correctly recognized the third peak because of the uncommon shape of the pulse. This pulse is considered as a challenging one in our study.

2 Methods

2.1 ICP Dataset

Generally, ICP signal recordings consist of several hours long segments. By reviewing those files, we observed that the majority of the recordings contain pulses whose peaks are easily recognized by automatic algorithms. A subset of ICP files, however, contains pulses that are not correctly annotated by automatic peak recognition methods. One reason for these mismatches is that the pulse morphology differs significantly from the most common ones. We consider those pulses to be challenging (an example is shown in Figure 1).

The variation in morphology of these challenging pulses might originate from a combination of external factors such as sampling rate of the ICP, noise and artifact due to the acquisition device, or coughing of the patient. It is also possible that some of these morphological variations come from the condition of the patient and that they might hold relevant predictive information. Unfortunately, the peak recognition accuracy of current techniques on the ICP recordings containing those pulses drops dramatically. It is no longer possible to extract reliable statistics about their ICP waveform morphology to perform further analysis. This observation led us to extract a challenging dataset D' (Section 2.1.2) from the dataset D (Section 2.1.1). The new dataset D' is sampled from the recordings of D that contain a large percentage of challenging pulses. Both datasets, that are described below, will be used in the experiments to evaluate the performance of our framework. In addition, we will investigate if the use of the challenging dataset as part of the training set of peak recognition methods can improve their performance.

2.1.1 Original Data

The source dataset of ICP signals originates from patients admitted to the University of California Los Angeles (UCLA) medical center. Its usage in the present study was approved by the UCLA Internal Review Board. It is a large, representative dataset that is reasonably distributed across gender, age, and type of patient (ICU or NON-ICU). A small portion of this dataset was previously used to evaluate MOCAIP [15] and its extensions based on regression analysis [18]. The ICP and ECG signals were acquired from 128 patients treated for various intracranial pressure relted conditions. ICP was monitored continuously using Codman intraparenchymal microsensors (Codman and Schurtleff, Raynaud, MA) placed in the right frontal lobe. ICP signals were recorded from bedside monitors using corporate data acquisition systems at a sampling frequency of either 240 Hz or 400 Hz. A total of 1425 recordings were extracted, each totalizing several hours. Those ICP and ECG signal recordings were subsequently pre-processed by MOCAIP so that they were first divided into 3 minutes segments. Then, a hierarchical clustering was applied on individual pulses of each segment, and the center of the dominant cluster was extracted to produce a dominant pulse. This clustering process leads to a representative set of 87,125 dominant pulses. It is referred to as the original dataset D from which a smaller, but more challenging dataset will be sampled. The actual positions of the three peaks in the ICP are obtained by manual annotation from experienced researchers following the procedure described in the next subsection.

2.1.2 Challenging Data

The selection of a challenging subset of ICP pulses D' ⊂ D is achieved using a weighted sampling procedure from the file recordings of the original dataset D. Intuitively, the sampling aims at extracting more pulses from recordings that contain a larger percentage of challenging pulses so that they are better represented in the resulting dataset. To do so, each recording is associated with a weight corresponding to its degree of difficulty which is high if MOCAIP often fails to recognize the three peaks. The procedure to weight the files is described below.

Experienced researchers establish the groundtruth by manually setting the positions of the three peaks (p1, p2, and p3) in each pulse. The task of the researcher is to pick the right peak candidates among those automatically detected at curve inflections (Section 2.2.2). Whenever one of the three peaks is missing, its position is labelled with the empty set. Among the set of pulses, 7173 have missing p1, 3699 have missing p2, and 4626 have missing p3. Researchers cross-validate their results and, if necessary, they harmonize them using the annotation of the previous and following pulses as reference. For a few difficult cases where the researchers could not agree on the position of some peaks, the pulse was removed from the dataset. This procedure ensures that the groundtruth is not biased to a specific researcher.

In parallel, MOCAIP is applied to annotate each pulse with the position of the three peaks (p1, p2, and p3). To find difficult files, the predictions of MOCAIP are compared with the manual groundtruth. For each ICP file fi= 1...F, a weight wi= 1...F is set proportional to the percentage of wrongly assigned peaks. This is done by comparing the position of each peak of the ground truth to the position obtained from the automatic files,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M1','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M1">View MathML</a>

(1)

where ℰp1, ℰp2, ℰp3 are the number of wrongly assigned peaks and <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M2','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M2">View MathML</a> are the total number of occurrences in the file of the peaks p1, p2, and p3, respectively. The distribution of the weights is illustrated in Figure 2.

thumbnailFigure 2. The distribution of the weights is illustrated for the files in the original dataset of ICP pulses. The weight of each file is set proportional to the percentage of wrong peak assignations. For example, a value of 1 indicates that all the peaks in that file were not assigned correctly by the MOCAIP, this usually happens in short recordings.

Finally, the challenging dataset D' is created by extracting pulses using weighted sampling, such that a pulse has a probability vi (Eq. 2) to be picked from file fi. Therefore, files with large probability vi will contribute to more pulses in the sampled dataset.

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M3','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M3">View MathML</a>

(2)

To avoid redundancy from the files that contain only a few pulses, each pulse is selected at most once during sampling. The resulting dataset is made of 10638 ICP pulses among which 2816 have missing p1, 604 have missing p2, and 692 have missing p3. The challenging pulses are distributed among 58 patients.

2.2 MOCAIP++

This section introduces MOCAIP++, a generalization of the recently developed MOCAIP [15] which is an end-to-end framework that processes raw ICP signals to extract morphological waveform features through the recognition of the three peaks of the pulse. In its original form, MOCAIP relies on a Gaussian model to represent the prior knowledge about the position of each peak in the pulse. The Gaussian priors were replaced by a regression model in a recent extension [18].

MOCAIP++ generalizes its predecessors in two ways. First, it proposes a unifying view such that different peak recognition techniques can be integrated within the framework. Second, an additional processing step allows to exploit ICP features regardless the peak recognition method that is used. Similarly to MOCAIP, a pulse extraction technique (Section 2.2.1) first process the ICP signal to extract a reliable dominant pulse from which peak candidates are located at curves inflections (Section 2.2.2). Then, MOCAIP++ extracts different ICP features from the dominant pulse (such as curvature, first, and second derivative) (Section 2.2.3). The peak recognition module (Section 2.2.4) exploits the peak candidates and the features to recognize the peaks within the pulse. Finally, various statistics are estimated using the latency of these peaks and their ICP elevation (additional details can be found in the original papers [15,18]). The core of the algorithm is illustrated in Figure 3 and its major components are described in the next subsections.

thumbnailFigure 3. Diagram showing the different modules in MOCAIP++ framework.

2.2.1 ICP Segmentation and Dominant Pulse Extraction

The first component of the framework (ICP pulse segmentation) takes a raw, continuous ICP signal and splits it in a series of individual ICP pulses. An individual pulse is found using a pulse extraction technique [19] combined with the ECG QRS detection [20] that locates each ECG beat. Therefore, the latency of the three peaks within the ICP pulse is relative to the ECG QRS.

Because ICP recordings are subject to various noise and artifacts during the acquisition process, a robust dominant pulse Si is extracted from a sequence of consecutive ICP pulses using hierarchical clustering [21]. It corresponds to the centroid of the largest cluster. In other words, the dominant pulse summarizes a short segment of consecutive ICP pulses.

2.2.2 Detecting Peak Candidates

Then, MOCAIP++ detects peak candidates (a1, a2, ..., aN) at curve inflections of the dominant ICP pulse Si by segmenting the pulse into concave and convex regions using the second derivative of the signal. A peak is said to occur at the intersection of a convex and a concave region on a rising edge of ICP pulse, or at the intersection of a concave and a convex region on the descending edge of the pulse.

2.2.3 ICP Features

Previous MOCAIP-based studies [15,18] exploited the dominant pulses directly as input to peak recognition techniques. In signal processing, it is common to derive features that emphasize different properties of the signal. For example, the first derivative measures the changing rate of the signal with respect to time. As illustrated in Figure 4, it is particularly interesting in our case because for a similar amplitude, a wide peak, and a narrow peak will lead to different derivative values. Therefore, features extracted from the ICP signal derivative provide additional morphological characteristics that should help to discriminate between ICP peaks. One advantage of using these features is that they are invariant to a shift of the signal elevation. Note that the framework is not restricted to these features, any other features could in principle be exploited. In our experiments, we will evaluate the impact of using the first Lx and second Lxx derivatives, as well as the curvature K extracted from the ICP signal within MOCAIP++ framework.

thumbnailFigure 4. Signal L made of two Gaussian peaks with different standard deviations. Its first Lx and second Lxx derivatives are particularly usefull to discriminate peaks because their amplitude depends on the peak width but remains invariant to any global shift in elevation of the signal.

First Derivative

For more robustness, the ICP signal I(x) is first convolved with a Gaussian smoothing filter <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M4','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M4">View MathML</a> where σ is the standard deviation of the Gaussian (σ = 3 in our experiments),

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M5','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M5">View MathML</a>

(3)

Then the derivative Lx is computed according to the smoothed version L of the ICP,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M6','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M6">View MathML</a>

(4)

Second Derivative

Similarly, the computation of the second derivative Lxx relies on the first derivative Lx,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M7','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M7">View MathML</a>

(5)

Curvature

The curvature K is computed as a ratio between the first and the second derivative of the signal,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M8','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M8">View MathML</a>

(6)

2.2.4 Peak Recognition

This module aims at recognizing the three peaks (p1, p2, p3) within an ICP pulse among the set of candidate peaks (a1, a2, ..., aN). Depending on the recognition technique, it can exploit the latency of the peak candidates, the raw ICP pulse, or different features extracted from the pulse. In the next, we describe three different peak recognition approaches. They are based on independent Gaussian models [15], Gaussian Mixture Models (GMM), and Spectral Regression (SR) analysis [18], respectively.

(a) Gaussian Model

The original MOCAIP algorithm exploits Gaussian priors to identify the most likely configuration of the three peaks among the set of candidates. Given P(X1), P(X2), and P(X3) to denote the Gaussian probability distribution of the prior position of the three peaks (p1, p2, p3), peak recognition amounts to searching for the maximum of the following objective function,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M9','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M9">View MathML</a>

(7)

where P(Xi = ak) represents the probability of assigning ak to the i-th peak.

In order to deal with missing peaks, an empty designation a0 is added to the pool of candidates. In addition, to avoid false designation, MOCAIP uses a threshold ρ such that P(Xi = ak) = 0, i ∈ {1, 2, 3}, k ∈ {1, 2, ..., N} if the probability of assigning ak to pi is less than ρ.

(b) Gaussian Mixture Models

In contrast with MOCAIP that uses a model of independent Gaussian distributions to represent the likely position of each peak, the method proposed in this paragraph exploits a multi-modal distribution to model the joint latency of the three peaks. Observed peak configurations are approximated by a Gaussian Mixture Model (GMM), where each component i represents a cluster of configurations μi of the three peaks. A GMM is defined as,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M10','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M10">View MathML</a>

(8)

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M11','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M11">View MathML</a>

(9)

where αi, μi, ∑i are the relative weight, the mean, and the variance of an individual component i, and C is the total number of components. For learning, Expectation-Maximization (EM) was used to estimate the model parameters θ = (α1...C ; μ1...C ; ∑1...C) that maximizes the likelihood of the observed peak configurations. EM was performed for a different number of components C ∈ {1, ..., 10}. The number which minimizes the Bayesian Information Criterion (BIC) [22] was selected.

The detection task amounts to find the best configuration of the three peaks among the set of peak candidates a = (a1, a2, ..., aN) detected in the current pulse. This can be done by finding the configuration that is the more likely on the GMM,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M12','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M12">View MathML</a>

(10)

However, an additional difficulty is caused by missing peaks. One way to solve this problem is to use a hierarchical recognition approach where the possible configurations are first evaluated on the 3 peak model. If the largest response r123 = P(X = {p1, p2, p3}|Θ) fails to be above a given threshold τ3, the marginals X12, X13, X23 using only two dimensions of the model are evaluated,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M13','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M13">View MathML</a>

(11)

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M14','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M14">View MathML</a>

(12)

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M15','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M15">View MathML</a>

(13)

and r12 = P(X12 = {p1, p2}|Θ), r13 = P(X13 = {p1, p3}|Θ), r23 = P(X23 = {p2, p3}|Θ). Again, if the maximum response to the GMM model of all the 2-peak configurations max(r12, r13, r23) is below a certain threshold τ2, 1-peak marginals X1, X2, X3 are evaluated, and the peak with the maximum response is marked.

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M16','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M16">View MathML</a>

(14)

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M17','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M17">View MathML</a>

(15)

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M18','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M18">View MathML</a>

(16)

(c) Spectral Regression

In a recent comparison of regression techniques [18], Spectral Regression (SR) [23] demonstrated excellent accuracy in peak recognition on standard ICP pulses. This motivates us to select it as the baseline regression method within MOCAIP++. The regression model yi = f(xi) maps the position of the peaks as a function of the ICP dominant pulse. The model is automatically learned from training ICP pulses S = {Si = 1...n} labeled with the latency of the peaks yi = (p1, p2, p3) within the pulse. Each pulse Si is resized to a vector xi ∈ ℝs of length s = 500 ms, and normalized in amplitude between [1].

SR combines spectral graph analysis and standard linear regression to obtain a model that gives similar predictions <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M19','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M19">View MathML</a> for data samples xi X that are close (i.e. that are nearest neighbors in a graph representation), such that the following measure ϕ is minimized:

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M20','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M20">View MathML</a>

(17)

where W ∈ ℝn × n is the item-item similarity matrix that associates a positive value to Wi,j if the samples xi, xj belong to the same class. This is done by first using the eigenvectors of the matrix W,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M21','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M21">View MathML</a>

(18)

where D is a diagonal matrix whose entries are column sums of W, Di,i = Σj Wj,i, and e0, e1, ..., ed denote the d + 1 eigenvectors with respect to the d + 1 largest eigenvalues λ0λ1≥ ... ≥ λd.

Then SR finds d vectors {<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M22','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M22">View MathML</a>}that minimize the residual Sum of Square Error (SSE),

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M23','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M23">View MathML</a>

(19)

where <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M24','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M24">View MathML</a> is the i-th element of ej.

For recognition on a new pulse xj, the regression model yj = f(xj) predicts the most likely position of the three peaks yj = (p1, p2, p3). A nearest neighbor search is then performed so that the nearest candidate (a1, a2, ..., aN) to each prediction is assigned to the peak label corresponding to the matched prediction. Additional features fi ∈ ℝ(s, where fi ∈ {Lx, Lxx, K}, can be concatenated to the original input xi ∈ ℝ(s to create a new input vector [xi fi] that combines both modalities.

Although Spectral Regression is a linear regression algorithm, it can easily be extended to become nonlinear by using a kernel projection (Radial Basis Function (RBF)) of the input vectors. We further refer to this technique as the Kernel Spectral Regression (KSR).

3 Results and Discussion

3.1 Accuracy of Peak Recognition Methods on Challenging Data

This section provides a comparative analysis of peak recognition techniques by evaluating their performance on the challenging dataset of ICP pulses. Models based on Gaussian (MOCAIP), Gaussian Mixtures (GMM), Spectral Regression (SR), and Kernel Spectral Regression (KSR) models are evaluated. A five-fold cross-validation is performed on the challenging dataset D', such that at each of the five iterations, four folds are used to train the model while the remaining one is retained for evaluation. The partitioning is randomly made with the constraint that the pulses of a given patient are grouped into the same fold. This ensures that data from the same patient are not present at the same time in the training and testing sets.

During evaluation, a predicted position <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M25','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M25">View MathML</a>i of one of the three peaks is considered to be correct if it is equal to the actual position yi established manually. Given that peaks may be set as missing in the groundtruth, True positive (TP), false positive (FP), true negative (TN), or false negative (FN) are defined as follow,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M26','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M26">View MathML</a>

(20)

Based on these measures, the accuracy <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M27','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M27">View MathML</a> of one of the three peaks p ∈ {p1, p2, p3} is defined as,

<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M28','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M28">View MathML</a>

(21)

The accuracy <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M27','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M27">View MathML</a> of a peak p is obtained by averaging the accuracy over the five-folds. Similarly, the overall accuracy <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M29','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M29">View MathML</a> is obtained by averaging the accuracy of the three peaks, <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M30','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M30">View MathML</a>. The learning of the recognition models is supervised in the sense that it relies on a set of manually labelled ICP pulses. As the number of training examples increases, the overall accuracy is generally expected to improve as well. We report this aspect by plotting the average prediction accuracy for each method against the number of training samples in Figure 5. To test one of the 5 folds, a model is trained by randomly extracting n pulses from the remaining 4 folds.

thumbnailFigure 5. Effect of the number of training samples on the average recognition accuracy (Eq. 21) for different models (KSR, SR, GMM, MOCAIP [15]) using a five-fold crossvalidation on our challenging dataset D'. Results correspond to the average for the three peaks (p1, p2, and p3).

Results clearly indicates that KSR performs better by reaching a maximum accuracy of 88.78% ± 2.35, while the other techniques are less accurate; SR obtains 72.57 ± 2.6, GMM 70.47% ± 2.64, and MOCAIP 65.83% ± 2.96. It is interesting to notice that all the methods reaches their maximum accuracy before 500 training pulses. These results confirms that, besides KSR, current methods do not offer good recognition results on challenging pulses. Although KSR performs better than any other techniques, it requires all the training pulses to be kept as a part of the model to be able to compute the kernel projection. Nevertheless, KSR gives us an insight about what performance a peak recognition technique can achieve on our challenging dataset.

3.1.1 Computational Cost

One of the possible applications our framework is to be used in portable devices to monitor ICP continuously. Such an application requires real time performances of the peak recognition techniques. This section evaluates the performance of the different recognition techniques in terms of their complexity by comparing their computational time during learning and recognition.

Table 1 shows that, on a dataset of 2000 ICP pulses, MOCAIP (Gaussian), and SR are the fastest for training their model with only 60 and 90 ms, while GMM is much more slower with 33,940 ms. For recognition on a single pulse, SR ranks first with 0.19 ms, MOCAIP second (6.7 times slower), GMM is 10 times slower, and KSR is about 100 times slower than SR. Batch recognition performance is measured on a set of 2000 pulses. Under these conditions, KSR improves a lot due to the optimization of matrix operations but remains behind SR. Note that the reported durations only represents the running time of the learning, and recognition methods. Additional time is needed for MOCAIP++ to pre-process the ICP, detect peak candidates, and to compute additional features such as curvature and signal derivatives. Running time were measured using built-in MATLAB functions. These tests were performed on a DELL OPTIPLEX 760 computer equipped with INTEL DUAL-CORE E8600 cadenced with a 3.33 GHz processor and 3 GB of RAM.

Table 1. Running time for learning peak recognition models (Gaussian, SR, KSR, and GMM) from 2000 ICP pulses, and for recognition on 1 and 2000 pulses.

3.2 Feature-based Peak Recognition

This section evaluates the impact of the additional ICP features (Section 2.2.3) within MOCAIP++ on peak recognition performance. The same experimental protocol (five-fold cross-validation) of the previous section is used to evaluate the accuracy of SR, KSR, and GMM using three different features; curvature (Curv), first (Lx) and second (Lxx) derivatives on the challenging dataset D'.

Figure 6 shows that each feature significantly improves the overall accuracy of the SR method. While the original SR-based recognition method [18] attains an accuracy of 72.57% ± 2.6, the use of the second derivative and curvature improves it to 80.26 ± 2.29 and 80.4% ± 2.2, respectively. SR performs best when it is combined with the first derivative Lx of the ICP, reaching an accuracy of 85.81% ± 2.5. This constitutes a very significant result (+13%) in favor of our feature-based MOCAIP++ method.

thumbnailFigure 6. The average recognition accuracy (Eq. 21) is reported versus the number of training samples. Results illustrate the effect of three differential features on the SR model [18]: curvature (Curv), first (Lx) and second (Lxx) derivatives.

When combined with derivative-based features, GMM, and KSR methods exhibit a similar ranking of improvement; first derivative offers the largest effect on accuracy, while curvature and second derivatives generally have less significant improvement. With the use of the first derivative (see Figure 7), GMM method improves from 70.47% ± 2.64 to 77.14% ± 1.85, while KSR only shows a marginal improvement from 88.78% ± 2.35 to 89.36% ± 2.51. We have also noticed in additional experiments that combining different features, such as Lx+Lxx, does not improve the performance obtained by using only the first derivative Lx of the ICP signal. These results demonstrate that the use of the first derivative within MOCAIP++ improves the recognition accuracy of the three peak recognition methods we have integrated. It can also be pointed out that the accuracy reached by SR + Lx is very close to KSR + Lx. Considering the previous remarks about the execution time and the storage of training samples for the kernel computation required for KSR, the use of SR combined with the first derivate seems to provide the right tradeoff between speed and accuracy for peak recognition on challenging ICP pulses.

thumbnailFigure 7. Average recognition accuracy (Eq. 21) after a five-fold cross-validation for MOCAIP-based peak recognition methods improved with the use of the first derivative Lx of the ICP.

3.3 Impact of the Training Data Sampling Strategy

Although peak recognition models are trained in a supervised fashion such that they integrate morphological information from pulses with known peaks into models that may correctly identify peaks in new pulses, the underlying training pulses affect the estimation of the parameters and the performance of such models. Intuitively, the model should be trained on a representative range of pulses (easy, or challenging) to gain sufficient precision. This section evaluates the effect of incorporating pulses extracted from the challenging dataset into the training set of peak recognition methods.

In these experiments, peak recognition methods are estimated from two different annotated training sets (<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M31','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M31">View MathML</a>). The first training set, named reference library <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M32','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M32">View MathML</a>, is made of 3000 randomly selected ICP pulses from the original dataset D. These pulses present a wide range of morphological variations but the majority of them are generally easily annotated. A subset of these data was used in previous works [15] to train MOCAIP. The second training set, named weighted sampling <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M33','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M33">View MathML</a>, is made of 1500 randomly selected pulses from the original dataset D, plus 1500 pulses randomly extracted from the challenging dataset D' created using a weighted sampling procedure (Section 2.1.2). Unlike previous section, where peak recognition methods were assessed against the challenging dataset D', the evaluation is now performed on the full dataset D. This allows us to see if the methods are not subject to overfitting; we verify if the methods that offer good results on challenging data also generalize well on regular pulses.

Figure 8 gives the average accuracy. It can be seen that the use of an equal number of pulses sampled from the full and challenging dataset considerably improves the performance of peak recognition methods over models exclusively learned on random pulses. The improvement is as follows: MOCAIP, from 72.31% to 87.27%, SR, from 75.96% to 82.41%, SR+Lx, from 83.67% to 92.74%, and KSR+Lx from 90.44% to 93.64%. The combination of our two contributions, the use of the first derivative and the weighted sampling for training, improves SR-based MOCAIP approach by about

    17
% (from 75.96% to 92.74%). This is a very significant improvement of performance that should help to extract more reliable statistics about ICP pulse morphology in real clinical conditions.

thumbnailFigure 8. Effect of the source data used to train the peak recognition models on the average recognition accuracy (Eq. 21) evaluated on the large dataset D. The reference library <a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M32','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M32">View MathML</a> is made of randomly chosen ICP pulses. The second training set, weighted sampling<a onClick="popup('http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M33','MathML',630,470);return false;" target="_blank" href="http://www.biomedical-engineering-online.com/content/9/1/61/mathml/M33">View MathML</a>, is made of an equal number of randomly selected pulses from the large dataset D and challenging pulses randomly extracted from the challenging dataset D'. Both datasets contain 3000 pulses

4 Conclusions

Recent works suggest that changes in the waveform morphology of ICP may provide insight to forecasting critical intracranial and cerebrovascular pathophysiological variations. However, automatic analysis of the waveform morphology of ICP acquired in clinical conditions is still beyond current ICP analysis frameworks. Their performance deteriorates significantly when the morphology of the pulse exhibits uncommon morphological changes.

This paper has described MOCAIP++, a generalization of the recently developed MOCAIP, that provides a robust framework for analyzing Intracranial Pressure signal (ICP) in terms of its waveform morphology. The proposed approach improves current methods by allowing the integration of several peak recognition methods. In addition, whereas previous MOCAIP-based studies [15,18] exploited dominant pulses directly as input to peak recognition techniques, MOCAIP++ allows to derive additional features that capture more informative properties of the ICP signal and hence better discriminate the three peaks. The first derivative of the ICP signal has been shown to be the best among the features tested in our experiments (as shown in Figure 9). It improved all the peak recognition methods. This can be explained by its invariance to global shift in elevation from the baseline of the pulse. Performance in terms of peak recognition accuracy obtained by the proposed SR-based extension are close to the non-linear Kernel Spectral Regression (KSR). KSR can be considered really close to the best performing solution for this problem but it has the disadvantage to require to keep all the training samples, and is much slower than the SR.

Another contribution of this paper is to show that incorporating a challenging subset of ICP pulses into the training set of peak recognition methods has a positive effect on their overall accuracy.

thumbnailFigure 9. Illustration of challenging ICP pulses where the original MOCAIP failed to recognize at least one of the peaks. The actual position of the peaks, correctly predicted by MOCAIP++ (SR + Lx) are depicted by green circles, while the black diamonds correspond to the MOCAIP predictions

Experiments on a large dataset of ICP signals, as well as on a representative collection of sampled challenging ICP pulses, demonstrate that both contributions are complementary and significantly improve the recognition performance of ICP peaks in real conditions. These findings provide insight in order to potentially improve other ICP peak recognition frameworks, and will help us to collect more reliable statistics about ICP morphology to further investigate its predictive power on patient condition.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

FS conceived the study, carried out the implementation of the framework, and drafted the manuscript. SA participated to the design of performance evaluation methods and statistical analysis. SK participated to the formalization of the framework. MB provided physiological and clinical background to the study. XH conceived the study including the idea of challenging pulse set and error-based sampling in addition to coordinating the execution of the study. All authors participated to the writing of the manuscript and approved the final manuscript.

Acknowledgements

The present work is supported in part by NINDS grants, R21-NS059797 and R01-NS054881.

References

  1. Bradshaw D: Traumatic Brain Injury Task Force. [http:/ / www.armymedicine.army.mil/ reports/ tbi/ TBITaskForceReportJanuary2008.pdf] webcite

    Department of Defense and Department of Veterans Affairs 2008. OpenURL

  2. Balestreri M, Czosnyka M, Steiner L, Schmidt E, Smielewski P, Matta B, Pickard J: Intracranial hypertension: what additional information can be derived from ICP waveform after head injury?

    Acta Neurochir (Wien) 2004, 146(2):131-141. PubMed Abstract | Publisher Full Text OpenURL

  3. Czosnyka M, Guazzo E, Whitehouse M, Smielewski P, Czosnyka Z, Kirkpatrick P, Piechnik S, Pickard J: Significance of intracranial pressure waveform analysis after head injury.

    Acta Neurochir (Wien) 1996, 138(5):531-41. PubMed Abstract | Publisher Full Text OpenURL

  4. Contant CF, Robertson CS, Crouch J, Gopinath SP, Narayan RK, Grossman RG: Intracranial pressure waveform indices in transient and refractory intracranial hypertension.

    J Neurosci Methods 1995, 57:15-25. PubMed Abstract | Publisher Full Text OpenURL

  5. Hu X, Glenn T, Scalzo F, Bergsneider M, Sarkiss C, Martin N, Vespa P: Intracranial pressure pulse morphological features improved detection of decreased cerebral blood flow.

    Physiol Meas 2010, 31(5):679-695. PubMed Abstract | Publisher Full Text OpenURL

  6. Takizawa H, Gabra-Sanders T, Miller JD: Changes in the cerebrospinal fluid pulse wave spectrum associated with raised intracranial pressure.

    Neurosurgery 1987, 20(3):355-61. PubMed Abstract | Publisher Full Text OpenURL

  7. Cardoso ER, Reddy K, Bose D: Effect of subarachnoid hemorrhage on intracranial pulse waves in cats.

    J Neurosurg 1988, 69(5):712-8. PubMed Abstract | Publisher Full Text OpenURL

  8. Cardoso ER, Rowan JO, Galbraith S: Analysis of the cerebrospinal fluid pulse wave in intracranial pressure.

    J Neurosurg 1983, 59(5):817-21. PubMed Abstract | Publisher Full Text OpenURL

  9. Portnoy HD, Chopp M: Cerebrospinal fluid pulse wave form analysis during hypercapnia and hypoxia.

    Neurosurgery 1981, 9:14-27. PubMed Abstract | Publisher Full Text OpenURL

  10. Hu X, Xu P, Asgari S, Vespa P, Bergsneider M: Forecasting ICP Elevation Based on Prescient Changes of Intracranial Pressure Waveform Morphology.

    IEEE Trans Biomed Eng 2010, 57(5):1070-1078. PubMed Abstract | Publisher Full Text OpenURL

  11. Chopp M, Portnoy HD: Systems analysis of intracranial pressure. Comparison with volume-pressure test and CSF-pulse amplitude analysis.

    J Neurosurg 1980, 53(4):516-27. PubMed Abstract | Publisher Full Text OpenURL

  12. Aboy M, McNames J, Thong T, Tsunami D, Ellenby M, Goldstein B: An automatic beat detection algorithm for pressure signals.

    IEEE Trans Biomed Eng 2005, 52(10):1662-1670. PubMed Abstract | Publisher Full Text OpenURL

  13. Eide P: A new method for processing of continuous intracranial pressure signals.

    Medical Engineering & Physics 2006, 28(6):579-587. OpenURL

  14. Ellis T, McNames J, Aboy M: Pulse Morphology Visualization and Analysis With Applications in Cardiovascular Pressure Signals.

    IEEE Trans Biomed Eng 2007, 54(9):1552-1559. PubMed Abstract | Publisher Full Text OpenURL

  15. Hu X, Xu P, Scalzo F, Vespa P, Bergsneider M: Morphological Clustering and Analysis of Continuous Intracranial Pressure.

    IEEE Trans Biomed Eng 2009, 56(3):696-705. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  16. Scalzo F, Xu P, Bergsneider M, Hu X: Random Subwindows for Robust Peak Recognition in Intracranial Pressure Signals.

    Lect Notes Comput Sc 2008, 358:370-380. Publisher Full Text OpenURL

  17. Scalzo F, Xu P, Bergsneider M, Hu X: Nonlinear regression for sub-peak detection of intracranial pressure signals.

    IEEE Engineering in Medicine and Biology Society (EMBS) 2008, 5411-5414. OpenURL

  18. Scalzo F, Xu P, Asgari S, Bergsneider M, Hu X: Regression Analysis for Peak Designation in Pulsatile Pressure Signals.

    Med Biol Eng Comput 2009, 47(9):967-977. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  19. Hu X, Xu P, Lee D, Vespa P, Bergsneider M: An Algorithm of Extracting Intracranial Pressure Latency Relative to Electrocardiogram R Wave.

    Physiol Meas 2008, 29:459-471. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  20. Afonso VX, Tompkins WJ, Nguyen TQ, Luo S: ECG beat detection using filter banks.

    IEEE Trans Biomed Eng 1999, 46(2):192-202. PubMed Abstract | Publisher Full Text OpenURL

  21. Kaufman L, Rousseeuw PJ: Finding groups in data: an introduction to cluster analysis. Wiley series in probability and mathematical statistics, Hoboken, N.J.: Wiley; 2005.

    [Leonard Kaufman, Peter J. Rousseeuw.]

    OpenURL

  22. Schwarz EG: Estimating the dimension of a model.

    Annals of Statistics 1978, 6(2):461-464. Publisher Full Text OpenURL

  23. Cai D, He X, Han J: SRDA: An Efficient Algorithm for Large-Scale Discriminant Analysis.

    IEEE Trans Knowl Data Eng 2008, 20:1-12. Publisher Full Text OpenURL