In practice, however, the number of clusters is underestimated, a

In practice, however, the number of clusters is underestimated, as the dimension is increased beyond a certain value. The difficulty arising from the high-dimensionality

of the data space is called ‘the curse of dimensionality’ (Bishop, 2006) and it should be selleck inhibitor mitigated by eliminating redundant data information. In this study, we reduced the dimension of the feature space by either extracting the principal components or selecting the coefficients of WT of spike waveforms. In the PCA, the raw data were first filtered by a 300th order 200 Hz high-pass finite impulse response filter with Hamming window function. The high order of filtering effectively eliminated the DC component from the filtered signals, which becomes a potential obstacle in spike clustering, at a relatively small cost of computations. The filtered data were resampled at 20 kHz, from –0.5 ms ahead to 1.05 ms behind each detected peak time (equivalently, sampling points in the interval [−10 : 21]), such that point 0 may coincide with the peak LBH589 molecular weight time. Thus, 128-dimensional (four electrodes of 32 points) data were available for each spike. We then extracted 12 principal components from these 128-dimensional data by using PCA. The PCA, however, is not necessarily useful for clustering, as PCA merely extracts the dimension

exhibiting a large variance in data distribution, whereas clustering is most effectively executed in the dimensions in which the data distribution exhibits multiple sharp peaks rather than a single broad peak. Therefore, another spike-sorting algorithm employed WT for extracting the characteristic features of spike waveforms. The raw unfiltered data were resampled at 20 kHz, from −0.5 ms ahead to 1.05 ms behind each detected peak time (equivalently, sampling points in C-X-C chemokine receptor type 7 (CXCR-7) the interval [−10 : 21]), such that point 0 may coincide with the peak time. Note that WT requires no preparatory filtering that depends on an empirical choice of cut-off frequency. We then applied the multi-resolution analysis to the spike waveform (Halata et al., 2000; Quiroga et al.,

2004) obtained from each channel and derived its time–frequency coefficients. We used Harr’s wavelet (Harr, 1910; Mallat, 1998) and the Cohen-Daubechies-Feauveau 9/7 (CDF97) wavelet (Cohen et al., 1992; Daubechies, 1992). After the multi-resolution analysis, we obtained a one-dimensional distribution of each coefficient over the ensemble of spikes recorded with each channel. A feature is only useful for separating units if it has a multi-modal distribution, i.e. a distribution with more than one peak. We reduced the dimensionality of the data by selecting the wavelet coefficients with multi-modal distributions. We evaluated each coefficient by applying the RVB clustering algorithm to the distribution of that coefficient.

Leave a Reply

Your email address will not be published. Required fields are marked *

*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>