TY - JOUR
T1 - Robust and scalable manifold learning via landmark diffusion for long-term medical signal processing
AU - Shen, Chao
AU - Lin, Yu Ting
AU - Wu, Hau Tieng
N1 - Funding Information:
The authors acknowledge Dr. Shen-Chih Wang for the fruitful discussion and suggestion. The work of Yu-Ting Lin was supported by the National Science and Technology Development Fund (MOST 108-2115-M-075-001) of Ministry of Science and Technology, Taipei, Taiwan. The authors also want to thank the associate editor and three anonymous reviewers for their constructive and helpful comments.
Publisher Copyright:
© 2022 Chao Shen, Yu-Ting Lin, and Hau-Tieng Wu.
PY - 2022
Y1 - 2022
N2 - Motivated by analyzing long-termphysiological time series, we design a robust and scalable spectral embedding algorithm that we refer to as RObust and Scalable Embedding via LANdmark Diffusion ( Roseland). The key is designing a diffusion process on the dataset where the diffusion is done via a small subset called the landmark set. Roseland is theoretically justified under the manifold model, and its computational complexity is comparable with commonly applied subsampling scheme such as the Nyström extension. Specifically, when there are n data points in Rq and nβ points in the landmark set, where β ∈ (0; 1), the computational complexity of Roseland is O(n1+2β + qn1+β), while that of Nystrom is O(n2:81β +qn1+2β). To demonstrate the potential of Roseland, we apply it to three datasets and compare it with several other existing algorithms. First, we apply Roseland to the task of spectral clustering using the MNIST dataset (70,000 images), achieving 85% accuracy when the dataset is clean and 78% accuracy when the dataset is noisy. Compared with other subsampling schemes, overall Roseland achieves a better performance. Second, we apply Roseland to the task of image segmentation using images from COCO. Finally, we demonstrate how to apply Roseland to explore long-term arterial blood pressure waveform dynamics during a liver transplant operation lasting for 12 hours. In conclusion, Roseland is scalable and robust, and it has a potential for analyzing large datasets.
AB - Motivated by analyzing long-termphysiological time series, we design a robust and scalable spectral embedding algorithm that we refer to as RObust and Scalable Embedding via LANdmark Diffusion ( Roseland). The key is designing a diffusion process on the dataset where the diffusion is done via a small subset called the landmark set. Roseland is theoretically justified under the manifold model, and its computational complexity is comparable with commonly applied subsampling scheme such as the Nyström extension. Specifically, when there are n data points in Rq and nβ points in the landmark set, where β ∈ (0; 1), the computational complexity of Roseland is O(n1+2β + qn1+β), while that of Nystrom is O(n2:81β +qn1+2β). To demonstrate the potential of Roseland, we apply it to three datasets and compare it with several other existing algorithms. First, we apply Roseland to the task of spectral clustering using the MNIST dataset (70,000 images), achieving 85% accuracy when the dataset is clean and 78% accuracy when the dataset is noisy. Compared with other subsampling schemes, overall Roseland achieves a better performance. Second, we apply Roseland to the task of image segmentation using images from COCO. Finally, we demonstrate how to apply Roseland to explore long-term arterial blood pressure waveform dynamics during a liver transplant operation lasting for 12 hours. In conclusion, Roseland is scalable and robust, and it has a potential for analyzing large datasets.
UR - http://www.scopus.com/inward/record.url?scp=85130311546&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130311546&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:85130311546
SN - 1532-4435
VL - 23
JO - Journal of Machine Learning Research
JF - Journal of Machine Learning Research
ER -