Modification of Semi-supervised Algorithm Based on Gaussian Random Fields and Harmonic Functions
Keywords: machine learning, semi-supervised learning, label propagation, Gaussian random fields, k nearest neighbors, harmonic functions
In this paper we propose an improvement to a semi-supervised learning algorithm based on Gaussian random fields and harmonic functions. This is a graph-based method that uses the similarity between data points to connect unlabeled points with labeled ones, thereby propagating labels. The proposed improvement changes how the similarity between two points is determined: it uses a hybrid RBF-kNN kernel. This makes the algorithm more resilient to noise and makes label propagation more locality-aware. The improvement was tested on five synthetic datasets. The results indicate no improvement on datasets with a large margin between classes; on datasets with a small margin, however, the proposed approach with the hybrid kernel outperforms existing algorithms that use a simple kernel.
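To make the idea concrete, the following is a minimal sketch of harmonic-function label propagation on a graph whose edge weights combine an RBF kernel with a kNN neighborhood. The abstract does not define the hybrid kernel, so the form used here (RBF weights kept only on edges between k-nearest neighbors, symmetrized) is an assumption, as are the function names and the parameters `sigma` and `k`. The harmonic solution f_u = (D_uu - W_uu)^{-1} W_ul f_l is the standard closed form from Zhu, Ghahramani, and Lafferty (2003).

```python
import numpy as np


def hybrid_rbf_knn_weights(X, sigma=1.0, k=5):
    """RBF similarities kept only on kNN edges (an assumed hybrid form)."""
    # Squared Euclidean distances between all pairs of points.
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-sq_dists / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)  # no self-loops

    # Indices of the k nearest neighbors of each point, excluding itself.
    d = sq_dists.copy()
    np.fill_diagonal(d, np.inf)
    nn = np.argsort(d, axis=1)[:, :k]

    mask = np.zeros(W.shape, dtype=bool)
    rows = np.repeat(np.arange(X.shape[0]), nn.shape[1])
    mask[rows, nn.ravel()] = True
    mask |= mask.T  # keep an edge if either endpoint selects the other
    return W * mask


def harmonic_labels(W, labeled_idx, y_labeled, n_classes):
    """Harmonic solution f_u = (D_uu - W_uu)^{-1} W_ul f_l."""
    n = W.shape[0]
    labeled_idx = np.asarray(labeled_idx)
    unlabeled_idx = np.setdiff1d(np.arange(n), labeled_idx)

    f_l = np.eye(n_classes)[y_labeled]   # one-hot labels, one row per labeled point
    L = np.diag(W.sum(axis=1)) - W       # graph Laplacian D - W
    L_uu = L[np.ix_(unlabeled_idx, unlabeled_idx)]
    W_ul = W[np.ix_(unlabeled_idx, labeled_idx)]

    f_u = np.linalg.solve(L_uu, W_ul @ f_l)
    return unlabeled_idx, f_u.argmax(axis=1)


# Toy data (hypothetical): two 1-D clusters with one labeled point each.
X = np.array([[0.0], [0.1], [0.2], [0.3], [1.0], [1.1], [1.2], [1.3]])
W = hybrid_rbf_knn_weights(X, sigma=0.5, k=2)
unlabeled_idx, preds = harmonic_labels(W, labeled_idx=[0, 7], y_labeled=[0, 1], n_classes=2)
print(unlabeled_idx, preds)
```

In this toy example the kNN mask removes every edge between the two clusters, so each unlabeled point takes the label of the single labeled point in its own cluster. Note that the linear system is only solvable when every connected component of the graph contains at least one labeled point; with a small `k` this is not guaranteed on real data.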