Semi-Supervised Learning With the EM Algorithm: A Comparative Study Between Unstructured and Structured Prediction

Abstract

Semi-supervised learning aims to learn prediction models from both labeled and unlabeled samples. There has been extensive research in this area. Among existing work, generative mixture models with Expectation-Maximization (EM) is a popular method due to clear statistical properties. However, existing literature on EM-based semi-supervised learning largely focuses on unstructured prediction, assuming that samples are independent and identically distributed. Studies on EM-based semi-supervised approach in structured prediction is limited. This article aims to fill the gap through a comparative study between unstructured and structured methods in EM-based semi-supervised learning. Specifically, we compare their theoretical properties and find that both methods can be considered as a generalization of self-training with soft class assignment of unlabeled samples, but the structured method additionally considers structural constraint in soft class assignment. We conducted a case study on real-world flood mapping datasets to compare the two methods. Results show that structured EM is more robust to class confusion caused by noise and obstacles in features in the context of the flood mapping application.

Publication
In IEEE Transactions on Knowledge and Data Engineering
Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.
Create your slides in Markdown - click the Slides button to check out the example.
Wenchong He
Wenchong He
Ph.D. Candidate in Computer Science

I am a Ph.D. candidate in Department of Computer & Information Science & Engineering at the University of Florida. My broad research areas are data science, machine learning and artificial intelligence. Specifically my research focuses on spatiotemporal data mining, knowledge-informed machine learning, trustworthy AI as well as interdisciplinary scientific applications in climate science, environmental monitoring and physics simulation. I am on the academic and industry job market for tenure-track faculty or research scientist position.