A Semi-Supervised Kernel Two-Sample Test

Jun 5, 2024 · Gyumin Lee · 1 min read
Abstract
In recent years, statistics and machine learning have seen significant advances in semi-supervised methodologies that leverage both labeled and unlabeled data. One notable area of focus within this field is statistical inference in the semi-supervised setting. The main goal in this domain is to use information derived from unlabeled data to improve statistical estimation and hypothesis testing. In particular, our interest lies in the two-sample test, which evaluates whether two samples originate from the same underlying distribution. In this paper, we extend existing kernel two-sample testing methods by introducing a novel testing framework, the ‘Semi-Supervised Kernel Two-Sample Test’. We propose a test statistic that makes use of both labeled and unlabeled data and prove that it is asymptotically normally distributed under certain conditions. Furthermore, we examine the consistency of its power and the conditions required for it, and analyze the efficiency of our statistic. We provide numerical analyses under different configurations of labeled and unlabeled data.
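For readers unfamiliar with kernel two-sample testing, the following is a minimal sketch of a standard fully supervised baseline: the unbiased squared maximum mean discrepancy (MMD) with a Gaussian kernel, calibrated by permutation. It is not the semi-supervised statistic proposed in the talk, which the abstract does not specify. The function names, the fixed bandwidth, and the permutation calibration are illustrative assumptions.

```python
import numpy as np

def gaussian_kernel(a, b, bandwidth=1.0):
    # Pairwise squared Euclidean distances between rows of a and rows of b.
    sq = (np.sum(a**2, axis=1)[:, None]
          + np.sum(b**2, axis=1)[None, :]
          - 2.0 * a @ b.T)
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth**2))

def mmd2_unbiased(x, y, bandwidth=1.0):
    # Unbiased estimate of squared MMD; within-sample diagonal terms are excluded.
    m, n = len(x), len(y)
    kxx = gaussian_kernel(x, x, bandwidth)
    kyy = gaussian_kernel(y, y, bandwidth)
    kxy = gaussian_kernel(x, y, bandwidth)
    term_xx = (kxx.sum() - np.trace(kxx)) / (m * (m - 1))
    term_yy = (kyy.sum() - np.trace(kyy)) / (n * (n - 1))
    return term_xx + term_yy - 2.0 * kxy.mean()

def permutation_test(x, y, bandwidth=1.0, num_permutations=200, seed=0):
    # Assumption: calibrate by permuting the pooled sample, which is valid
    # under the null hypothesis that both samples share one distribution.
    rng = np.random.default_rng(seed)
    observed = mmd2_unbiased(x, y, bandwidth)
    pooled = np.vstack([x, y])
    m = len(x)
    count = 0
    for _ in range(num_permutations):
        perm = rng.permutation(len(pooled))
        stat = mmd2_unbiased(pooled[perm[:m]], pooled[perm[m:]], bandwidth)
        count += int(stat >= observed)
    # Add-one correction keeps the p-value strictly positive.
    p_value = (count + 1) / (num_permutations + 1)
    return observed, p_value

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    x = rng.normal(0.0, 1.0, size=(50, 2))
    y = rng.normal(2.0, 1.0, size=(50, 2))  # mean-shifted alternative
    stat, p = permutation_test(x, y)
    print(f"MMD^2 = {stat:.4f}, p-value = {p:.4f}")
```

A small p-value suggests that the two samples come from different distributions. In practice the bandwidth is often chosen from the data, for example by the median heuristic, rather than fixed at 1.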
Event
Location

Yonsei University

50, Yonsei-ro, Seodaemun-gu, Seoul 03722
