Open Classification and Change Detection in the Similarity Space

Bookmark or cite this item: http://hdl.handle.net/10027/21802

Files in this item

File: FEI-DISSERTATION-2017.pdf (1MB), Restricted Access
Description: (no description provided)
Format: PDF
Title: Open Classification and Change Detection in the Similarity Space
Author(s): Fei, Geli
Advisor(s): Liu, Bing
Contributor(s): Di Eugenio, Barbara; Gmytrasiewicz, Piotr; Yu, Philip S; Mahmud, Jalal; Liu, Bing
Department / Program: Computer Science
Degree Granting Institution: University of Illinois at Chicago
Degree: PhD, Doctor of Philosophy
Genre: Doctoral
Subject(s): Open classification; Covariate shift; Cumulative learning; Spam detection; Change detection
Abstract: The rapid emergence of new topics and the highly diverse nature of online text data pose new challenges for existing text classification techniques. One of the main challenges is their inability to handle unseen classes of documents, a consequence of the closed-world assumption, under which all test classes are assumed to be known at training time. A more realistic scenario is to expect unseen classes during testing (open world). This problem is called open (world) classification. In this thesis, we begin by studying three research problems closely related to open classification. First, we study the problem of text classification under negative covariate shift. We then study the general problem of open (world) classification. Furthermore, we propose cumulative machine learning, in which unseen classes of documents are not only detected but also incorporated into the existing system efficiently. One of the key techniques used in this research is the transformation of documents into a similarity space to detect a special type of change in the test class distribution, namely the arrival of unseen classes. In the last part of this thesis, we explore the use of similarity-based approaches to detect a new type of change in social media accounts. In particular, we study the problem of detecting online review accounts that have changed hands. Extensive experiments show that the proposed approaches are highly effective.
Issue Date: 2017-03-20
Type: Thesis
URI: http://hdl.handle.net/10027/21802
Date Available in INDIGO: 2017-10-27
Date Deposited: May 2017
 
