
Seminars

Text Mining by Using Probabilistic Logical Patterns

  • 2008-06-30 (Mon.), 10:00 AM
  • Auditorium, 2F, Tsai Yuan-Pei Memorial Hall
  • Professor Jung Jin Lee
  • Department of Statistics, Soong Sil University, Seoul, Korea

Abstract

We propose two methods for classifying binary observations, both applicable to information retrieval. First, we discuss how to discover minimal sets of features that are necessary to explain all observations, and how to detect hidden logical patterns in the data that can distinguish between two groups. Combinations of such patterns are then used to develop a classification procedure. Second, we present a classification model that uses probabilistic logical patterns and a maximum entropy distribution. We discuss classification experiments based on simulation and on the TREC collection.
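As a rough illustration of what a logical pattern over binary observations can look like, the sketch below enumerates short conjunctions of feature values that match at least one observation in one group and none in the other, then classifies a new observation by a simple vote. This is not the speaker's method: the function names (`find_patterns`, `matches`, `classify`), the exhaustive search, the `max_degree` limit, and the voting rule are all assumptions made for illustration, and the probabilistic and maximum entropy parts of the talk are not covered.

```python
from itertools import combinations, product


def matches(pattern, x):
    """True if observation x has the required value at every index in pattern."""
    return all(x[i] == v for i, v in pattern)


def find_patterns(group, other, max_degree=2):
    """Enumerate conjunctions of at most max_degree (index, value) literals that
    match at least one observation in `group` and no observation in `other`.
    Brute-force search; only meant for tiny illustrative examples."""
    n_features = len(group[0])
    patterns = []
    for degree in range(1, max_degree + 1):
        for idxs in combinations(range(n_features), degree):
            for values in product((0, 1), repeat=degree):
                pattern = tuple(zip(idxs, values))
                covers_group = any(matches(pattern, x) for x in group)
                covers_other = any(matches(pattern, x) for x in other)
                if covers_group and not covers_other:
                    patterns.append(pattern)
    return patterns


def classify(x, pos_patterns, neg_patterns):
    """Hypothetical voting rule: count matching patterns from each side."""
    score = sum(matches(p, x) for p in pos_patterns) - sum(
        matches(p, x) for p in neg_patterns
    )
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "undecided"


# Toy data (made up): feature 0 alone separates the two groups.
positives = [(1, 1, 0), (1, 0, 1)]
negatives = [(0, 1, 0), (0, 0, 1)]

pos_patterns = find_patterns(positives, negatives, max_degree=1)
neg_patterns = find_patterns(negatives, positives, max_degree=1)
print(pos_patterns, neg_patterns)
print(classify((1, 1, 1), pos_patterns, neg_patterns))
```

In this toy data the only degree-1 patterns are "feature 0 equals 1" for the first group and "feature 0 equals 0" for the second, so the vote reduces to reading that single feature.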
