:::

Seminar Announcement

:::

Statistical Physics Approach to Information Categorization of Symbolic Sequences

Abstract

We propose a systematic approach to categorizing the information carried by symbolic sequences based on their usage of repetitive patterns. We introduce a simple formula that quantifies the "dis-similarity" between two symbolic sequences. This dis-similarity index is closely related to the Shannon entropy and the rank order of the repetitive patterns. Its physical meaning can be readily understood by applying fundamental concepts of statistical physics to dynamical systems. Finally, to illustrate that this generic approach applies to a wide range of real-world problems, we apply the algorithm to literary texts, DNA sequences, and biological time series.
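
The abstract does not state the formula itself, so the following is only a minimal sketch of one plausible form of such an index, not the speaker's definition. It assumes that the "repetitive patterns" are overlapping substrings of a fixed length `m`, that only patterns occurring in both sequences are compared, and that each pattern's rank difference is weighted by its contribution to the Shannon entropy of the two sequences. The names `pattern_counts`, `dissimilarity`, and `m` are hypothetical.

```python
from collections import Counter
from math import log


def pattern_counts(seq, m):
    """Count overlapping length-m patterns in seq (assumed pattern definition)."""
    return Counter(seq[i:i + m] for i in range(len(seq) - m + 1))


def dissimilarity(seq_a, seq_b, m=2):
    """Entropy-weighted rank-order distance between two sequences (illustrative)."""
    counts_a, counts_b = pattern_counts(seq_a, m), pattern_counts(seq_b, m)
    # Assumption: compare only the patterns that both sequences use.
    shared = sorted(set(counts_a) & set(counts_b))
    if len(shared) < 2:
        raise ValueError("need at least two shared patterns to compare ranks")

    def ranks(counts):
        # Rank 1 = most frequent shared pattern; ties broken alphabetically.
        ordered = sorted(shared, key=lambda w: (-counts[w], w))
        return {w: r for r, w in enumerate(ordered, start=1)}

    def probs(counts):
        total = sum(counts.values())
        return {w: counts[w] / total for w in shared}

    rank_a, rank_b = ranks(counts_a), ranks(counts_b)
    p_a, p_b = probs(counts_a), probs(counts_b)

    # Weight of a pattern: its Shannon-entropy contribution in both sequences.
    weight = {w: -p_a[w] * log(p_a[w]) - p_b[w] * log(p_b[w]) for w in shared}
    norm = sum(weight[w] for w in shared)

    weighted = sum(abs(rank_a[w] - rank_b[w]) * weight[w] for w in shared)
    # Dividing by the largest possible rank difference keeps the index in [0, 1].
    return weighted / norm / (len(shared) - 1)
```

Under these assumptions, two sequences that rank their shared patterns identically score 0, and the index grows as the rank orders diverge. For example, `dissimilarity("aabaabaab", "abbabbabb", m=1)` compares two sequences that use the symbols `a` and `b` with opposite frequency ranks.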
