BayesSurprise

In this work we employ Bayesian surprise to detect interesting/anomalous patterns from discrete sequence data. Many domains consist of discrete sequential time-series such as DNA analysis, online transactions, web click-stream navigation, cyber-attacks, financial transactions and especially sociology life-course data. The difficulty is that each data set has its own unique characteristics and many anomalies defy categorization. Since anomalies are by nature infrequent and elusive, we often do not have enough data for a supervised approach. However, novelty and surprise play a fundamental role in human and animal behavior for survival, attention and adaptation. We use regular expressions to collect the longest repeating sequences and define these as motifs (which may or may not represent novel patterns). The sequences are now composed of simpler motifs which are used to build Probabilistic Suffix Trees (PST) which can capture complex relationships based on motif location and frequency of occurrence. New data that deviates from established motifs either in location of appearance, frequency of appearance, or motif composition may represent recurring patterns that may be different in some way. Bayesian surprise is the result of mismatches between our expectations and actual results, hence the degree of surprise or anomalousness attached to a pattern will vary with respect to these differences. The implication of obtaining large surprise values identifies those patterns likely to be useful and interesting to the user.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
Data		Data
R code		R code
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BayesSurprise

About

Releases

Packages

Languages

kenmcgarry/BayesSurprise

Folders and files

Latest commit

History

Repository files navigation

BayesSurprise

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages