GitHub - CaseOLAP/caseolap-pipelines: Demo of pipelines in CaseOLAP platform and sample data. These pipelines consists of downloading, parsing, indexing of data, text-cube creation and search of entities.

CaseOLAP Workflow

The Context-aware Semantic Online Analytical Processing pipeline (CaseOLAP), was developed in 2016 by Fangbo Tao et.al. CaseOLAP successfully quantifies user-defined phrase-category relationships through analysis of textual data.

CaseOLAP workflow presents data preprocessing (i.e., downloading and parsing text documents), indexing and searching with Elasticsearch, creating a functional document structure called Text-Cube and quantifying phrase-category relationships using the core CaseOLAP algorithm.

Data preprocessing generates key-value pairs for all documents involved. As an example, a key may refer to the document PMID, while a value may refer to different document metadata. Preprocessed data is rearranged by indexing and searching for an entity count, which further facilitates the CaseOLAP score calculation. Obtained raw CaseOLAP results can be taken to integrative analysis including dimensionality reduction, clustering, temporal and geographical analysis, as well as the creation of a graphical database which enables semantic mapping of the documents .

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
data		data
notebooks		notebooks
pipelines		pipelines
README.md		README.md
_config.yml		_config.yml
environment.yaml		environment.yaml
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CaseOLAP Workflow

Pipelines

Data Samples

About

Releases

Packages

Languages

CaseOLAP/caseolap-pipelines

Folders and files

Latest commit

History

Repository files navigation

CaseOLAP Workflow

Pipelines

Data Samples

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages