TaintClassify

SeCloud project on classifying node.js sinks and sources. Based on OWASP list of JavaScript vulnerabilities. Inspired by the paper by Rasthofer et al. A Machine-learning Approach for Classifying and Categorizing Android Sources and Sinks

App for classifying can be found in the secloudapp folder.

JSON data format

Data is extracted from the multiple files downloaded from node.js and located in json folder.

Currently only the 'textRaw' and 'params' are taken into account. Those are aggregated in data.json.

Format is as follows:

{
    "cl": 0,
    "params": [
        "value",
        "message"
    ],
    "textRaw": "assert(value[, message])"
}

Param "cl" refers to the class. There are three classes in this dataset:

    neither:    0
    source:     1
    sink:       2

For unknown class:

cl: -1

The python file that handles parsing is processJSON.py

Features

For handcrafted features to be used as input look at helperJSON.py

Currently features are binary(is a feature present) and extracted from method names. Features are based on OWASP list of JavaScript vulnerabilities e.g. get usually is a source of information. There are 15 such features extracted.

Issues

Dataset is small with 265 hand annotated examples.
Hand crafted features do not cover all possible cases of a source or a sink in Node.js hence some valuable info for classification is missing.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
secloudapp		secloudapp
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
classify.py		classify.py
svm.py		svm.py
svm_model.plk		svm_model.plk

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TaintClassify

App for classifying can be found in the secloudapp folder.

JSON data format

Features

Issues

About

Releases

Packages

Languages

License

kefth/secloud-taint

Folders and files

Latest commit

History

Repository files navigation

TaintClassify

App for classifying can be found in the secloudapp folder.

JSON data format

Features

Issues

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages