Skip to content
This repository has been archived by the owner on May 14, 2020. It is now read-only.
/ graphify Public archive

Graphify is a Neo4j unmanaged extension used for document and text classification using graph-based hierarchical pattern recognition.

License

Notifications You must be signed in to change notification settings

kbastani/graphify

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Graphify

This is a Neo4j unmanaged extension used for document and text classification.

Training Dataset

Natural Language Parsing Model

Classify Unlabeled Documents

Compiled extension

The compiled extension is available from the bin directory.

Building the extension

  1. To build it:

     src/extension mvn assembly:assembly -DdescriptorId=jar-with-dependencies
    
  2. Copy src/extension/target/graphify-1.0.0-jar-with-dependencies.jar to the plugins/ directory of your Neo4j server.

  3. Configure Neo4j by adding a line to conf/neo4j-server.properties:

     org.neo4j.server.thirdparty_jaxrs_classes=org.neo4j.nlp.ext=/service
    
  4. Start Neo4j server.

  5. Query it over HTTP.

Using it

Get similar labels:

curl --user neo4j:neo4j http://localhost:7474/service/graphify/similar/{label}

Train the natural language recognition model on text about 'Document classification':

curl --user neo4j:neo4j -H "Content-Type: application/json" -d '{"label": ["Document classification"], "text": ["Documents may be classified according to their subjects or according to other attributes (such as document type, author, printing year etc.). In the rest of this article only subject classification is considered. There are two main philosophies of subject classification of documents: The content based approach and the request based approach."]}' http://localhost:7474/service/graphify/training

Classify an unlabeled text:

curl --user neo4j:neo4j -H "Content-Type: application/json" -d '{"text": "A document is a written or drawn representation of thoughts. Originating from the Latin Documentum meaning lesson - the verb means to teach, and is pronounced similarly, in the past it was usually used as a term for a written proof used as evidence."}' http://localhost:7474/service/graphify/classify

Get a list of the extracted semantic features matching a text:

curl --user neo4j:neo4j -H "Content-Type: application/json" -d '{"text": "A document is a written or drawn representation of thoughts. Originating from the Latin Documentum meaning lesson - the verb means to teach, and is pronounced similarly, in the past it was usually used as a term for a written proof used as evidence."}' http://localhost:7474/service/graphify/extractfeatures

Get a sorted list of labels that are most related to the label 'Document classification':

curl --user neo4j:neo4j http://localhost:7474/service/graphify/similar/Document%20classification
Example response:
{
    "classes": [
        {
            "class": "Document",
            "similarity": 0.19563160874988336
        },
        {
            "class": "Intelligence",
            "similarity": 0.1778887274627789
        },
        {
            "class": "Machine learning",
            "similarity": 0.14800216450227222
        },
        {
            "class": "Data",
            "similarity": 0.1467923282078174
        },
        {
            "class": "Memory",
            "similarity": 0.14600346713601134
        }
    ]
}

About

Graphify is a Neo4j unmanaged extension used for document and text classification using graph-based hierarchical pattern recognition.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •