Skip to content

IOC (Indicator of Compromise) Extractor: a program to help extract IOCs from text files.

Notifications You must be signed in to change notification settings

williamgibb/IOCextractor

 
 

Repository files navigation

IOCextractor

IOC (Indicator of Compromise) Extractor: a program to help extract IOCs from text files. The general goal is to speed up the process of parsing structured data (IOCs) from unstructured or semi-structured data (like case reports, security bulletins).

Compatibility and Requirements

  • Compatible and tested Python versions: 2.7
  • Compatible and tested operating systems: Windows 7, Mac OS 10.8.2, Ubuntu 12.10
  • IOCextractor requires TkInter (http://wiki.python.org/moin/TkInter).

The program is written in Python 2.7, and a binary version for Windows is provided (IOCextractor.zip).

Usage

This program helps extract indicators of compromise from a plain text file. It currently identifies MD5 hashes, IPv4 addresses, domains, URLs, and email addresses. First, when a user opens a file, the program identifies potential IOCs using regular expressions (ignoring a few obvious false positives, like IP addresses that start with 10). It tags and highlights the potential IOCs for a user to review.

A user can remove a tag by selecting its range of text and then either clicking the "Clear" button or right-clicking the selected text (command-click instead in Mac OS). It's also possible to remove all the tags from a large range of text, like a list of victim IP addresses, by selecting the whole range and clicking "Clear" or right-clicking. A user can add a tag by selecting a range of text and then clicking the corresponding button, for example "MD5." For any range of text that is either rejected or added, the program will search through the rest of the text to apply the same change everywhere. So if a user removes a tag from a victim IP address, the program will un-tag that IP address everywhere; it works the same for tagging a new IP address.

After a user has reviewed the tagging for accuracy, the program will export a list of unique tagged IOCs. It currently either exports to the console, saves a CSV file, or exports a CybOX Observables XML document. It is also set up so anyone could easily add another output format for a specific application.

A simple demonstration case report (DemonstrationCaseReport.txt) and a testing file (TestDocument.txt) are also provided.

Credits

About

IOC (Indicator of Compromise) Extractor: a program to help extract IOCs from text files.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%