For analyzing InfluxDB input data

As InfluxDB is a schema-on-write database, it is important to understand and optimize the format and shape of data that you are writing to it.

By analyzing your data before it is written to InfluxDB, you can optimize and future proof your schema.

Requires:

Python 3.7+
Matplotlib

For DataFrame functionality:

Pandas
Numpy

Usage:

IPython Notebook:

Import the Plotter() class: from data_shape import Plotter

Initialize an instance of Plotter with a block of text (will support files later): plotter = Plotter(text)

Terminal:

Clone this repo.

In a directory you want to work in--and if you want to run in a VM--run: pipenv install --ignore-pipfile

Web UI (roadmap?):

To do:

Add support for measurement, tag, field sizes (as opposed to counts) and create statistic description of that as well--add dict of this to description dict / row to Dataframe
Cleanup bar plot
Histogram support (needed?)

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
data_shape_objs.py		data_shape_objs.py
kafka_metric_shape.txt		kafka_metric_shape.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

For analyzing InfluxDB input data

Requires:

Usage:

IPython Notebook:

Terminal:

Web UI (roadmap?):

To do:

About

Releases

Packages

Languages

samhld/Influx_data_shape_analyzer

Folders and files

Latest commit

History

Repository files navigation

For analyzing InfluxDB input data

Requires:

Usage:

IPython Notebook:

Terminal:

Web UI (roadmap?):

To do:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages