Skip to content

Latest commit

 

History

History
102 lines (70 loc) · 2.33 KB

README.md

File metadata and controls

102 lines (70 loc) · 2.33 KB

3DT velocity

DC/OS Distributed Diagnostics Tool & Aggregation Service

3DT is a monitoring agent which exposes a HTTP API for querying from the /system/health/v1 DC/OS api. 3DT puller collects the data from 3dt agents and represents individual node health for things like system resources as well as DC/OS-specific services.

Build

go get github.com/dcos/3dt
cd $GOPATH/src/github.com/dcos/3dt
make install
./3dt -version

Run

Run 3DT once, on a DC/OS host to check systemd units:

3dt -diag

Get verbose log output:

3dt -diag -verbose

Run the 3DT aggregation service to query all cluster hosts for health state:

3dt -pull

Start the 3DT health API endpoint:

3dt

3DT CLI Arguments

-agent-port int
    Use TCP port to connect to agents. (default 1050)

-ca-cert string
    Use certificate authority.

-command-exec-timeout int
    Set command executing timeout (default 120)

-diag
    Get diagnostics output once on the CLI. Does not expose API.

-diagnostics-bundle-dir string
    Set a path to store diagnostic bundles (default "/var/run/dcos/3dt/diagnostic_bundles")

-diagnostics-job-timeout int
    Set a global diagnostics job timeout (default 720)

-diagnostics-units-since string
    Collect systemd units logs since (default "24 hours ago")

-diagnostics-url-timeout int
    Set a local timeout for every single GET request to a log endpoint (default 2)

-endpoint-config string
    Use endpoints_config.json (default "/opt/mesosphere/endpoints_config.json")

-exhibitor-ip string
    Use Exhibitor IP address to discover master nodes. (default "http://127.0.0.1:8181/exhibitor/v1/cluster/status")

-force-tls
    Use HTTPS to do all requests.

-health-update-interval int
    Set update health interval in seconds. (default 60)

-master-port int
    Use TCP port to connect to masters. (default 1050)

-port int
    Web server TCP port. (default 1050)

-pull
    Try to pull checks from DC/OS hosts.

-pull-interval int
    Set pull interval in seconds. (default 60)

-pull-timeout int
    Set pull timeout. (default 3)

-verbose
    Use verbose debug output.

-version
    Print version.

Testing

  • Test Changes