This is a light-weight system monitoring tool for cluster machines.
- Clone this repository to your
$HOME/.lmon
directory. - Configure/update the
machinefile
located at$HOME/.lmon/config/machinefile
. You will have to add new machine IPs in a new line. - Setup a cron job (Optional) to collect data from all of the machines listed in the machinefile. You may run the command
$ crontab -l
to see the current cron jobs, and append the contents of the$HOME/.lmon/data-collection/main.cron
. Or if you donont have any of your personal cron jobs setup, you may setup the job directly by running the command$ crontab $HOME/.lmon/data-collection/main.cron
.
- Install dependencies by running the command
$ sudo apt install openssh-server sysstat
- Check if the sadc is allowed to collect system activity information.
- change the
ENABLED
variable in/etc/default/sysstat
file to ”true
” - Restart sysstat:
$ sudo service sysstat restart
- change the
- Enable remote
ssh
logins without password by appending the monitoring server’s ssh-public key to the~/.ssh/authorized_keys
file in the cluster machine.