Python tool for capturing and logging human-computer interactions. Generate rich datasets for training multi-modal LLMs in autonomous computer control. Features screenshot, mouse, keyboard, and audio recording.
nlp
machine-learning
automation
computer-vision
screen-capture
audio-recording
dataset-generation
human-computer-interaction
computer-interaction
ai-training
ai-dataset
autonomous-control
multi-modal-llm
input-logging
-
Updated
Sep 16, 2024 - Python