C6: Video Analysis - Group 6

Cristian Gutiérrez
Iñaki Lacunza
Carlos Boned
Marco Cordón

Final slides:

Part 1: Link to Google Slides part 1
Part 2: Link to Google Slides part 2

How to Run

Install dependencies via a requirements.txt file.

git clone https://github.com/mcv-m6-video/mcv-c6-2024-team6.git
cd mcv-c6-2024-team6/
python3 -m pip install -r requirements.txt

Please, download the data from UAB virtual campus AICity_data and move it to the current repo.

mv /path/to/your/AICity_data/ .
cd WX/taskX_X/
python3 main.py

Week 1: Background estimation

This first week was focused on background estimation to be able to segment the moving objects. Thorough this lab we will work with the AICityData dataset.

Task 1.1: Fixed Gaussian Estimation
Task 1.2: Evaluation mAP
Task 2.1: Adaptative Modelling
Task 2.2: Comparison between fixed and adaptative
Task 3: Comparison with SOTA models (CNT, LSBP, GMG, MOG, ...)
Task 4: Color sequences

Some example results

Fixed Gaussian Modeling with an alpha of 2 for the first 60 frames of the sequence.

Week 2: Object detection and Tracking

This second week we had to implement and evaluate different SOTA models for object detection and tracking algorithms. Our annotated sequence can be found at /W2/part1/task_1_2/annotations.xml.

Task 1: Object Detection
- Task 1.1: Off-the-shelf
- Task 1.2: Annotation
- Task 1.3: Fine-tune to our annotated sequence
- Task 1.4: K-Fold Cross-validation
Task 2: Object tracking
- Task 2.1: Overlapping method
- Task 2.2: Kalman filtering
- Task 2.3: TrackEval Metrics
Task 3: (OPTIONAL) CVPR 2021 AI City Challenge

Some example results

Tracking by Overlap:

Kalman filtering:

CVPR 2021 AI City Challenge

Week 3: Optical Flow

On this week the main goal has been to estimate the optical flow of a video sequence and try to improve an object tracking algorithm using the optical flow.

Task 1: Optical Flow.
- Estimate the Optical Flow with block matching.
- Estimate the Optical Flow with off-the-shelf method.
- Improve the object tracking algorithm with Optical Flow.
Task 2: Multi-Target Single-Camera tracking (MTSC).
- Evaluate our best tracking algorithm in different SEQs of AI City Challenge.
- Evaluate the tracking using IDF1 and HOTA scores.

Some example results

Task 1.3

Task 2

Week 4: Speed estimation and Multi-Camera Tracking

This final week consisted in two separate tasks, first one was to estimate the velocity, and the second one was to perform Multi-Camera Tracking (MCT).

Task 1: Speed Estimation
- Task 1.1: Rudimentary approach
- Task 1.2: Modern approach
Task 2: Multi-Camera Tracking (MCT)

Some example results

Task 1.2 Moder Approach Animation from the log files

Week 5: Action Recognition

During this week we started with a new part of the project, which belongs to the University of Barcelona. We have worked with X3D-X6 model (the smallest one of the X3D family) in the HMDB51 Dataset.

We started working an improving the baseline training method given by the teacher. Then we added Multi-View Inference, first adding analysis by temporal windows and afterwards combining it with different spatial crops. Finally, we implemented the multi-view training strategy of TSN, along with some custom improvements.

Some images

Task 3: Temporal windows and different spatial crops

Task 4: TSN implementation and custom improvement

Week 6: Implementing alternative models

During this week we had to change the previous week's model architectures to further improve the results. We tried very different implementations so to see which was the best working one.

Afterwards, we had to analyze the importance of temporal dynamics, proving if temporal information was necessary or not. Our work in this task was divided into two braches: shuffling the clips in order to look at the change in performance, and, on the other hand, using 2D nets to analyze each frame individually.

Bubble plot of the tried architectures in the first task:

Week 7: Multimodality

The work of the final week was divided in two tasks: First we have to measure the performance of different modalities such as Optical flow, RGB difference and Skeleton extraction on their own. Afterwards, the second task consisted on mixing RGB information and alternative information. In this second task we analyzed different fussion methods: early fussion and late fussion.

Name		Name	Last commit message	Last commit date
Latest commit History 141 Commits
W1		W1
W2		W2
W3		W3
W4		W4
W5		W5
W6		W6
W7		W7
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

C6: Video Analysis - Group 6

How to Run

Week 1: Background estimation

Some example results

Week 2: Object detection and Tracking

Some example results

Tracking by Overlap:

Kalman filtering:

CVPR 2021 AI City Challenge

Week 3: Optical Flow

Some example results

Task 1.3

Task 2

Week 4: Speed estimation and Multi-Camera Tracking

Some example results

Task 1.2 Moder Approach Animation from the log files

Week 5: Action Recognition

Some images

Task 3: Temporal windows and different spatial crops

Task 4: TSN implementation and custom improvement

Week 6: Implementing alternative models

Bubble plot of the tried architectures in the first task:

Week 7: Multimodality

Conclusions slide

About

Releases

Packages

Contributors 4

Languages

mcv-m6-video/mcv-c6-2024-team6

Folders and files

Latest commit

History

Repository files navigation

C6: Video Analysis - Group 6

How to Run

Week 1: Background estimation

Some example results

Week 2: Object detection and Tracking

Some example results

Tracking by Overlap:

Kalman filtering:

CVPR 2021 AI City Challenge

Week 3: Optical Flow

Some example results

Task 1.3

Task 2

Week 4: Speed estimation and Multi-Camera Tracking

Some example results

Task 1.2 Moder Approach Animation from the log files

Week 5: Action Recognition

Some images

Task 3: Temporal windows and different spatial crops

Task 4: TSN implementation and custom improvement

Week 6: Implementing alternative models

Bubble plot of the tried architectures in the first task:

Week 7: Multimodality

Conclusions slide

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages