feat(server): separate face search relation #10371

mertalev · 2024-06-15T22:35:38Z

Description

This PR addresses a subtle issue with the current facial recognition. Each time a face is assigned or reassigned a person, for both the initial and later facial recognition runs, an additional duplicate embedding is inserted into the face vector index. This can lead to index degradation as the majority of the index is duplicated, in turn leading to faces sometimes not being recognized when they should.

This PR changes the embedding to be in a separate table as a one-one relation, similar to how smart search is handled. This means changes to the face, such as which person it's assigned to, have no effect on the index. It has a smaller but notable benefit of making these changes faster and producing less WAL.

A notable benefit of this change is also that it makes supporting manually added faces easier as an embedding is no longer required.

Also sets storage to external so Postgres doesn't try to compress the embeddings, following the finding here.

Fixes #10277

How Has This Been Tested?

Tested that the migration is successful without loss of data and that both face detection and facial recognition jobs continue to work.

The number of unassigned faces decreased by 38% after running with this change
Facial recognition is twice as fast as before
Face detection is 20% faster (I'm not really sure why)
SELECT idx_tuples FROM pg_vector_index_stat WHERE indexname = 'face_index'; is identical to the number of faces, i.e. no duplicate embeddings

jrasm91

Can we also mark the clip embeddings as external?

server/src/entities/face-search.entity.ts

mertalev · 2024-06-16T15:23:19Z

Can we also mark the clip embeddings as external?

Yep, this makes the clip embeddings external too.

jrasm91 · 2024-06-17T03:02:08Z

server/src/migrations/1718486162779-AddFaceSearchRelation.ts

+
+    await queryRunner.query(`ALTER TABLE asset_faces ADD COLUMN "embedding" vector(512)`);
+    await queryRunner.query(`ALTER TABLE face_search ALTER COLUMN embedding SET STORAGE DEFAULT`);
+    await queryRunner.query(`ALTER TABLE smart_search ALTER COLUMN embedding SET STORAGE DEFAULT`);


Ah, I missed this line. Nice!

mertalev requested a review from danieldietzler as a code owner June 15, 2024 22:35

mertalev added the 🗄️server label Jun 15, 2024

jrasm91 approved these changes Jun 16, 2024

View reviewed changes

server/src/entities/face-search.entity.ts Show resolved Hide resolved

mertalev added 7 commits June 16, 2024 15:19

wip

ea04464

various fixes

3b18f4b

new migration

760750e

fix test

8d342d5

add face search entity, update sql

9a7ebfe

update e2e

1a2cfb5

set storage to external

2adf3cf

mertalev force-pushed the chore/separate-face-search-relation branch from 1540f20 to 2adf3cf Compare June 16, 2024 19:20

mertalev enabled auto-merge (squash) June 16, 2024 19:21

mertalev merged commit 6b1b505 into main Jun 16, 2024
22 checks passed

mertalev deleted the chore/separate-face-search-relation branch June 16, 2024 19:25

jrasm91 reviewed Jun 17, 2024

View reviewed changes

mertalev mentioned this pull request Jun 17, 2024

feat(server): Import face regions from metadata #6455

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(server): separate face search relation #10371

feat(server): separate face search relation #10371

mertalev commented Jun 15, 2024 •

edited

Loading

jrasm91 left a comment

mertalev commented Jun 16, 2024

jrasm91 Jun 17, 2024

feat(server): separate face search relation #10371

feat(server): separate face search relation #10371

Conversation

mertalev commented Jun 15, 2024 • edited Loading

Description

How Has This Been Tested?

jrasm91 left a comment

Choose a reason for hiding this comment

mertalev commented Jun 16, 2024

jrasm91 Jun 17, 2024

Choose a reason for hiding this comment

mertalev commented Jun 15, 2024 •

edited

Loading