
Extend ContrastiveOutput to support sequential encoders #1086

Merged: 2 commits into main on May 12, 2023

Conversation

@sararb (Contributor) commented May 9, 2023

Goals ⚽

This PR adds support for negative sampling to the ContrastiveOutput class for session-based models where the query encoder returns a 3-D ragged tensor.

Implementation Details 🚧

  • Flatten the values of the query embeddings to match the 2-D sampled negative embeddings.
  • Reconstruct the ragged representation using the mask information of the input query.
  • Apply the same transformation to the positive candidates (in case they are sequential).
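The three steps above can be sketched framework-agnostically in NumPy (in the actual PR the tensors are TensorFlow ragged tensors; the shapes, names, and mask below are hypothetical):

```python
import numpy as np

# Hypothetical shapes: batch of 2 sessions, max length 3, embedding dim 4.
rng = np.random.default_rng(0)
query = rng.normal(size=(2, 3, 4))           # padded 3-D query embeddings
mask = np.array([[True, True, False],        # session 1 has 2 valid steps
                 [True, True, True]])        # session 2 has 3 valid steps
negatives = rng.normal(size=(5, 4))          # 5 sampled negative embeddings

# Step 1: flatten the valid query positions to 2-D so they can be scored
# against the 2-D sampled negative embeddings.
flat_query = query[mask]                     # shape: (5, 4)
logits = flat_query @ negatives.T            # shape: (5, 5)

# Step 2: reconstruct the ragged representation from the input mask.
row_lengths = mask.sum(axis=1)               # [2, 3]
ragged_logits = np.split(logits, np.cumsum(row_lengths)[:-1])
```

Step 3 would apply the same flatten/reconstruct pair to the positive-candidate embeddings when they are sequential.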

Testing Details 🔍

  • Add a unit test that defines a transformer-based model as a retrieval model and trains it with sampled softmax.

Benchmark 🔍

I used the session-based script (implemented here) to benchmark sampled softmax in various configurations, similar to the study conducted in T4Rec (available here).

Command line
I use the first four days for training, and evaluation is computed on the fifth day. Here is the base command line with the hparams used:

python3 session_based.py --metrics_log_frequency 20 --train_path /models/examples/session_based_script/ecomrees_five_days/train --eval_path /models/examples/session_based_script/ecomrees_five_days/valid --schema_path /models/examples/session_based_script/ecomrees_five_days --task multi_class_classification --embedding_dim 448  --d_model 192 --n_layer 3 --n_head 16 --label_smoothing 0.0  --model_type xlnet --eval_batch_size 128 --train_batch_size 128 --epochs 5 --weight_tying --xlnet_attn_type bi --training_task masked --evaluation_task last --masking_probability 0.30000000000000004 --lr 0.0006667377132554976 --transformer_dropout 0.0  --log_to_wandb --transformer_activation gelu --feature_normalization --input_dropout 0.1 --optimizer adamw --weight_decay 3.910060265627374e-05 --save_topk_predictions --emb_init_std 0.11 --sampled_softmax --num_negatives 1000 --logq_correction

The hparams that are changed for the experiments are --sampled_softmax (enables sampled softmax if provided), --logq_correction, and --num_negatives (number of negative samples).
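A minimal NumPy sketch of what these flags control (the function, names, and values below are illustrative, not the PR's implementation): sampled softmax scores each query against the positive plus a set of sampled negatives, and the logQ correction subtracts the log of each negative's sampling probability so frequently sampled (popular) items are not over-penalized.

```python
import numpy as np

def sampled_softmax_logits(pos_scores, neg_scores, neg_probs, logq_correction=True):
    """Hypothetical sketch: build the logits for a sampled-softmax loss.

    pos_scores: (batch,) score of each query against its positive item.
    neg_scores: (batch, num_negatives) scores against sampled negatives.
    neg_probs:  (num_negatives,) sampling probability of each negative.
    """
    if logq_correction:
        # logQ correction: subtract the log sampling probability from each
        # negative's score.
        neg_scores = neg_scores - np.log(neg_probs)
    # Column 0 holds the positive, so the label for every row is 0.
    return np.concatenate([pos_scores[:, None], neg_scores], axis=1)

pos = np.array([2.0, 1.5])
neg = np.array([[0.5, 1.0],
                [0.2, 0.8]])
probs = np.array([0.1, 0.01])
logits = sampled_softmax_logits(pos, neg, probs)
```

With the correction enabled, a negative sampled with probability 0.1 has 2.3 (= -log 0.1) added to its raw score before the softmax, which is what `--logq_correction` toggles in the experiments.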

Results
The results can be seen in the following table. Average examples/sec represents the throughput, and Recall and NDCG are top-k accuracy metrics.

[Results table image: Average examples/sec, Recall, and NDCG for each configuration]

@sararb sararb added this to the Merlin 23.05 milestone May 9, 2023
@sararb sararb self-assigned this May 9, 2023
@github-actions bot commented May 9, 2023

Documentation preview

https://nvidia-merlin.github.io/models/review/pr-1086

@gabrielspmoreira (Member) left a comment


Sounds good to me

if is_ragged:
    logits = logits.copy_with_updates(
        outputs=original_query_embedding.with_flat_values(logits.outputs),
        targets=original_target.with_flat_values(logits.targets),
    )

This with_flat_values is very useful! Seems faster than rebuilding the full ragged tensor.
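The speedup makes sense: `tf.RaggedTensor.with_flat_values` reuses the existing row partition and only swaps in new flat values, instead of recomputing the partition from the mask. A NumPy analogy (names and shapes are illustrative, not the TensorFlow internals):

```python
import numpy as np

# A ragged tensor is essentially a pair (flat_values, row partition).
row_lengths = np.array([2, 3])                 # partition from the original ragged query
new_flat_values = np.arange(10).reshape(5, 2)  # e.g. freshly computed flat logits

def with_flat_values(row_lengths, flat_values):
    """Re-attach new flat values to an existing row partition,
    mimicking tf.RaggedTensor.with_flat_values."""
    return np.split(flat_values, np.cumsum(row_lengths)[:-1])

ragged = with_flat_values(row_lengths, new_flat_values)
```

Here only the split indices are reused; no per-row bookkeeping is rebuilt, which is the cheap part the reviewer is pointing at.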

@gabrielspmoreira gabrielspmoreira merged commit 5f82b55 into main May 12, 2023