
Passing labels to text_decoder to compute loss. #65

Merged · 1 commit merged into unum-cloud:main-dev on Feb 23, 2024

Conversation

kapulkin (Contributor)

I noticed that the `labels` variable is not passed to `text_decoder` in `VLMForCausalLM.forward()`, so `text_decoder` returns only logits and never computes the loss. This makes it impossible to use `VLMForCausalLM` with `transformers.Trainer` and forces you to write a custom training loop or wrap `VLMForCausalLM`.

This PR fixes that incompatibility.
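
For illustration, here is a minimal sketch of the mechanism, with a plain Hugging Face decoder (`gpt2`) standing in for the real model internals rather than the actual `VLMForCausalLM` code: once `labels` is forwarded, the wrapped decoder computes the cross-entropy loss itself and returns it in `outputs.loss`, which is what `transformers.Trainer` looks for.

```python
import torch
from transformers import AutoModelForCausalLM


class LabelsPassthroughSketch(torch.nn.Module):
    """Illustrative wrapper only; not the uform VLMForCausalLM implementation."""

    def __init__(self, decoder_name: str = "gpt2"):
        super().__init__()
        self.text_decoder = AutoModelForCausalLM.from_pretrained(decoder_name)

    def forward(self, input_ids, attention_mask=None, labels=None, **kwargs):
        # Forwarding `labels` lets the decoder compute the loss itself;
        # without it, only logits come back and Trainer has nothing to optimize.
        return self.text_decoder(
            input_ids=input_ids,
            attention_mask=attention_mask,
            labels=labels,
        )
```

With that in place, `model(input_ids, labels=input_ids).loss` is populated and `transformers.Trainer`'s default `compute_loss` works without a custom training loop.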

@kimihailv (Contributor)

Passing `labels` to the text decoder is not enough. The input embeddings contain not only text token embeddings but also image features, so the logits cover the image positions as well as the text positions.
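
One way to reconcile the two (a hedged sketch of the usual approach, not necessarily the merged change; the helper name and the image-features-first layout are assumptions) is to pad the labels with the cross-entropy `ignore_index` of `-100` at the image-feature positions, so only the text positions contribute to the loss:

```python
import torch

IGNORE_INDEX = -100  # default ignore_index of torch.nn.CrossEntropyLoss


def pad_labels_for_image_prefix(labels: torch.LongTensor, num_image_tokens: int) -> torch.LongTensor:
    """Prefix the text labels with -100 entries, one per image feature,
    so the image positions are skipped by the loss."""
    batch_size = labels.shape[0]
    image_pad = torch.full(
        (batch_size, num_image_tokens),
        IGNORE_INDEX,
        dtype=labels.dtype,
        device=labels.device,
    )
    return torch.cat([image_pad, labels], dim=1)
```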

@ashvardanian ashvardanian changed the base branch from main to main-dev February 23, 2024 18:10
@ashvardanian ashvardanian merged commit f445a8b into unum-cloud:main-dev Feb 23, 2024
ashvardanian pushed a commit that referenced this pull request Feb 23, 2024
## [1.1.1](v1.1.0...v1.1.1) (2024-02-23)

### Docs

* Performance observations for M2 CPUs (#56) ([8374ef6](8374ef6)), closes [#56](#56)

### Fix

* Passing labels to `text_decoder` to compute loss. (#65) ([f445a8b](f445a8b)), closes [#65](#65)

### Improve

* Larger batch benchmarks ([fdc8587](fdc8587))

### Make

* pre-commit config and linters (#62) ([0a3efac](0a3efac)), closes [#62](#62)
@ashvardanian (Contributor)

🎉 This PR is included in version 1.1.1 🎉

The release is available on the GitHub releases page.

Your semantic-release bot 📦🚀
