MLProject

This report presents the initial implementation of a simultaneous translation model using PyTorch, which incorporates a mixture of experts strategy and reinforcement learning principles. The model has been designed to optimize translation quality while minimizing latency.

Base Model: The project will utilize the Mixture-of-Experts Wait-k policy as the backbone model. This policy allows each head of the multi-head attention mechanism to perform translation with different levels of latency.
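A minimal sketch of the per-head idea described above, assuming each attention head is assigned its own k and that a head decoding target step t may attend to the first k + t source tokens (t counted from 0). The function names `wait_k_mask` and `multi_head_wait_k_mask` and the example values of k are hypothetical and are not taken from the project code.

```python
import torch


def wait_k_mask(k: int, src_len: int, tgt_len: int) -> torch.Tensor:
    """Boolean mask of shape (tgt_len, src_len); True marks a visible source position."""
    # Target step t (0-based) may read source positions 0 .. k + t - 1.
    steps = torch.arange(tgt_len).unsqueeze(1)      # (tgt_len, 1)
    positions = torch.arange(src_len).unsqueeze(0)  # (1, src_len)
    return positions < (steps + k)                  # broadcasts to (tgt_len, src_len)


def multi_head_wait_k_mask(ks, src_len: int, tgt_len: int) -> torch.Tensor:
    """Stack one wait-k mask per head: (num_heads, tgt_len, src_len)."""
    return torch.stack([wait_k_mask(k, src_len, tgt_len) for k in ks])


# Assumed example: four heads with latencies k = 1, 3, 5, 7.
masks = multi_head_wait_k_mask(ks=[1, 3, 5, 7], src_len=6, tgt_len=4)
print(masks.shape)
print(masks[0].int())
```

A mask like this would typically be applied to the cross-attention scores before the softmax, so that hidden source positions receive no attention weight.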

Reinforcement Learning Integration:

State: The state in the RL framework consists of the current source tokens processed and the target tokens generated.
Action: The action space corresponds to selecting a value of k (the number of source tokens to wait for before generating a translation).
Reward Function: The reward will combine the BLEU score (translation quality) with Average Lagging (latency), so the model is penalized for both high latency and low translation quality. We may also test the ROUGE score, as it can provide insight into the coverage and informativeness of the generated text.
Agent: A policy-based RL agent will be used to predict the optimal k value.
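The reward described above could be sketched as follows. This is an illustration under stated assumptions, not the project's implementation: Average Lagging follows the standard definition (the mean, over target steps up to the first one that has read the full source, of how far the reader is ahead of an ideal translator), the BLEU score is assumed to be supplied by an external scorer, and the linear combination with `latency_weight` is a hypothetical choice.

```python
def average_lagging(read_counts, src_len):
    """Average Lagging for one sentence.

    read_counts[t] is the number of source tokens read before emitting
    target token t + 1; src_len is the source length.
    """
    tgt_len = len(read_counts)
    gamma = tgt_len / src_len  # target-to-source length ratio
    # tau: first target step (1-based) at which the whole source has been read.
    tau = next(
        (t for t, read in enumerate(read_counts, start=1) if read >= src_len),
        tgt_len,
    )
    lag = sum(read_counts[t - 1] - (t - 1) / gamma for t in range(1, tau + 1))
    return lag / tau


def reward(bleu, lagging, latency_weight=0.5):
    """Assumed reward: quality minus a weighted latency penalty."""
    return bleu - latency_weight * lagging


# A wait-3 schedule on a 6-token source and 6-token target.
schedule = [3, 4, 5, 6, 6, 6]
al = average_lagging(schedule, src_len=6)
print(al)                         # 3.0
print(reward(bleu=25.0, lagging=al))  # 23.5
```

The weight trades quality against latency; a larger `latency_weight` would push the agent toward smaller values of k.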

Model Architecture:

The MOEWaitKPolicy class implements the Mixture-of-Experts Wait-k mechanism, allowing the model to leverage multiple experts for different heads of attention.
The AdaptiveWaitKModel class integrates the encoder and decoder components, incorporating the adaptive wait-k policy into the translation process.
The SimultaneousTranslationModel class encapsulates the entire model architecture and runs the forward pass through the network. (This class is not working yet and still needs to be fixed.)
The model is trained over multiple epochs using the Adam optimizer and Cross-Entropy loss. The training loop passes batches through the model, calculates the loss, and updates the weights. (The training loop is not working yet and still needs to be fixed.)
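The training procedure described above can be sketched with a toy stand-in model. `ToyTranslator`, the vocabulary size, the padding id, and the random batch are all hypothetical; only the use of Adam and Cross-Entropy loss comes from this report. The reshape before the loss is shown explicitly because `nn.CrossEntropyLoss` expects logits of shape (N, C) and targets of shape (N,), which is a common source of size mismatches.

```python
import torch
from torch import nn

torch.manual_seed(0)

VOCAB, PAD_ID = 20, 0  # assumed values for illustration


class ToyTranslator(nn.Module):
    """Hypothetical stand-in for the real model: embeds tokens, predicts a token per position."""

    def __init__(self, vocab: int, dim: int = 16):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim, padding_idx=PAD_ID)
        self.proj = nn.Linear(dim, vocab)

    def forward(self, tokens: torch.Tensor) -> torch.Tensor:
        # tokens: (batch, seq) -> logits: (batch, seq, vocab)
        return self.proj(self.embed(tokens))


model = ToyTranslator(VOCAB)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-2)
criterion = nn.CrossEntropyLoss(ignore_index=PAD_ID)  # padding does not contribute to the loss

# One fixed random batch stands in for a real data loader.
inputs = torch.randint(1, VOCAB, (8, 5))
targets = torch.randint(1, VOCAB, (8, 5))

losses = []
for epoch in range(20):
    optimizer.zero_grad()
    logits = model(inputs)
    # Flatten (batch, seq, vocab) -> (batch * seq, vocab) and (batch, seq) -> (batch * seq,).
    loss = criterion(logits.reshape(-1, VOCAB), targets.reshape(-1))
    loss.backward()
    optimizer.step()
    losses.append(loss.item())

print(f"first loss {losses[0]:.3f}, last loss {losses[-1]:.3f}")
```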

Pending Tasks & Challenges

Fix Tensor Size Mismatches: Address the runtime error caused by mismatched tensor dimensions in the MOEWaitKPolicy forward method.
Resolve Syntax Issues: Correct all syntax errors in the code to ensure it runs without interruption.
Evaluate Model Performance: Once the errors are fixed, perform comprehensive evaluations using BLEU and ROUGE scores.

Alternative Frameworks: If issues with PyTorch persist, we will consider switching to TensorFlow/Keras for the model implementation. This may involve rewriting parts of the code to adapt to the new framework.

Tanmay-IITDSAI/MLProject
