An introduction to attention mechanisms and the vision transformer
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embeddings.
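To make the RNN-vs-parallel idea concrete, here is a minimal sketch of a linear-attention recurrence in PyTorch. It is a toy illustration of how an attention-like update can run as a constant-cost-per-token recurrence, not RWKV's actual time-mixing formulation; all names, shapes, and the exp feature map are assumptions made for the example.

```python
import torch

def linear_attention_recurrent(q, k, v):
    """Toy linear-attention recurrence (a simplified stand-in for the idea
    behind RNN-style transformers such as RWKV; NOT the RWKV formula).

    q, k, v: tensors of shape (seq_len, dim). The running state S accumulates
    key-value outer products, so each step costs O(dim^2) regardless of
    sequence length.
    """
    seq_len, dim = q.shape
    S = torch.zeros(dim, dim)            # running key-value state
    z = torch.zeros(dim)                 # running normalizer
    outputs = []
    for t in range(seq_len):
        k_t = torch.exp(k[t])            # positive feature map (exp, for simplicity)
        S = S + torch.outer(k_t, v[t])   # accumulate key-value outer product
        z = z + k_t
        q_t = torch.exp(q[t])
        outputs.append((q_t @ S) / (q_t @ z + 1e-8))
    return torch.stack(outputs)

# toy usage
q, k, v = (torch.randn(10, 16) for _ in range(3))
print(linear_attention_recurrent(q, k, v).shape)  # torch.Size([10, 16])
```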
Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
Implementation of the GPT decoder block in PyTorch finetuned on Shakespeare's works 🪶
Stock Price Prediction using an Attention-based LSTM
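The typical pattern behind such models is to score the LSTM's hidden states with a small attention layer and pool them into a context vector before the prediction head. A minimal sketch, assuming a generic architecture rather than this repository's exact one:

```python
import torch
import torch.nn as nn

class AttentionLSTM(nn.Module):
    """Minimal attention-based LSTM regressor (generic pattern, not the
    linked repo's architecture). Attention weights each hidden state before
    a linear head predicts the next value."""
    def __init__(self, n_features: int, hidden: int = 64):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.attn = nn.Linear(hidden, 1)     # one score per time step
        self.head = nn.Linear(hidden, 1)     # predicts a single value

    def forward(self, x):                    # x: (batch, seq_len, n_features)
        h, _ = self.lstm(x)                  # h: (batch, seq_len, hidden)
        weights = torch.softmax(self.attn(h), dim=1)
        context = (weights * h).sum(dim=1)   # weighted sum over time
        return self.head(context)            # (batch, 1)

# toy usage: 32 windows of 30 days with 5 features each
model = AttentionLSTM(n_features=5)
print(model(torch.randn(32, 30, 5)).shape)   # torch.Size([32, 1])
```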
Pytorch MIL pipeline for breast ultrasound cancer research
A concise but complete full-attention transformer with a set of promising experimental features from various papers
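The core operation every repository on this page builds on is scaled dot-product attention. A minimal reference sketch (not any specific repository's implementation):

```python
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    """Plain scaled dot-product attention (Vaswani et al., 2017).
    q, k, v: (batch, heads, seq_len, head_dim)."""
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = torch.softmax(scores, dim=-1)
    return weights @ v

# toy usage
q = k = v = torch.randn(2, 4, 10, 32)
print(scaled_dot_product_attention(q, k, v).shape)  # torch.Size([2, 4, 10, 32])
```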
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
Implementation of ChatGPT, but tailored towards primary care medicine, with the reward being able to collect patient histories in a thorough and efficient manner and come up with a reasonable differential diagnosis
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
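As a quick orientation, a Vision Transformer splits the image into patches, embeds each patch as a token, prepends a class token, and runs a standard transformer encoder over the sequence. A minimal sketch using PyTorch's built-in encoder, with invented hyperparameters rather than the repository's:

```python
import torch
import torch.nn as nn

class TinyViT(nn.Module):
    """Minimal Vision Transformer sketch (patch embedding + class token +
    transformer encoder); illustrative only, details differ from the repo
    above and from the original ViT paper."""
    def __init__(self, image_size=32, patch_size=8, dim=64, depth=2, heads=4, num_classes=10):
        super().__init__()
        num_patches = (image_size // patch_size) ** 2
        # turn each patch into a dim-sized token with a strided convolution
        self.to_patches = nn.Conv2d(3, dim, kernel_size=patch_size, stride=patch_size)
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, num_patches + 1, dim))
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x):                                    # x: (batch, 3, H, W)
        tokens = self.to_patches(x).flatten(2).transpose(1, 2)  # (batch, patches, dim)
        cls = self.cls_token.expand(x.size(0), -1, -1)
        tokens = torch.cat([cls, tokens], dim=1) + self.pos_embed
        encoded = self.encoder(tokens)
        return self.head(encoded[:, 0])                      # classify from the class token

# toy usage
print(TinyViT()(torch.randn(2, 3, 32, 32)).shape)            # torch.Size([2, 10])
```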
An awesome repository and a comprehensive survey on the interpretability of LLM attention heads.
Homemade GPT (not that good).
FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing [NeurIPS 2024]
Asymmetric Multi-Task Attention Network for Prostate Bed Segmentation in CT Images
The Enterprise-Grade, Production-Ready Multi-Agent Orchestration Framework. Join our community: https://discord.com/servers/agora-999382051935506503
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
Diffusion attentive attribution maps for interpreting Stable Diffusion for image-to-image attention.
Pytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers"
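Sparse attention restricts each query to a structured subset of keys, for example a local window plus a strided pattern, so cost grows sub-quadratically with sequence length. The sketch below only illustrates the masking pattern from Child et al. (2019) in a dense, hedged form; the repository above (and real implementations) skip the masked computation entirely rather than masking it out:

```python
import math
import torch

def local_strided_mask(seq_len: int, window: int, stride: int):
    """Boolean attention mask combining a causal constraint, a local window,
    and a strided pattern; a simplified sketch of the Sparse Transformers
    idea, not the linked repo's implementation."""
    i = torch.arange(seq_len).unsqueeze(1)   # query positions
    j = torch.arange(seq_len).unsqueeze(0)   # key positions
    causal = j <= i
    local = (i - j) < window                 # attend to recent positions
    strided = (j % stride) == 0              # attend to every stride-th position
    return causal & (local | strided)

def sparse_attention(q, k, v, window=8, stride=4):
    """Dense attention with the sparse mask applied (for illustration only)."""
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    mask = local_strided_mask(q.size(-2), window, stride)
    scores = scores.masked_fill(~mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

# toy usage
q = k = v = torch.randn(1, 2, 16, 32)        # (batch, heads, seq, head_dim)
print(sparse_attention(q, k, v).shape)       # torch.Size([1, 2, 16, 32])
```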
Engineer-To-Order (ETO) Graph Neural Scheduling (GNS) Project
Fast and memory-efficient PyTorch implementation of the Perceiver with FlashAttention.