This work provides extensive empirical results on training language models (LMs) to count. We find that while traditional RNNs trivially achieve inductive counting, Transformers must rely on positional embeddings to count out of domain. Modern RNNs (e.g., RWKV, Mamba) also largely underperform traditional RNNs at generalizing counting inductively.
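A minimal sketch of what an inductive-counting evaluation might look like, assuming a common length-generalization setup: train on short sequences, then test on strictly longer ones. The task format, token choice, and length ranges below are illustrative assumptions, not the paper's actual data pipeline.

```python
import random

def make_example(n: int) -> tuple[str, str]:
    """Map a sequence of n repeated tokens to its count, e.g. 'a a a' -> '3'."""
    return (" ".join(["a"] * n), str(n))

def make_split(lengths: range, k: int) -> list[tuple[str, str]]:
    """Sample k counting examples with lengths drawn from the given range."""
    return [make_example(random.choice(lengths)) for _ in range(k)]

# In-domain training lengths vs. longer out-of-domain test lengths:
# a model that counts inductively should transfer across this gap.
train = make_split(range(1, 51), 10_000)      # lengths 1..50
test_ood = make_split(range(51, 101), 1_000)  # lengths 51..100
```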
This is the official code for the CoLLAs 2022 paper "InBiaseD: Inductive Bias Distillation to Improve Generalization and Robustness through Shape-awareness".
Implementation code for "GKD: Semi-supervised Graph Knowledge Distillation for Graph-Independent Inference", accepted at Medical Image Computing and Computer Assisted Intervention (MICCAI 2021).