data-augmentation

Star

Here are 1,112 public repositories matching this topic...

snorkel-team / snorkel

Star

A system for quickly generating training data with weak supervision

python data-science machine-learning ai weak-supervision snorkel labeling data-augmentation training-data data-slicing

Updated May 2, 2024
Python

NVIDIA / DALI

Star

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

python machine-learning deep-learning neural-network mxnet gpu image-processing pytorch gpu-tensorflow data-processing data-augmentation audio-processing paddle image-augmentation fast-data-pipeline

Updated Dec 11, 2024
C++

ZhaoJ9014 / face.evoLVe

Star

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Updated Dec 23, 2022
Python

QData / TextAttack

Star

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

nlp security machine-learning natural-language-processing data-augmentation adversarial-machine-learning adversarial-examples adversarial-attacks

Updated Jul 25, 2024
Python

webdataset / webdataset

Star

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

deep-learning pytorch data-augmentation webdataset webdataset-format

Updated Dec 11, 2024
Python

fepegar / torchio

Star

Medical imaging toolkit for deep learning

python machine-learning deep-learning pytorch medical-image-computing medical-images data-augmentation augmentation medical-image-processing medical-image-analysis medical-imaging-datasets medical-imaging-with-deep-learning

Updated Dec 9, 2024
Python

iver56 / audiomentations

Sponsor

Star

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

audio python music machine-learning deep-learning dsp sound sound-processing data-augmentation augmentation audio-effects audio-data-augmentation

Updated Dec 9, 2024
Python

425776024 / nlpcda

Star

一键中文数据增强包； NLP数据增强、bert数据增强、EDA：pip install nlpcda

nlp data-augmentation chinese-data-augmentation nlpcda chinese-eda

Updated Apr 15, 2024
Python

fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.

visualization python machine-learning image deep-learning image-processing dataset image-classification outlier-detection object-detection image-analysis visual-search data-augmentation data-curation visualization-tools image-similarity image-duplicate-detection novelty-detection image-classfication

Updated Sep 3, 2024
Python

AgaMiko / data-augmentation-review

Star

List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.

review machine-learning survey generative-adversarial-network style-transfer data-generation data-augmentation image-augmentation data-synthesis autoaugment audio-augmentation data-augmentations augmentation-policies nlp-augmentation graph-data-augmentation

Updated Aug 14, 2024

jasonwei20 / eda_nlp

Star

Data augmentation for NLP, presented at EMNLP 2019

nlp text-classification position cnn embeddings synonyms swap classification rnn sentence data-augmentation

Updated Mar 19, 2023
Python

yongzhuo / nlp_xiaojiang

Star

自然语言处理（nlp），小姜机器人（闲聊检索式chatbot），BERT句向量-相似度（Sentence Similarity），XLNET句向量-相似度（text xlnet embedding），文本分类（Text classification），实体提取（ner，bert+bilstm+crf），数据增强（text augment, data enhance），同义句同义词生成，句子主干提取（mainpart），中文汉语短文本相似度，文本特征工程，keras-http-service调用

nlp text-classification distance chatbot chinese feature bert data-augmentation enhance text-augment xlnet

Updated Sep 23, 2021
Python

LirongWu / awesome-graph-self-supervised-learning

Star

Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"

machine-learning deep-learning transfer-learning representation-learning unsupervised-learning data-augmentation graph-neural-networks self-supervised-learning pre-training pretext-task

Updated Aug 15, 2024

zhanlaoban / EDA_NLP_for_Chinese

Star

An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。

text-classification eda chinese data-augmentation chinese-data-augmentation easy-data-augmentation

Updated May 31, 2022
Python

Paperspace / DataAugmentationForObjectDetection

Star

Data Augmentation For Object Detection

opencv deep-learning object-detection data-augmentation bounding-box imagine-augmentation

Updated Apr 14, 2020
Jupyter Notebook

asteroid-team / torch-audiomentations

Star

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

audio python music machine-learning deep-learning dsp waveform sound pytorch sound-processing data-augmentation augmentation audio-effects differentiable-data-augmentation audio-data-augmentation

Updated Nov 8, 2024
Python

styfeng / DataAug4NLP

Star

Collection of papers and resources for data augmentation for NLP.

machine-learning natural-language-processing deep-learning text-classification transformers artificial-intelligence survey data-augmentation survey-paper acl2021

Updated Aug 12, 2022

goru001 / inltk

Star

Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need

nlp deep-learning word-embeddings pytorch data-augmentation indic-languages sentence-similarity sentence-embeddings sentence-encoding

Updated Jan 20, 2024
Python

quqxui / Awesome-LLM4IE-Papers

Star

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

information-extraction named-entity-recognition event-detection event-extraction data-augmentation relation-extraction zero-shot-learning few-shot-learning knowledge-graph-construction event-arguments cross-domain-learning in-context-learning large-language-models

Updated Nov 18, 2024

zhunzhong07 / Random-Erasing

Star

Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST

pytorch image-classification object-detection data-augmentation person-re-identification aaai2020

Updated Nov 8, 2023
Python

Improve this page

Add a description, image, and links to the data-augmentation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-augmentation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data-augmentation

Here are 1,112 public repositories matching this topic...

snorkel-team / snorkel

NVIDIA / DALI

ZhaoJ9014 / face.evoLVe

QData / TextAttack

webdataset / webdataset

fepegar / torchio

iver56 / audiomentations

425776024 / nlpcda

visual-layer / fastdup

AgaMiko / data-augmentation-review

jasonwei20 / eda_nlp

yongzhuo / nlp_xiaojiang

LirongWu / awesome-graph-self-supervised-learning

zhanlaoban / EDA_NLP_for_Chinese

Paperspace / DataAugmentationForObjectDetection

asteroid-team / torch-audiomentations

styfeng / DataAug4NLP

goru001 / inltk

quqxui / Awesome-LLM4IE-Papers

zhunzhong07 / Random-Erasing

Improve this page

Add this topic to your repo