SemanticKITTI API for visualizing dataset, processing data, and evaluating results.
-
Updated
May 14, 2024 - Python
SemanticKITTI API for visualizing dataset, processing data, and evaluating results.
LexicMap: efficient sequence alignment against millions of prokaryotic genomes
A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks
SOTA kilo-scale MIDI dataset for MIR and Music AI purposes
A large-scale dataset for instance-level recognition for artworks is introduced.
Maximize Efficiency, Elevate Accuracy: Slash GPU Hours by Half with Efficient Pre-training!
Official code release for BOLD5000 Release 2.0
Korean Moview Review Emotion (KMRE) Dataset
Web interface for querying the LAION-5B dataset using CLIP embeddings.
This repository contains a framework with a GPU implementation of generalized convolution operators. The framework is designed for large image data sets and can run in a distributed system.
KSL-Guide: A Large-scale Korean Sign Language Dataset Including Interrogative Sentences for Guiding the Deaf and Hard-of-Hearing, FG, 2021
A large-scale datasets for session-based recommendation and sequential recommendation
M3LS : Multi-lingual Multi-modal summarization dataset
Music recommender system based on collaborative filtering using the ListenBrainz listens dataset.
Densim is a library for efficient similarity search and clustering of dense vectors, which are numerical representations of data such as images, text, or audio.
To analyze and predict flight data using Spark within the Databricks environment.
GeoCARET - a command line Python tool for delineating and analysing catchments and reservoirs.
Add a description, image, and links to the large-scale-dataset topic page so that developers can more easily learn about it.
To associate your repository with the large-scale-dataset topic, visit your repo's landing page and select "manage topics."