Skip to content

tatami-inc/tatami_tiledb

Repository files navigation

tatami for TileDB matrices

Unit tests Documentation Codecov

Overview

This repository implements tatami bindings for TileDB-backed matrices, allowing random access without loading the entire dataset into memory. Any matrix stored as a 2-dimensional TileDB array (dense or sparse) can be represented as a tatami::Matrix and consumed by tatami-compatible applications.

Quick start

tatami_tiledb is a header-only library, so it can be easily used by just #includeing the relevant source files:

#include "tatami_tiledb/tatami_tiledb.hpp"

tatami_tiledb::DenseMatrix<double, int> dense_mat("some_dir", "attr_name");
tatami_tiledb::SparseMatrix<double, int> sparse_mat("some_dir", "attr_name");

Advanced users can fiddle with the options, in particular the size of the tile cache. For example, we could force the matrix to use no more than 200 MB of memory in the cache:

tatami_tiledb::DenseMatrixOptions opt;
opt.maximum_cache_size = 200000000;
opt.require_minimum_cache = false; // don't allow the cache to automatically expand.
tatami_tiledb::DenseMatrix<double, int> some_mat2("some_dir", "attr_name", opt);

Check out the reference documentation for more details.

Building projects

Cmake with FetchContent

If you're using CMake, you just need to add something like this to your CMakeLists.txt:

include(FetchContent)

FetchContent_Declare(
  tatami_tiledb
  GIT_REPOSITORY https://github.com/tatami-inc/tatami_tiledb
  GIT_TAG master # or any version of interest
)

FetchContent_MakeAvailable(tatami_tiledb)

Then you can link to tatami_tiledb to make the headers available during compilation:

# For executables:
target_link_libraries(myexe tatami_tiledb)

# For libaries
target_link_libraries(mylib INTERFACE tatami_tiledb)

CMake using find_package()

You can install the library by cloning a suitable version of this repository and running the following commands:

mkdir build && cd build
cmake .. -DTATAMI_TILEDB_TESTS=OFF
cmake --build . --target install

Then you can use find_package() as usual:

find_package(tatami_tatami_tiledb CONFIG REQUIRED)
target_link_libraries(mylib INTERFACE tatami::tatami_tiledb)

Manual

If you're not using CMake, the simple approach is to just copy the files - either directly or with Git submodules - and include their path during compilation with, e.g., GCC's -I. This will also require the dependencies listed in extern/CMakeLists.txt, namely the tatami_chunked library.

You'll also need to link to the TileDB library yourself (version 2.15 or higher). tatami_tiledb does not place any restrictions on the source of the TileDB library; in the simplest case, we just download the latest release, unpack it in some tiledb/ directory, and link to it as shown below. (Proper installation instructions for TileDB can be found in their documentation.)

find_package(TileDB PATHS tiledb)
target_link_libraries(myexe TileDB::tiledb_shared)