Skip to content

DimmWitted Gibbs Sampler in C++ β€” βš οΈπŸš§πŸ›‘ REPO MOVED TO DEEPDIVE πŸ‘‰πŸΏ

License

Notifications You must be signed in to change notification settings

HazyResearch/sampler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

DimmWitted: Fast Gibbs Sampler Build Status

How fast is DimmWitted?

  • On Amazon EC2's FREE MACHINE (512M memory, 1 core). We can sample 3.6M variables/seconds.
  • On a 2-node Amazon EC2 machine, sampling 7 billion random variables, each of which has 10 features, takes 3 minutes. This means we can run inference for all living human beings on this planet with $15 (100 samples!)
  • On Macbook, DimmWitted runs 10x faster than DeepDive's default sampler.

Usage

See: DimmWitted sampler page in DeepDive's documentation.

The binary format for DimmWitted's input is documented in doc/binary_format.md.

Installation

First, install build dependencies:

make -j dep

Then, build:

make -j

A modern C++ compiler is required: g++ >= 4.8 or clang++ >= 4.2. To specify the compiler to use, set the CXX variable:

CXX=/dfs/rulk/0/czhang/software/gcc/bin/g++ make

To test, run:

make -j test

Development

  • Follow Google C++ Style Guide.
  • Travis CI tests will error unless you run make format before git commits.
  • Tests are written with gtest and bats.
  • Command-line parsing is done with TCLAP.
  • NUMA control is done with libnuma.

Reference

C. Zhang and C. RΓ©. DimmWitted: A study of main-memory statistical analytics. PVLDB, 2014.

About

DimmWitted Gibbs Sampler in C++ β€” βš οΈπŸš§πŸ›‘ REPO MOVED TO DEEPDIVE πŸ‘‰πŸΏ

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published