GitHub - devsgnr/breadroll: breadroll 🥟 is a simple lightweight library for data processing operations written in Typescript and powered by Bun.

breadroll 🥟 is a simple lightweight toolkit for parsing csv, tsv, and other delimited files, performing EDA (exploratory data analysis), and data processing operations on multivariate datasets. Think pandas but written in Typescript and developed on the Bun Runtime.

⚡ Fast: breadroll is built on Bun, the all-in-one Javascript runtime built for speed
📁 File I/O: Support for local & remote sources loved by JS developers - Local, HTTPS, & Supabase Storage
😊 Easy-to-use: Compose queries using filter keywords that are simple and are easy to comprehend

Table of Content

Installation

System Requirements:

Bun
MacOS, Linux
Typescript >= 5.1

Bun

breadroll is built on and optimized for Bun.js. You can install Bun by running the following

curl https://bun.sh/install | bash

create a new Bun project by running

bun init

then you can now install breadroll using

bun add breadroll

Easy API

breadroll provides an easy to use API that gets you from zero to data processing in no time, with lazy loading of these delimited files via Bun's File I/O Bun.file(), the file parsed based on the DataframeReadOptions, and convert into a Dataframe, and easily read out the content of the Dataframe using .value.

import Breadroll, { Dataframe } from "breadroll";

const csv: Breadroll = new Breadroll({ header: true });

Example: From one instance example above, you can open multiple csv files

const df: Dataframe<T> = await csv.open.local("./data/ds_salaries.csv", ",");

Remote Data Sources

breadroll makes it easy to work with remote data sources with current support for HTTPS and Supabase Storage. With other remote data sources on the roadmap.

const df: Dataframe<T> = await csv.open.https("https://.../.../filename.csv", ",");
const df: Dataframe<T> = await csv.open.supabaseStorage("bucketName", "filepath", ",");

Filtering

Peform complex filtering; with various filters including range filters like is between that can be achieved using an optional function parameter limit which is the upper limit. These range filter are only effective with numbers (integers, floating-point).

df.filter("age", "is between", 30, 40);

Perform even more complex filtering with multiple / chained filter, you can chain the filter ie. filtering the previously filtered Dataframe, the chained filter can be as long as you need them to be.

df.filter("age", "is between", 30, 40).filter("salary", ">", 70000).filter("work_year", "==", 2020);

Data Transformation

Perform whatever transformation you'd like to perform on the value of a specified column, from simple transformation like value + 2, to complex mathematical transformations that can be paired with the in-built numeric constant object

df.apply({ key: "salary", fn: (v) => v / (40 * 4), newkey: "per_hour" });

A Little Math

Get a single number that accurately represents the underlying data with the many provided aggregation functions, the likes of average (mean), max, min, sum, count, etc. with more in development

df.sum("capital_gain");
df.average("capital_gain");
df.count;

This project running on bun >=v1.0.22. Bun.js is a fast all-in-one JavaScript runtime.

Name		Name	Last commit message	Last commit date
Latest commit History 195 Commits
.github		.github
.husky		.husky
docs		docs
src		src
test		test
.editorconfig		.editorconfig
.env.example		.env.example
.eslintrc.cjs		.eslintrc.cjs
.gitignore		.gitignore
.npmignore		.npmignore
.npmrc		.npmrc
.prettierignore		.prettierignore
.prettierrc.toml		.prettierrc.toml
LICENSE		LICENSE
README.md		README.md
build.ts		build.ts
bun.lockb		bun.lockb
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Table of Content

Installation

Bun

Easy API

Remote Data Sources

Filtering

Data Transformation

A Little Math

About

Releases 5

Sponsor this project

Packages

Languages

License

devsgnr/breadroll

Folders and files

Latest commit

History

Repository files navigation

Table of Content

Installation

Bun

Easy API

Remote Data Sources

Filtering

Data Transformation

A Little Math

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 5

Sponsor this project

Packages 0

Languages

Packages