Buster is an open-source platform for deploying AI data analysts
Go to file
Dallin Bentley 00d189a99b docker compose licenses and readme 2024-09-10 17:09:57 -06:00
assets docker compose licenses and readme 2024-09-10 17:09:57 -06:00
ee docker compose licenses and readme 2024-09-10 17:09:57 -06:00
helm_values docker compose licenses and readme 2024-09-10 17:09:57 -06:00
python docker compose licenses and readme 2024-09-10 17:09:57 -06:00
.env.example docker compose licenses and readme 2024-09-10 17:09:57 -06:00
.gitignore docker compose licenses and readme 2024-09-10 17:09:57 -06:00
LICENSE docker compose licenses and readme 2024-09-10 17:09:57 -06:00
README.md docker compose licenses and readme 2024-09-10 17:09:57 -06:00
docker-compose.yml docker compose licenses and readme 2024-09-10 17:09:57 -06:00
main.tf Reset to current state 2024-09-10 11:30:01 -06:00
outputs.tf Reset to current state 2024-09-10 11:30:01 -06:00
providers.tf Reset to current state 2024-09-10 11:30:01 -06:00
variables.tf Reset to current state 2024-09-10 11:30:01 -06:00

README.md

Buster GitHub Banner

Buster Warehouse

A data warehouse built on Apache Iceberg and Starrocks


Buster Warehouse Overview

This project is a data warehouse built on Apache Iceberg and Starrocks. In working with our customers, we found that Snowflake, Bigquery, and other warehouse solutions were prohibitively expensive or slow in them being able to deploy AI-powered analytics at scale.

Additionaly, we found that having a close integration between the data warehouse and our AI-native BI tool allows for a better and more reliable data experience.

Key Features

  • Built on Starrocks: We felt that Starrock was the best query engine by default for our use case. The main thing that pushed us towards it was that they perform predicate pushdown on iceberg tables, whereas Clickhouse and DuckDB do not. We were also impressed by the performance, caching system, and flexibility of Starrocks.
  • Built on Apache Iceberg: Some of the top companies in the world use Apache Iceberg for storing and interacting with their data. We wanted a table format that not only brought tremendous benefits, but one that companies wouldn't outgrow.
  • Bring Your Own Storage: We felt that customers should own their data and not be locked into a particular storage engine.

Quickstart

Have

Roadmap

Currently, we are in the process of open-sourcing the platform. This includes:

After that, we will release an official roadmap.

How We Plan to Make Money

Currently, we offer a few commercial products:

  • Cloud-Hosted Version
    • Cluster
    • Serverless
  • Managed Self-Hosted Version

Support and feedback

You can contact us through either:

License

This repository is MIT licensed, except for the ee folders. See LICENSE for more details.

Shoutouts

The documentation from the Starrocks, Iceberg, and PyIceberg team has been very helpful in building this project.