Databricks Labs
Labs projects to accelerate use cases on the Databricks Unified Analytics Platform
Repositories
Showing 10 of 36 repositories
- dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) can generate large simulated or synthetic data sets for testing, POCs, and other uses in Databricks environments, including Delta Live Tables pipelines.
- tempo
API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, sum, count, etc.), AS OF joins, downsampling, and interpolation.
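To make the AS OF join mentioned in the tempo description concrete, here is a minimal sketch in plain Python. It is not tempo's API and does not use Spark; the function name `as_of_join` and the sample `trades` and `quotes` data are hypothetical, chosen only to illustrate the idea: each left-hand row is paired with the most recent right-hand row whose timestamp is at or before it.

```python
from bisect import bisect_right


def as_of_join(left, right):
    """Pair each left row with the latest right value at or before its timestamp.

    Both inputs are lists of (timestamp, value) tuples.
    `right` is assumed to be sorted by timestamp (an assumption of this sketch).
    """
    right_timestamps = [ts for ts, _ in right]
    joined = []
    for ts, value in left:
        # Index of the last right row with timestamp <= ts, or -1 if none exists.
        idx = bisect_right(right_timestamps, ts) - 1
        match = right[idx][1] if idx >= 0 else None
        joined.append((ts, value, match))
    return joined


# Hypothetical sample data: trade events and the quotes that precede them.
trades = [(1, "buy"), (5, "sell"), (9, "buy")]
quotes = [(2, 100.0), (4, 101.5), (8, 99.0)]

for row in as_of_join(trades, quotes):
    print(row)
```

The first trade has no earlier quote, so it is paired with `None`. A distributed implementation would partition and sort the data rather than scan in-memory lists, but the matching rule is the same.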