Skip to content

DataStack makes it easy to do data analysis as a first time coder

License

Notifications You must be signed in to change notification settings

fredzett/datastack

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

datastack

DataStack makes it easy to do data analysis as a first time coder.

More to follow.

How to use

  1. Jupyter Notebook
import sys
!{sys.executable} -m pip install git+"https://github.com/fredzett/datastack"

from datastack import DataTable, DataColumn, col, label
import numpy as np

# Create Table with 1 Mio rows
n = 1_000_000
tbl = DataTable(a=np.random.choice(range(100),n),
                b=np.random.rand(n),
                c=np.random.choice(range(100),n),
                d=np.random.choice(list("abcefdsgekd"), n))

# Apply verbs
tbl = (tbl
        .filter(col("c").larger_then(3.5)) 
        .mutate(NewCol=col("c") * col("a") * 12.24)
        .order_by(label("c"), label("NewCol"), asc=[True, False])
        .select(label().contains("New") | label("c"))
        )

About

DataStack makes it easy to do data analysis as a first time coder

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published