KubePipe

KubePipe is a tool to parallelize the execution of multiple Machine Learning pipelines in containers orchestrated by Kubernetes.

Installation

Install Argo Workflows

kubectl create ns argo
kubectl apply -n argo -f https://raw.githubusercontent.com/argoproj/argo-workflows/master/manifests/quick-start-postgres.yaml
kubectl patch svc minio -n argo -p '{"spec": {"type": "LoadBalancer"}}'
kubectl patch svc argo-server -n argo -p '{"spec": {"type": "LoadBalancer"}}'

Install KubePipe

pip install git+https://github.com/HPC-ULL/KubePipe

Usage

from sklearn import datasets
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import OneHotEncoder, StandardScaler

from kube_pipe.kube_pipe_kubernetes import KubePipeKubernetes as KubePipe

# Load the iris dataset and hold out 20% of it for testing.
iris = datasets.load_iris()
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.2)

# Each list defines one pipeline; the last step is the estimator.
pipelines = KubePipe(
    [StandardScaler(), AdaBoostClassifier()],
    # handle_unknown="ignore" keeps the encoder from failing on test values
    # that did not appear in the training split.
    [OneHotEncoder(handle_unknown="ignore"), LogisticRegression()],
    [StandardScaler(), RandomForestClassifier()],
)

# Fit all pipelines on the cluster, then score them on the held-out data.
pipelines.fit(X_train, y_train)

scores = pipelines.score(X_test, y_test)
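
For comparison, the same three pipelines can be run one after another on the local machine with plain scikit-learn. The sketch below is not KubePipe's implementation and makes no claim about the shape of the value KubePipe returns from score; it only shows the sequential baseline that the Usage example distributes across containers. The names local_pipelines and local_scores are illustrative, and random_state, max_iter and handle_unknown="ignore" are assumptions added so the run is repeatable and does not fail on unseen feature values.

```python
from sklearn import datasets
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Same data split as in the Usage example; random_state is an assumption
# added here so repeated runs use the same split.
iris = datasets.load_iris()
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.2, random_state=0)

# One scikit-learn Pipeline per list of steps (illustrative local equivalent).
local_pipelines = [
    make_pipeline(StandardScaler(), AdaBoostClassifier()),
    make_pipeline(OneHotEncoder(handle_unknown="ignore"),
                  LogisticRegression(max_iter=1000)),
    make_pipeline(StandardScaler(), RandomForestClassifier()),
]

# Fit and score sequentially; each iteration waits for the previous one.
local_scores = []
for pipe in local_pipelines:
    pipe.fit(X_train, y_train)
    local_scores.append(pipe.score(X_test, y_test))

# Report the mean accuracy of each pipeline, labeled by its final step.
for pipe, score in zip(local_pipelines, local_scores):
    print(f"{pipe.steps[-1][0]}: {score:.3f}")
```

Each iteration of the loop is independent of the others, which is why this kind of workload is a natural fit for running one pipeline per container.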

License

MIT
