Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doesn't seem to work with distributed. #8

Open
rabernat opened this issue Nov 16, 2019 · 2 comments
Open

Doesn't seem to work with distributed. #8

rabernat opened this issue Nov 16, 2019 · 2 comments

Comments

@rabernat
Copy link
Collaborator

rabernat commented Nov 16, 2019

I tried some basic stuff with a dask_kubernetes on ocean.pangeo.io. No luck.

I created a cluster and connected to it, created a gdrivefs, and the tried to read / write via xarray. I immediately get a KilledWorker.

Sorry for not providing a reproducible example. The only example I know how to make is probably too complicated. I figured you would know how to do a proper test of distributed instead of whatever hack I come up with.

@martindurant
Copy link
Member

martindurant commented Nov 16, 2019 via email

@martindurant
Copy link
Member

The following worked for me:

import fsspec
from gdrivefs import core
fsspec.registry['gdrive'] = core.GoogleDriveFileSystem

import dask.bag as db
from dask.distributed import Client
c = Client()
b = db.read_text('gdrive://*.md', storage_options={'token': 'cache'})
b.compute()

(in my case this was two files, and resulted in text output, as expected)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants