Restore/improve support for non-GEO dataset upload (esp. single-cell) #1284

Open
ppavlidis opened this issue Nov 1, 2024 · 1 comment
Labels
enhancement Enhance the code or user experience

Comments

@ppavlidis
Collaborator

There are many single-cell datasets that we want to load into Gemma that are not available in GEO. These typically come from assorted websites rather than from a particular repository.

We have some antiquated support for this already, both from the CLI and GUI, but it needs to be revisited and probably updated.

GUI: https://gemma.msl.ubc.ca/expressionExperiment/upload.html (ExpressionDataFileUploadController)
CLI: LoadSimpleExpressionDataCli

These were designed with microarrays in mind, and for data that comes as a single tab-delimited file.

Note that we also have a way to load experimental design information from files (ExperimentalDesignImporter), but it is limited too. We'll need something like it for uploading sample metadata.

We'll need to adapt these to facilitate loading of single-cell data.

In general, there are three steps, after which datasets can be processed "as usual".

  1. Definition of the basic dataset information (name, description, etc.). The upload form is not a bad way to do this, but it will need to be updated a little. Uploading the data itself should probably be separated from this step completely.
  2. Loading of data files, probably with support for some format other than TSV (we need to see what makes sense). Since we already support this for single-cell data, this part should be easy.
  3. Uploading of sample metadata, if available, to save data entry time.
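
As a rough illustration of how the three steps above could be kept separate, here is a minimal sketch. It is written in Python for brevity and is not Gemma code: every name in it (`DatasetDefinition`, `read_expression_tsv`, `read_sample_metadata`, `check_samples`) is hypothetical, and the file layouts (a dense tab-delimited matrix with genes in rows, and a metadata table keyed by its first column) are assumptions made for the example. Real single-cell data would most likely arrive in a sparse format instead.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class DatasetDefinition:
    """Step 1 (hypothetical): basic dataset information, defined before any data is uploaded."""
    short_name: str
    name: str
    description: str = ""


def read_expression_tsv(text):
    """Step 2 (hypothetical): parse one tab-delimited matrix, genes in rows, samples in columns.

    Returns (sample_ids, {gene_id: [values, ...]}).
    """
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    header = next(reader)
    sample_ids = header[1:]  # the first header cell labels the gene column
    matrix = {}
    for row in reader:
        if not row:
            continue  # skip blank lines
        gene, values = row[0], row[1:]
        if len(values) != len(sample_ids):
            raise ValueError(
                f"row {gene!r} has {len(values)} values, expected {len(sample_ids)}"
            )
        matrix[gene] = [float(v) for v in values]
    return sample_ids, matrix


def read_sample_metadata(text):
    """Step 3 (hypothetical): parse per-sample metadata, keyed by the first column."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    id_column = reader.fieldnames[0]
    return {
        row[id_column]: {k: v for k, v in row.items() if k != id_column}
        for row in reader
    }


def check_samples(sample_ids, metadata):
    """Report samples without metadata, and metadata rows without a matching sample."""
    missing = [s for s in sample_ids if s not in metadata]
    extra = [s for s in metadata if s not in sample_ids]
    return missing, extra


# Made-up inputs, for illustration only.
definition = DatasetDefinition(short_name="example-sc", name="Example single-cell dataset")
samples, matrix = read_expression_tsv("gene\tS1\tS2\nG1\t1\t0\nG2\t3.5\t2\n")
metadata = read_sample_metadata("sample\ttissue\nS1\tcortex\nS3\tliver\n")
missing, extra = check_samples(samples, metadata)
print(missing, extra)
```

Keeping the definition step free of any file handling is what would allow the data and metadata uploads to happen later, or to be retried, without redefining the dataset. The cross-check at the end is the kind of validation that could flag sample ID mismatches before anything is persisted.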

We'll flesh this out with some particular examples.

@ppavlidis ppavlidis added the enhancement Enhance the code or user experience label Nov 1, 2024
@arteymix
Member

arteymix commented Nov 1, 2024

This needs to wait until we finish the basic single-cell support.

We should have all the necessary software components for this; they just need to be assembled in a CLI tool.


2 participants