Skip to content
This repository has been archived by the owner on Jun 18, 2020. It is now read-only.

数据格式设计 #1

Open
ShixiangWang opened this issue Sep 21, 2019 · 2 comments
Open

数据格式设计 #1

ShixiangWang opened this issue Sep 21, 2019 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@ShixiangWang
Copy link
Owner

以表格为基础,建立单一的数据表存储单个数据集

以数据表为基础,建立某种数据的数据集,并用一个表格标注元信息,包括创建时间,与其他数据库数据表关联,引用等

实现方式,使用较为宽松的S3,并定义一系列的泛型方法

@ShixiangWang ShixiangWang self-assigned this Sep 21, 2019
@ShixiangWang ShixiangWang added the enhancement New feature or request label Sep 21, 2019
@ShixiangWang
Copy link
Owner Author

ShixiangWang commented Sep 21, 2019

Metadata for dataset should at least contains:

  • Attribution
    • Source: URL
    • Description: describe data preprocessing, meaning of columns etc.
    • Unit
    • Citation: reference, separated by ;
    • Software: NA or softwares, separated by ;
    • LastModified: recent update date-time
    • Note: extra info users may should know
  • Maintainer
    • Name
    • Email

Rows representing samples and columns representing variables.

Classes to do:

DataTable: element of a DataBase
DataBase: A database for storing similar data, contains a table connect data tables cross different databases.

DataBase to do:

  • ImmuneFraction
  • ImmuneResponse
  • Phenotype

@ShixiangWang
Copy link
Owner Author

一个数据表是一个 rds 数据,存放到 extdata 中,而数据库元信息表则存储到 data 下面。

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant