Datasets containing the initial (training, 38 samples) and independent (test, 34 samples) datasets used in the paper : Golub et al "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
.
These datasets contain measurements corresponding to acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL) from Bone Marrow and Peripheral Blood using gene expression monitoring (via DNA microarray) . Intensity values have been re-scaled such that overall intensities for each chip are equivalent.
The motive is to categorize the samples into AMP
and ALL
using Principal Component Analysis.
+ Vedant Shrivastava | vedantshrivastava466@gmail.com