-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can I get DE gene list for a simulated data set? #57
Comments
For the Splat simulation you need to consider the |
Thank you for answering! Could you please explain the complication when I simulate multiple groups? When comparing only two groups out of many groups, is it reasonable to ignore the DE factor for other groups and focus only on the two groups of interest? Your advice will be really helpful for our analyses. Thank you again. |
You are correct, you only need to compare the factors for the groups you are interested in. I will try to provide an example that explains how it works. Let's say we are simulation some data with just two groups and we get these values for the genes:
In this case GeneA is not DE, GeneB is up-regulated in Group2 (foldchange = 5), GeneC is up-regulated in Group1 (foldchange = 2). Now let's say we simulate three groups:
We can do a similar thing to calculate the simulated differences in expression (foldchanges) between each of the pairs of groups we are interested in.
The simulated foldchanges we have just calculated can then be used to tell us which genes are DE between two group (those where FoldChange != 1). If we want to compare one group to multiple other groups it gets a bit more complicated again. We need to look at the DE factors in each of the groups but weighted according to the number of cells in each group. Let's say we want to compare GeneA between Group1 vs (Group2 + Group3) we could calculated the simulated foldchange as something like:
Hope that helps explain things! 😸 |
The explanation is really thorough and helpful. Thank you again! |
I am conducting simulations to evaluate DE gene detection performances. Is it possible to know which genes are differentially simulated among groups, so that I can have a gold standard to compare with? I appreciate your support! Thank you.
The text was updated successfully, but these errors were encountered: