Causality Workbench - Challenges in Machine Learning


A genomics dataset

Contact: Isabelle Guyon - Submitted: 2008-09-12 02:55


This is one of the datasets from the first causality challenge, "Causation and Prediction". The goal of the challenge was to make predictions under manipulations.

REGED (REsimulated Gene Expression Dataset) monitors the expression of genes that could be responsible for lung cancer. The data are "re-simulated", i.e. generated by a model derived from real human lung-cancer microarray gene expression data. From the causal discovery point of view, the goal is to separate genes whose activity causes lung cancer from those whose activity is a consequence of the disease.

For the pot-luck challenge, the proposed task is to discover the causal network in the neighborhood of the target.
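
The distinction between causes and consequences of the target matters for prediction under manipulation, and a toy simulation can make that concrete. The sketch below is purely illustrative and does not use REGED or its generating model: the three-variable chain (cause -> target -> effect), the noise levels, and the names `pearson` and `simulate` are all assumptions made for this example. It compares how strongly each variable correlates with the target in natural data and in data where both variables are set by an external agent.

```python
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def simulate(n, manipulated, seed=0):
    """Sample a hypothetical chain cause -> target -> effect.

    When `manipulated` is True, both `cause` and `effect` are set by an
    external agent. The target still responds to the cause, but the
    effect no longer depends on the target.
    Returns (corr(cause, target), corr(effect, target)).
    """
    rng = random.Random(seed)
    cause, target, effect = [], [], []
    for _ in range(n):
        # The cause is drawn the same way whether it is natural or set
        # externally; the target depends on it in both cases.
        c = rng.gauss(0.0, 1.0)
        t = 1 if c + rng.gauss(0.0, 0.5) > 0 else 0
        if manipulated:
            # The manipulation cuts the link from target to effect.
            e = rng.gauss(0.0, 1.0)
        else:
            e = t + rng.gauss(0.0, 0.5)
        cause.append(c)
        target.append(t)
        effect.append(e)
    return pearson(cause, target), pearson(effect, target)


if __name__ == "__main__":
    for manipulated in (False, True):
        c_corr, e_corr = simulate(5000, manipulated)
        label = "manipulated" if manipulated else "natural"
        print(f"{label}: cause~target={c_corr:.2f}, effect~target={e_corr:.2f}")
```

In this toy setting both variables are informative in natural data, but only the cause remains informative once the variables are manipulated, which is why a predictor intended for manipulated test data benefits from knowing the causal direction.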

Comments / Questions / Answers

#1 Sahar Behzadi 2018-05-08 09:54:59 -

Since the competition is over, is it possible to get access to the ground truth, especially for detecting causal relations?

#2 Isabelle Guyon 2018-05-08 21:51:49 In reply to message #1

The competition is over, but you can still make post-challenge submissions to see how it works.
