This is one of the datasets of the first causality challenge: causation and prediction. The goal of the challenge was to make predictions under manipulations. SIDO (SImple Drug Operation mechanisms) contains descriptors of molecules, which have... [more/question/discuss/rate/edit...]
|
The PROMO dataset proposes the task to identify which promotions affect sales. Artificial data about 1000 promotion variables and 100 product sales is provided. The goal is to predict a 1000x100 boolean influence matrix, indicating for each (i,j)... [more/question/discuss/rate/edit...]
|
This dataset consists of roughly 700 to 900 single cell recordings of the abundance of 11 phosphoproteins and phospholipids (PKC, PKA, P38, Jnk (pjnk), Raf (praf), Mek (pmek), Erk (p44/42), Akt (pakts473), PLC-gamma (plcg), PIP2, PIP3) under various... [more/question/discuss/rate/edit...]
|
TIED dataset © 2008 Alexander Statnikov and Constantin Aliferis Introduction TIED stands for Target Information Equivalent Dataset. It is an artificial simulated dataset constructed to illustrate that there may be many minimal sets of... [more/question/discuss/rate/edit...]
|
The objective is to determine the set of boolean rules that describe the interactions of the nodes within this plant signaling network. The dataset includes 300 separate boolean pseudodynamic simulations of the true rules, using an asynchronous... [more/question/discuss/rate/edit...]
|
CINA (Census Is Not Adult) is derived from census data (the UCI machine-learning repository Adult database). The data consists of census records for a number of individuals. The causal discovery task is to uncover the socio-economic factors... [more/question/discuss/rate/edit...]
|
This is one of the datasets of the first causality challenge: causation and prediction. The goal of the challenge was to make predictions under manipulations. REGED (REsimulated Gene Expression Dataset) monitors the expression of genes, which... [more/question/discuss/rate/edit...]
|
This is one of the datasets of the first causality challenge: causation and prediction. The goal of the challenge was to make predictions under manipulations. MARTI (Measurement ARTIfact) is obtained from the same data generative process as... [more/question/discuss/rate/edit...]
|
From real data, the anonymized logs of a web server, determine the causal structure - which pages link/lead to visits of other pages. The ground truth is beyond doubt, from the referrer information, but this information will be kept for an... [more/question/discuss/rate/edit...]
|
The data set consists of 8 N x 2 matrices, each representing a cause-effect pair and the task is to identify which variable is the cause and which one the effect. The origin of the data is hidden for the participants but known to the organizers.... [more/question/discuss/rate/edit...]
|
Stemmatology (a.k.a. stemmatics) studies relations among different variants of a document that have been gradually built from an original text by copying and modifying earlier versions. The aim of such study is to reconstruct the family tree (causal... [more/question/discuss/rate/edit...]
|
Summary: This data represents a 9 variable (labeled X1...X9) dynamic system with several dynamic processes acting on qualitatively different time scales from one another. The goal is to learn a causal model of the system with the training data, and... [more/question/discuss/rate/edit...]
|
This challenge has two parts, a simulation and real data. Simulation: Data are simulated as superposition of bivariate unidirectional interaction plus additive mixed and non-white noise. The simulations were done with AR-models with... [more/question/discuss/rate/edit...]
|
Abstract: A complex modern semi-conductor manufacturing process is normally under consistent surveillance via the monitoring of signals/variables collected from sensors and or process measurement points. However, not all of these signals are equally... [more/question/discuss/rate/edit...]
|
During the semiconductor fabrication process each wafer goes through a product specific sequence of operations (hundreds) in batches - lots. Every lot goes through each operation in the sequence. At each operation a lot could go through only one of... [more/question/discuss/rate/edit...]
|
During the last 5 years, research on Human Activity Recognition (HAR) has reported on systems showing good overall recognition performance. As a consequence, HAR has been considered as a potential technology for e-health systems. Here, we propose a... [more/question/discuss/rate/edit...]
|
When using causal discovery in the geosciences, it is hard to evaluate the results, because there is generally no ground truth available. To fill this gap we simulate two important processes that are often dominant in geophysical processes,... [more/question/discuss/rate/edit...]
|
Data simulated with Gene Network Weaver. [more/question/discuss/rate/edit...]
|
Data generated by gene network weaver [more/question/discuss/rate/edit...]
|
The REGED network was induced from 1,000 randomly selected genes in a lung cancer gene expression dataset [more/question/discuss/rate/edit...]
|
Faulty and healthy gear box Data sets need to be analyzed in detail. Here, we created this dataset for those who do research in wind turbine gearbox fault diagnosis. [more/question/discuss/rate/edit...]
|