| 
                                     
               
CINAR: Raw data for CINA, an econometrics dataset
             
                        
                                                                
       
               
CINAR (Census Is Not Adult Raw) is derived from census data (the Adult
database from the UCI machine-learning repository). The data consist of
census records for a number of individuals. The causal discovery task is to
uncover the socio-economic factors affecting high income (the target value
indicates whether the income exceeds 50K). In the CINA dataset, the 14
original attributes (features), including age, workclass, education, marital
status, occupation, and native country, were coded to eliminate categorical
variables. Here we provide the raw data. CINA also contains distractor
features (artificially generated variables that are not causes of the
target); these have not been included here.
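
To make the difference between raw and coded data concrete, the following is a minimal sketch of one way to binarize the income target and replace categorical attributes with 0/1 indicator columns. It is an illustration only: the exact coding used to build CINA is not described here, one-hot encoding is an assumption, the sample records are made up, and the function names `binarize_target` and `one_hot_encode` are hypothetical.

```python
# Hypothetical raw records. Field names follow the UCI Adult schema;
# the values are invented for illustration and are not taken from CINAR.
records = [
    {"age": 39, "workclass": "State-gov", "education": "Bachelors", "income": "<=50K"},
    {"age": 50, "workclass": "Private", "education": "Masters", "income": ">50K"},
    {"age": 28, "workclass": "Private", "education": "Bachelors", "income": ">50K"},
]


def binarize_target(record, target="income"):
    """Return 1 if the income label indicates more than 50K, else 0."""
    # Strip whitespace and a trailing period, in case a copy of the data
    # writes the label as ">50K." (an assumption about the file format).
    label = record[target].strip().rstrip(".")
    return 1 if label == ">50K" else 0


def one_hot_encode(records, categorical, target="income"):
    """Replace each categorical field with 0/1 indicator columns.

    This is one common coding scheme, assumed here for illustration;
    it is not necessarily the scheme used to produce CINA.
    """
    # Sort the levels of each field so the column order is deterministic.
    levels = {f: sorted({r[f] for r in records}) for f in categorical}
    encoded = []
    for r in records:
        row = {}
        for key, value in r.items():
            if key == target:
                continue  # the target is handled separately
            if key in levels:
                for level in levels[key]:
                    row[f"{key}={level}"] = 1 if value == level else 0
            else:
                row[key] = value  # numeric fields are kept unchanged
        encoded.append(row)
    return encoded


features = one_hot_encode(records, categorical=["workclass", "education"])
labels = [binarize_target(r) for r in records]
```

After this step every feature is numeric, which is the property the coded CINA dataset is described as having; the raw CINAR records keep the original categorical values.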
        
Download the data.
   |