HapForest

Citation:

X. Chen, C.-T. Liu, M. Zhang and H.P. Zhang. A forest-based approach to identifying gene and gene-gene interactions, PNAS, 104: 19199-19203, 2007.

To Perform an Analysis:

Apply the recursive classification tree program (rtree) using the individual SNPs as features and the disease status as the outcome. The description of the program and sample files are provided at our rtree page.
Construct haplotype blocks containing the SNPs identified via rtree using Haploview. Please refer to Haploview (https://www.broadinstitute.org/haploview/downloads) for more information.
Use SNPHAP to estimate the haplotype frequencies in the haplotype blocks identified in the previous step. Please see SNPHAP (https://gaow.github.io/genetic-analysis-software/s/snphap/) for details.
Use HapForest to identify haplotypes and haplotype-haplotype interactions associated with the disease. The program can be downloaded here.
1. Usage and Input files
  
  To invoke HapForest from the command line, enter java -jar toRun.jar response_file hap_file1 hap_file2 ... in the installation folder.
  
  The response_file specifies the response (disease status) of each subject, in which 1 stands for affected and 0 for unaffected. A sample response_file can be found here.
  
  Each of hap_file1, hap_file2, etc. corresponds to the haplotype configuration of one region, as output by SNPHAP. The subjects in these files should appear in the same order as in the response_file. A sample hap_file is provided here. The number of hap_files depends on the number of haplotype blocks identified in the previous steps.
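
  The invocation described above can be scripted. The following is only a rough sketch, not part of HapForest: the helper names read_response and build_command are hypothetical, and the sketch assumes the response_file holds one 0/1 status per whitespace-separated entry, which may not match the actual sample file. It checks the status values and assembles the documented command line without running it.

```python
from pathlib import Path


def read_response(path):
    """Read disease statuses from a response file.

    Assumption (not confirmed by the HapForest documentation): the file
    contains one status per whitespace-separated entry, 1 = affected,
    0 = unaffected.
    """
    tokens = Path(path).read_text().split()
    bad = [t for t in tokens if t not in ("0", "1")]
    if bad:
        raise ValueError(f"unexpected status values: {bad[:5]}")
    return [int(t) for t in tokens]


def build_command(response_file, hap_files):
    """Assemble: java -jar toRun.jar response_file hap_file1 hap_file2 ..."""
    if not hap_files:
        raise ValueError("at least one hap_file is required")
    return ["java", "-jar", "toRun.jar", str(response_file),
            *[str(h) for h in hap_files]]
```

  The resulting list could be passed to subprocess.run(..., cwd=installation_folder), with one hap_file per haplotype block. Checking that every hap_file lists the subjects in the same order as the response_file depends on the SNPHAP output layout and is left out here.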
  
  Options: