A new algorithm to optimize maximal information coefficient
The maximal information coefficient (MIC) captures dependences between paired variables,
including both functional and non-functional relationships. In this paper, we develop a new
method, ChiMIC, to calculate the MIC values. The ChiMIC algorithm uses the chi-square test
to terminate grid optimization and then removes the restriction of maximal grid size limitation
of original ApproxMaxMI algorithm. Computational experiments show that ChiMIC algorithm
can maintain same MIC values for noiseless functional relationships, but gives much smaller …
including both functional and non-functional relationships. In this paper, we develop a new
method, ChiMIC, to calculate the MIC values. The ChiMIC algorithm uses the chi-square test
to terminate grid optimization and then removes the restriction of maximal grid size limitation
of original ApproxMaxMI algorithm. Computational experiments show that ChiMIC algorithm
can maintain same MIC values for noiseless functional relationships, but gives much smaller …
The maximal information coefficient (MIC) captures dependences between paired variables, including both functional and non-functional relationships. In this paper, we develop a new method, ChiMIC, to calculate the MIC values. The ChiMIC algorithm uses the chi-square test to terminate grid optimization and then removes the restriction of maximal grid size limitation of original ApproxMaxMI algorithm. Computational experiments show that ChiMIC algorithm can maintain same MIC values for noiseless functional relationships, but gives much smaller MIC values for independent variables. For noise functional relationship, the ChiMIC algorithm can reach the optimal partition much faster. Furthermore, the MCN values based on MIC calculated by ChiMIC can capture the complexity of functional relationships in a better way, and the statistical powers of MIC calculated by ChiMIC are higher than those calculated by ApproxMaxMI. Moreover, the computational costs of ChiMIC are much less than those of ApproxMaxMI. We apply the MIC values tofeature selection and obtain better classification accuracy using features selected by the MIC values from ChiMIC.
PLOS