The probabilistic problem of cross-calibration of two categorical variables is addressed. A probabilistic forecast of the categorical variables is obtained based on a sample of observed data. This forecast is the output of a genetic algorithm based approach, which makes no assumption on the type of relationship between the two variables and applies a scoring rule to assess the fitness of the chromosomes. It converges to a good-quality point probability forecast of the joint distribution of the two variables. The proposed approach is applied both at stationary points in time and across time. Its performance is enhanced when additional sampled data is included, and can be designed with different scoring rules or made to account for missing data.
Aboukhamseen, S. M. & M'Hallah, R. A. (2017). Genetic algorithms for cross-calibration of categorical data. Journal of Modern Applied Statistical Methods, 16(1), 722-742. doi: 10.22237/jmasm/1493599080