![]() |
Halima Bensmail
Department of Statistics, Operations and Management Sciences |
Keywords:
Data mining and knowledge discovery; statistical tools for genomics and proteomics; Bayesian analysis; clustering and model-based cluster analysis; mixture modeling for continuous, mixed data and imputed data; multidimensional scaling, optimal scaling; classification and neural network
Research Area:
My main research interests focused on the Multivariate Linear and nonlinear data analysis, missing data, clustering, Bayesian analysis, bioinformatics, data mining and model selection. Therefore, the followings are the major components in my research:
1.Clustering Multivariate linear and nonlinear data.
2.Clustering of mixed data (mixture of categorical and quantitative variables) using Nonlinear multivariate transformation (Gifi transformation).
3.Bayesian clustering with missing data.
4.Bioinformatics: (i) Proteomics mining (ii) Genomics data analysis (iii) Finding homogenous groups for gene expression data using kernel estimation.
5.Genetic algorithm and its performance with model selection.
6.Analyzing large datasets using multi-scale clustering method.
7.Data analysis of expenditure data (health insurance).
8.Bayesian Threshold autoregressive model
Selected Publications:
- Haoudi, A, and Bensmail, H (2006). Bioinformatics and Data mining in Proteomics. Expert Review of Proteomics (In Press).
- Kwon, Y, Bensmail, H, and Bozdogan, H. Bayesian Analysis of Threshold Autoregressive Model with Information Complexity. “Economics Review” (In Press)
- Bensmail, H., Buddana A., Semmes O. J. and Haoudi (2005) “A.Functional Clustering Algorithm for High Dimensional Proteomics Data”. J Biomed Biotechnol, 2005(2), pp.80-6.
- Liu Z., Chen, D, Bensmail, H and Ying Xu (2005). Gene Expression Data Clustering with Kernel Principal Component Analysis. Journal of Bioinformatics and Computational Biology, Vol 3(2), pp. 303-316.
- Liu Z., Chen D., Bensmail, H., Reifman, J. and Xu, Y (2005) “Gene Expression Data Classification with Kernel Principal Component Analysis. Journal of Biomedicine and Biotechnology, 2, pp. 155—159.
- Bensmail H. Golek, A. Semmens, O. J. and Haoudi, A. (2005). Bayesian Fast-Fourier Transform Based Clustering Method for Proteomics Data. Bioinformatics, 21(10), pp. 2210-24.
- Bensmail, H and Haoudi, A (2003): Post-Genomics: Proteomics and Bioinformatics in Cancer Research. Journal of Biomedicine and Biotechnology, volume 4, pp. 217—230.
- Bensmail, H., Celeux, G., Raftery, A. & Robert, C. (1997). Inference in Model-Based Cluster Analysis. Journal of Computing and Statistics, 1, N10, pp.1-10.
- Bensmail, H. & Celeux, G. (1996). Regularized Discriminant Analysis. Journal of the American Statistical Association (JASA), Vol. 91, No 436, pp. 1743—1748.

