[1] Barto, A. G. (1997). Reinforcement Learning. In Neural Systems for Control, O. Omidvar & D. L. Elliott (Eds.). Academic Press.
[2] Dempster, P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1).
[3] Dorigo, M., & Stitzle, T. (2004). Ant Colony Optimization, vol. 2. MIT Press.
[4] Dorigo, M., Maniezzo, V., & Colorni, A. (1996). Ant system: Optimization by a colony of cooper- ating agents. IEEE Transactions on Systems, Man, and Cybernetics: Part B (Cybernetics), 26(1).
[5] Dowlatshahi, M. B., & Hashemi, A. (2023). Unsupervised feature selection: A fuzzy multi-criteria decision-making approach. Iranian Journal of Fuzzy Systems.
[6] Fran;;ois-Lavet, V., Henderson, P., Islam, R., Bellemare, M. G., & Pineau, J. (2018). An Introduction to Deep Reinforcement Learning. Foundations and Trends® in Machine Learning, 11(3-4).
[7] Hashemi, A., & Dowlatshahi, M. B. (2023). A Fuzzy Integral Approach for Ensembling Unsuper- vised Feature Selection Algorithms. In 2023 28th International Computer Conference, Computer Society of Iran (CSICC), Tehran, Iran.
[8] Hashemi, A., Joodaki, M., Joodaki, N. Z., & Dowlatshahi, M. B. (2022). Ant colony optimization equipped with an ensemble of heuristics through multi-criteria decision making: A case study in ensemble feature selection. Applied Soft Computing, 124, 109046.
[9] Hashemi, A., Pajoohan, M. R., & Dowlatshahi, M. B. (2024). A Multi-Objective Optimization Ap- proach for Online Streaming Feature Selection Using Fuzzy Pareto Dominance. Journal of Mahani Mathematical Research Center, 13(1), 467.
[10] Hashemi, A., Dowlatshahi, M. B., & Nezamabadi-Pour, H. (2020). MFS-MCDM: Multilabel feature selection using multi-criteria decision making. Knowledge-Based Systems, 206, 106365.
[11] Hashemi, A., Dowlatshahi, M. B., & Nezamabadi-Pour, H. (2022). Ensemble of feature selection algorithms: A multi-criteria decision-making approach. International Journal of Machine Learning and Cybernetics, 13(1), 49-69.
[12] Hashemi, A., Dowlatshahi, M. B., & Nezamabadi-Pour, H. (2020). A bipartite matching-based fea- ture selection for multi-label learning. International Journal of Machine Learning and Cybernetics.
[13] Hashemi, A., & Dowlatshahi, M. B. (2024). Exploring Ant Colony Optimization for Feature Selec- tion: A Comprehensive Review. In N. Dey (Ed.), Applications of Ant Colony Optimization and its Variants, Springer Tracts in Nature-Inspired Computing. Springer, Singapore.
[14] Hashemi, A., & Dowlatshahi, M. B. (2022). An Ensemble Of Feature Selection Algorithms Using OWA Operator. In 2022 9th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS), Bam, Iran.
[15] Hashemi, A., Pajoohan, M. -R., & Dowlatshahi, M. B. (2022). Online streaming feature selection based on Sugeno fuzzy integral. In 2022 9th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS), Bam, Iran.
[16] Karimi, F., Dowlatshahi, M. B., & Hashemi, A. (2023). SemiACO: A semi-supervised feature se- lection based on ant colony optimization. Expert Systems with Applications, 214, 119130.
[17] Li, Y., et al. (2013). Parallel ant colony optimization on graphics processing units. IEEE Transactions on Evolutionary Computation, 17(5).
[18] Miri, M., Dowlatshahi, M. B., & Hashemi, A. (2022). Evaluation of multi-label feature selection for text classification using weighted Borda count approach. In 2022 9th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS), Bam, Iran.
[19] Paniri, M., Dowlatshahi, M. B., & Nezamabadipour, H. (2020). MLACO: A multi-label feature selection algorithm based on ant colony optimization. Knowledge-Based Systems, 192, 105285.
[20] Paniri, M., Dowlatshahi, M. B., & Nezamabadipour, H. (2021). Ant-TD: Ant colony optimization plus temporal difference reinforcement learning for multi-label feature selection. Swarm and Evolu- tionary Computation, 64, 100892.
[21] Pei, Y., Wang, W., & Zhang, S. (2012). Basic Ant Colony Optimization. In Proc. 2012 International Conference on Computer Science and Electronics Engineering (ICCSEE), Hangzhou, China.
[22] Sutton, R. S., & Barto, A. G. (2018). Reinforcement Learning: An Introduction, 2nd ed. Cambridge, MA, USA: A Bradford Book.
[23] Socha, K., & Dorigo, M. (2008). Ant colony optimization for continuous domains. European Journal of Operational Research, 185(3).
[24] Tsoumakas, G., Katakis, I., & Vlahavas, I. (2009). Mining multi-label data. In Data Mining and Knowledge Discovery Handbook.
[25] Tsoumakas, G., Spyromitros-Xioufis, E., Vilcek, J., & Vlahavas, I. (2011). Mulan: A Java Library for Multi-Label Learning. Journal of Machine Learning Research, 12.
[26] Wiering, M. A., & Van Hasselt, H. (2007). Two novel on-policy reinforcement learning algorithms based on TD (λ)-methods. In 2007 IEEE International Symposium on Approximate Dynamic Pro- gramming and Reinforcement Learning.
[27] Wu, Y., Ma, W., Miao, Q., & Wang, S. (2019). Multimodal continuous ant colony optimization for multisensor remote sensing image registration with local search. Swarm Evolutionary Computation.
[28] Zadeh, L. A. (2008). Is there a need for fuzzy logic? Information Sciences, 178(13), 2751-2779.