Journal of Bionic Engineering ›› 2024, Vol. 21 ›› Issue (1): 426-446.doi: 10.1007/s42235-023-00433-y

• • 上一篇    下一篇

An Improved Binary Quantum-based Avian Navigation Optimizer Algorithm to Seleect Effective Features from Medical Data: A COVID-19 Case Study

Ali Fatahi1,2; Mohammad H. Nadimi‑Shahraki1,2; Hoda Zamani1,2   

  1. 1 Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad 8514143131, Iran 2 Big Data Research Center, Najafabad Branch, Islamic Azad University, Najafabad 8514143131, Iran
  • 出版日期:2024-01-16 发布日期:2024-02-26
  • 通讯作者: Mohammad H. Nadimi-Shahraki E-mail:nadimi@iaun.ac.ir; nadimi@ieee.org
  • 作者简介:Ali Fatahi1,2; Mohammad H. Nadimi?Shahraki1,2; Hoda Zamani1,2

An Improved Binary Quantum-based Avian Navigation Optimizer Algorithm to Seleect Effective Features from Medical Data: A COVID-19 Case Study 

Ali Fatahi1,2; Mohammad H. Nadimi‑Shahraki1,2; Hoda Zamani1,2   

  1. 1 Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad 8514143131, Iran 2 Big Data Research Center, Najafabad Branch, Islamic Azad University, Najafabad 8514143131, Iran
  • Online:2024-01-16 Published:2024-02-26
  • Contact: Mohammad H. Nadimi-Shahraki E-mail:nadimi@iaun.ac.ir; nadimi@ieee.org
  • About author:Ali Fatahi1,2; Mohammad H. Nadimi?Shahraki1,2; Hoda Zamani1,2

摘要: eature Subset Selection (FSS) is an NP-hard problem to remove redundant and irrelevant features particularly from medical data, and it can be efectively addressed by metaheuristic algorithms. However, existing binary versions of metaheuristic algorithms have issues with convergence and lack an efective binarization method, resulting in suboptimal solutions that hinder diagnosis and prediction accuracy. This paper aims to propose an Improved Binary Quantum-based Avian Navigation Optimizer Algorithm (IBQANA) for FSS in medical data preprocessing to address the suboptimal solutions arising from binary versions of metaheuristic algorithms. The proposed IBQANA’s contributions include the Hybrid Binary Operator (HBO) and the Distance-based Binary Search Strategy (DBSS). HBO is designed to convert continuous values into binary solutions, even for values outside the [0, 1] range, ensuring accurate binary mapping. On the other hand, DBSS is a twophase search strategy that enhances the performance of inferior search agents and accelerates convergence. By combining exploration and exploitation phases based on an adaptive probability function, DBSS efectively avoids local optima. The efectiveness of applying HBO is compared with fve transfer function families and thresholding on 12 medical datasets, with feature numbers ranging from 8 to 10,509. IBQANA's efectiveness is evaluated regarding the accuracy, ftness, and selected features and compared with seven binary metaheuristic algorithms. Furthermore, IBQANA is utilized to detect COVID-19. The results reveal that the proposed IBQANA outperforms all comparative algorithms on COVID-19 and 11 other medical datasets. The proposed method presents a promising solution to the FSS problem in medical data preprocessing.

关键词: Feature subset selection , · Optimization , · Binary metaheuristic algorithms , · Bioinspired , · Machine learning , · Medical datasets

Abstract: eature Subset Selection (FSS) is an NP-hard problem to remove redundant and irrelevant features particularly from medical data, and it can be efectively addressed by metaheuristic algorithms. However, existing binary versions of metaheuristic algorithms have issues with convergence and lack an efective binarization method, resulting in suboptimal solutions that hinder diagnosis and prediction accuracy. This paper aims to propose an Improved Binary Quantum-based Avian Navigation Optimizer Algorithm (IBQANA) for FSS in medical data preprocessing to address the suboptimal solutions arising from binary versions of metaheuristic algorithms. The proposed IBQANA’s contributions include the Hybrid Binary Operator (HBO) and the Distance-based Binary Search Strategy (DBSS). HBO is designed to convert continuous values into binary solutions, even for values outside the [0, 1] range, ensuring accurate binary mapping. On the other hand, DBSS is a twophase search strategy that enhances the performance of inferior search agents and accelerates convergence. By combining exploration and exploitation phases based on an adaptive probability function, DBSS efectively avoids local optima. The efectiveness of applying HBO is compared with fve transfer function families and thresholding on 12 medical datasets, with feature numbers ranging from 8 to 10,509. IBQANA's efectiveness is evaluated regarding the accuracy, ftness, and selected features and compared with seven binary metaheuristic algorithms. Furthermore, IBQANA is utilized to detect COVID-19. The results reveal that the proposed IBQANA outperforms all comparative algorithms on COVID-19 and 11 other medical datasets. The proposed method presents a promising solution to the FSS problem in medical data preprocessing.

Key words: Feature subset selection , · Optimization , · Binary metaheuristic algorithms , · Bioinspired , · Machine learning , · Medical datasets