Repository containing the medical data sets used in a feature selection paper by the OASYS group

Medical_data

Goals ⚽

The aim of this repository is to provide details of the medical data sets used in paper [1]. The article was written by members of the OASYS group with funding from the Flexanalytics project. We suggest you visit the related links to learn more about our research 😉

How can I download the data? ⬇

Please click on this link.

Summary 🧮📊📖

The following table summarizes the main characteristics of the databases: the name, the number of individuals, the number of features, and the source from which each one was obtained:

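| Database   | Number of individuals | Number of features | Source                 |
|------------|-----------------------|--------------------|------------------------|
| breast     | 569                   | 30                 | Link                   |
| colorectal | 62                    | 2000               | Link                   |
| diabetes   | 768                   | 8                  | Link                   |
| lymphoma   | 96                    | 4026               | Authors of paper [2]   |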
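
The dimensions in the table can be used as a quick sanity check after downloading the data. The following is only a minimal sketch: the helper names `check_shape` and `features_per_individual` are hypothetical, and it assumes each data set has already been loaded as a sequence of rows (one per individual) with the class label removed. The file format and the loading step are not described here, so they are left out.

```python
# Expected (individuals, features), copied from the summary table.
EXPECTED_SHAPES = {
    "breast": (569, 30),
    "colorectal": (62, 2000),
    "diabetes": (768, 8),
    "lymphoma": (96, 4026),
}


def check_shape(name, rows):
    """Return True if `rows` matches the dimensions listed for data set `name`.

    Assumption of this sketch: `rows` is a sequence of per-individual
    feature sequences, with the class label already removed.
    """
    n_individuals, n_features = EXPECTED_SHAPES[name]
    if len(rows) != n_individuals:
        return False
    return all(len(row) == n_features for row in rows)


def features_per_individual(name):
    """Ratio of features to individuals; above 1 means more features than samples."""
    n_individuals, n_features = EXPECTED_SHAPES[name]
    return n_features / n_individuals


if __name__ == "__main__":
    for name in EXPECTED_SHAPES:
        ratio = features_per_individual(name)
        print(f"{name}: {ratio:.2f} features per individual")
```

The ratio makes the difference between the data sets visible: colorectal and lymphoma have far more features than individuals, whereas breast and diabetes have far fewer.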

References 📚

[1] Jiménez-Cordero, A., Morales, J.M., & Pineda, S. (2021). A novel embedded min-max approach for feature selection in nonlinear Support Vector Machine classification. European Journal of Operational Research, 293(1), 24-35.

[2] Maldonado, S., Weber, R., & Basak, J. (2011). Simultaneous feature selection and classification using kernel-penalized Support Vector Machines. Information Sciences, 181(1), 115-128.

[3] OASYS, Medical data, Github repository (https://github.com/groupoasys/Medical_data), 2020.

How to cite the repo and the paper? 📝

If you want to cite paper [1] or this repo [3], please use the following BibTeX entries:

  • Article:
@article{jimenez2021novel,
author={A. {Jim\'enez-Cordero} and J. M. {Morales} and S. {Pineda}},
journal={European Journal of Operational Research},
title={A novel embedded min-max approach for feature selection in nonlinear support vector machine classification},
year={2021},
issn={0377-2217},
volume={293},
number={1},
pages={24--35}}
  • Repository:
@article{OASYS2020medical,
author = {OASYS},
journal = {GitHub repository},
title = {{Medical Data}},
url = {https://github.com/groupoasys/Medical{\_}data},
year = {2020}
}

Do you want to contribute? 🙋‍♀️🙋‍♂️

Please do 😋 Any feedback is welcome 🤗, so feel free to ask questions or leave comments via a Pull Request in this repo. If you need extra help, you can contact Asunción Jiménez-Cordero (asuncionjc@uma.es), Juan Miguel Morales (juan.morales@uma.es) or Salvador Pineda (spinedamorente@gmail.com).

Contributors 🌬☀

Developed by 👩‍💻👨‍💻👨‍💻

License 📝

Copyright 2020 Optimization and Analytics for Sustainable energY Systems (OASYS)

Licensed under the GNU General Public License, Version 3 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   http://www.gnu.org/licenses/gpl-3.0.en.html

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.