An analysis method for pbkp_redicting breast cancer using data science processes and machine learning


Abstract:

In two decades, the number of people with breast cancer has almost doubled: in 2000, about 10 million patients had the disease; by 2020, it had reached 19 million. It is estimated that one in five people today will develop some form of cancer in their lifetime. Studies suggest that the number of people diagnosed with cancer will increase in the coming years, being approximately 50% higher in 2040 than in 2020. This article provides an analysis method to pbkp_redict or diagnose breast cancer using data science processes and machine learning. The analysis method is structured into three phases. The first one is a data preparation phase, the second one is a pbkp_redictive analysis phase, and the last one is an evaluation metric. Therefore, the pbkp_redictions are experimented with machine learning techniques, which are: KNN, gradient boosting classifier, and random forest, for which evaluation metrics are presented with the next quality measures: Accuracy, Precision, Recall, and F1-Score. The dataset selected for this phase of analysis is Wisconsin breast cancer [1]. These data analysis techniques can be extended to other learning techniques and can also be used in future scientific work such as disease pbkp_rediction or medicine in general.

Año de publicación:

2022

Keywords:

    Fuente:

    scopusscopus
    googlegoogle

    Tipo de documento:

    Conference Object

    Estado:

    Acceso restringido

    Áreas de conocimiento:

    • Cáncer
    • Aprendizaje automático
    • Ciencias de la computación

    Áreas temáticas:

    • Enfermedades
    • Física aplicada
    • Métodos informáticos especiales