A Resampling Univariate Analysis Approach to Ovarian Cancer from Clinical and Genetic Data
Abstract:
Ovarian cancer (OC) is the second most common gynecological malignancy and the gynecological tumor with the worst prognosis. To try to improve this situation, Data Science technologies could be a useful tool to help clinicians to know more about the disease. In our case, we are interested in exploring OC data to discover relationships between clinical and genetic factors and the disease progression. For it, we propose an analysis framework for simple and univariate statistical descriptions of features of different types, based on bootstrap resampling. Foremost, we define the framework for metric, categorical, and dates variables and determine what are the advantages and disadvantages of using different bootstrap resampling strategies, based on their statistical basis. Then, we use it to perform a univariate analysis over an OC dataset that allows to explore how is the disease progression, having platinum-free interval as indicator, in relation to clinical and genetic features of different types. Also, it provides a first set of variables possibly relevant for survival pbkp_rediction. Results obtained show that some features have led to individual differences between both platinum resistant (<6 months) and platinum sensitive(>6 months) groups. It can be concluded that this could be an indicator that the database could be discriminatory for the hypotheses studied, though it is convenient to make multivariate analyses to check how relationships among features are influenced.
Año de publicación:
2021
Keywords:
- univariate analysis
- genetic data
- Ovarian Cancer
- bootstrap resampling
- hypothesis test
- data science analytics
Fuente:
Tipo de documento:
Article
Estado:
Acceso abierto
Áreas de conocimiento:
- Cáncer
- Cáncer
Áreas temáticas:
- Enfermedades
- Bioquímica
- Medicina y salud