Colon Cancer: Intelligent Modeling of Risk Factors
Bouharati K1, Hamdi-Cherif M1, Mahnane A1, Belkhir R2, Laouamri S1, Bouharati S3*
1Laboratory of Health and Environment, UFAS, SetiflUniversity, Algeria
2Natural Science and Life, UFAS, Setif1University, Algeria
3Laboratory of Intelligent Systems, UFAS, Setif1University, Algeria
*Corresponding Author: Bouharati Khaoula, Laboratory of Health and Environment. Faculty of Medicine, Ferhat Abbas University, Setif1, Algeria
Received: 28 November 2018; Accepted: 10 December 2018; Published: 20 December 2018
Citation: Bouharati K, Hamdi-Cherif M, Mahnane A, Belkhir R, Laouamri S, Bouharati S. Colon Cancer: Intelligent Modeling of Risk Factors. Journal of Bioinformatics and Systems Biology 1 (2018): 015-021.View / Download Pdf
Several factors are involved in the development of colon cancer. These factors are characterized by complexity and uncertainty. The system is very complex to analyze using classical mathematical methods. Statistical analysis techniques are used. Due to the complexity of the system, these studies remain in the domain of the probable and the uncertain. A technique based on the principles of artificial intelligence is proposed. The principles of fuzzy inference are applied to the analysis of these factors. The proposed system makes it possible to take care of these incertitude’s inherent to the input variables. The result will be as accurate as possible. Just randomly set the input variables to instantly read the result at the output. This can be a tool to predict and prevent the occurrence of this type of cancer.
Fuzzy logic, Intelligent modeling, Colon cancer, Risk factors
Fuzzy logic articles, Intelligent modeling articles, Colon cancer articles, Risk factors articles
Several factors may be considered as risk factors for colon cancer. These factors include age. The incidence of colon cancer is increasing in young adults and adults over 50 years of age. The American Association against Cancer recommend colorectal cancer screening below age 45, although the high rate increases with age . Previous history of colorectal polyps can also constitute a risk factor. Studies have shown that the early diagnosis of colorectal cancers allows the identification of colonic polyps. Some patients will have an increased risk of developing this type of cancer. In general, the risk of colorectal cancer increases with age in both sexes. This type of cancer is recorded in half of diagnosed cases whose age exceeds 70 years whereas this rate is 10% among those fewer than 50 years . Other risk factor is the inflammatory bowel disease. Crohn's disease and ulcerative colitis may constitute a risk of colorectal cancer. This risk increases with time and the anatomical extent of colitis as well as the degree of inflammation. When colitis settles over time, it can be an aggravating factor in the development of colorectal cancer . There is no doubt that the risk of colorectal cancer is directly associated with family ties, it is also demonstrated in families of patients with colorectal cancers Several factors may be the cause of colorectal cancer, however, demographic factors (implying genetic factors) and dietary factors largely explain mortality by this cancer, although this remains poorly explained , and family history or a personal history of ovarian, endometrial or breast cancer. Studies have shown that family history is directly associated with the risk of colon cancer . Other studies have analyzed the relationship between colon cancer in a person and the incidence in parents and siblings . Other factors may constitute risk factors. The weight of the effect of each factor is often misunderstood. The physiological and metabolic system is very complex and varies from one person to another. The modelling of such risk factors is very difficult using classical mathematical techniques. Due to the complex nature of these factors, this study proposes the application of the principles of fuzzy inference. Fuzzy logic deals with uncertainty and where data are missing or incomplete . The proposed system considers some factors measured during the period 2006 to 2014 by the National Cancer Registry of Setif in Algeria. The result will be the prediction of the appearance of this type of cancer from the values at the input of the system. As the analysis takes into account the uncertainties associated with the input variables and the inaccuracy due to the factors ignored and that have their effect, the output result will be as accurate as possible.
2. Materials and Methods
Recent techniques in computer science and artificial intelligence tend to imitate human reasoning. This is the foundation of advanced modelling [8-9]. Among these techniques, we use of the theory of fuzzy sets. The advantage of fuzzy logic is that it easily solves non-linear systems. Human expertise is translated into inference rules linking the input variables to the output variable. In recent times, the use of fuzzy analysis systems has become widespread in various fields, notably in the medical field . In our case, this type of cancer is increase in people aged 45 years and older. This can be explained by the early detection and adoption of western lifestyle like physical inactivity and the richness of foods in fat . This does not preclude other factors that have their effects and are ignored. This amply justifies the use of this analysis technique.
2.1 Diagram of designed fuzzy system
The proposed system consists of three input variables, a module of fuzzyfication, a database, a defuzzyfier module and an output Figure 1.
2.2 Fuzzyfication of registered input and output data
2.2.1 Input data:
The variable “Age” expressed in numeric term must be fuzzyfied into linguistic term. For this, three triangular membership functions are created. Fuzzy intervals between two neighboring functions allow the uncertainties associated with this representation to be taken care of. The functions represent: young (0-30 years); Adult (25-55 years); Old (50 years and over). The variable “Sex” expressed in numeric term is not fuzzyfied. We assign a value (1) to the male sex and the value (2) to the female sex. The variable “Period” expressed in numeric term must be fuzzyfied. For this, three triangular membership functions are created. Fuzzy intervals between two neighboring functions allow the uncertainties associated with this representation to be taken care of. The functions represents the years between 2006 and 2014. (Table 1).
Table 1: Incidence per 100.000 by age group (period) of Colon cancer (2006-2014).
2.2.2 Output data: The variable “Incidence” expressed in numeric term must be fuzzyfied into linguistic term. For this, three triangular membership functions are created. Fuzzy intervals between two neighbouring functions allow the uncertainties associated with this representation to be taken care of. We assign a value (0-40) to the low incidence, (30-70) to the average incidence and (60-100) to high incidence with reference to the recorded incidence values Table 1.
2.3 Fuzzy rules database
In general, the expression of a rule is of the form (IF ... THEN)
IF (Antecedents)...THAN (Consequence).
By referring to registered values, we assign the set of rules in this form.
IF the “Age” is young AND the “Gender” is female AND the “Year” is 2010, THEN the “Incidence” is low.
The rule base must contain all possible combinations.
2.4 Inference module
When the basis of the rules is established by reference to the measured values and the intervention of the human expert, this has made it possible to establish an intelligent application in which it is possible to predict the incidence of colon cancer. The inference of the system makes all these rules co-operate and make the correspondence between them and the output result as a consequence. The result obtained takes into account even the fuzzy intervals created between the neighboring membership functions.
The result is to simply introduce randomly the values at the input of the system; and to read instantly the incidence at the output. Example at the Figure 2.
Many studies have been devoted to analyzing the data. The tools used are often classical mathematical tools such as differential equations or statistical analyzes. The finding is that it is very difficult to encompass all the variables in a mathematical equation which makes these tools very heavy and sometimes impossible to solve. The statistical tools are still in the probable with uncertainties. This study circumvents these difficulties by using an artificial intelligence tool with fuzzy logic. The variables involved in the process are considered uncertain and expressed in linguistic terms. By this, the uncertainties are compensated. Each variable is represented by a triangular membership function over an interval ranging levels. Linguistic variables are assigned to intervals. Note that neighboring intervals overlap in fuzzy intervals. By this, uncertainties are taken care of. The output variable is linked to the input variables to consider all possible combinations. Once the system is established, it gives the possibility to introduce random variables at the input to instantly read the result at the output. This result comes from the collaboration of all the rules that link the inputs to the output.
In view of the complexity of the analyzed system, the technique used can be a tool to help prevent the onset of colon cancer. This prediction is translated into a decision model from the introduction of the input variables. Simple to use, this system supports all the complexity and inaccuracy of data. The data processed are actual values recorded during a period that may provide a basis for future years. Compensation for data deficiencies is ensured by fuzzyfication of the input and output parameters. Moreover, the intervention of human expertise is necessary in the establishment of rules of inference to overcome any insufficiency in the analysis of the data. Whatever the reliability of the results obtained, this does not prevent the use of other factors which are not considered in this study and which have an effect on colon cancer. For this, the proposed system remains extensible to the other parameters. This study can then constitute a contribution and a basis for future more global studies and can be a tool for the prediction and prevention of colon cancer.
Declaration of Interest
The authors declare that no conflict of interest.
All our gratitude is in the staff service of epidemiology of the Faculty of Medicine of the University of Setif1 in Algeria.
- Owen Dyer. Colorectal cancer: US guidelines urge screening from age 45 as incidence soars in younger adults. BMJ 361 (2018): 2452.
- Kolligs FT. Diagnostics and Epidemiology of Colorectal Cancer. Visc Med 32 (2016): 158-164.
- Yoshino T, Nakase H, Takagi T, et al. Risk factors for developing colorectal cancer in Japanese patients with ulcerative colitis: a retrospective observational study-CAPITAL (Cohort and Practice for IBD total management in Kyoto-Shiga Links) study I. BMJ Open Gastro 3 (2016): 1-8.
- Shaukat A, Dostal A, Menk J, et al. BMI Is a Risk Factor for Colorectal Cancer Mortality. Digestive Diseases and Sciences 62 (2017): 2511-2517.
- Zanghieri G, Gregorio C, Sacchetti C, et al. Familial occurrence of gastric cancer in the 2-year experience of a population-based registry. Cancer 66 (1990): 2047-2051.
- Sidney JW, Ann GZ, Hans G, et al. and the National Polyp Study Workgroup. Risk of colorectal cancer in the families of patients with adenomatous polyps. N Engl J Med 334 (1996): 82-87.
- Bouharati I, Babouche F, Bouharati K, et al. Brain tumor identification from MRIimaging using fuzzy inference. Radiol Diagn Imaging 2 (2018): 1-4.
- Dumitrache I, Constantin N, Dragoicea M. Reteleneurale (Neural networks),Matrix Press (1999).
- Dumitrache I, Buiu C. Algoritmi genetici. Principii fundamental esiaplicatii in automatic (Genetic algorithms. Fundamental principles and applications in automatics), Mediamira Press (2000).
- Javad H, Reza J, Ayoob N. Fuzzy Rule Based Diagnostic System For Detecting The Lung Cancer Disease. Journal of Renewable Natural Resources 3 (2015): 147-157.
- Mokhtar HC, Diego S, Abbes M, et al. Time trends of cancer incidence in Setif, Algeria, 1986-2010: An observational study.BMC Cancer 14 (2014): 637.