Detecting Breast Cancer Using Visual ML
DOI:
https://doi.org/10.52783/jns.v14.1933Keywords:
Breast Cancer, Machine Learning, Orange, Visual ML, Wisconsin, CoimbraAbstract
Approximately 60% of Breast cancer patients are diagnosed in advanced stages. This paper examines the automation of identification of cancerous cells using visual machine learning approach. Results are obtained using two different datasets: Wisconsin and Coimbra. In Wisconsin dataset, predictors are extracted from the digitised image of a fine needle aspirate (FNA) of a breast mass. In Coimbra dataset, predictors are extracted from the blood analysis. Ten machine learning models are compared using a visual ML tool called Orange. Particular emphasis is placed on the metric “recall”. Recall is defined as the ability to catch malignant cases out of total malignant cases present in the dataset. Highest recall of 0.982 in Wisconsin dataset is achieved using Stochastic Gradient Descent (SGD). Highest recall of 0.793 in Coimbra dataset is achieved using Gradient Boost algorithm. This tool can be deployed in hospitals where initial detection may be done using blood analysis and then for confirmation with digitised image of breast mass. In countries like India, where there is a scarcity of cancer specialists, this tool would fasten the detection process. Patients would spend more time in treatment than in diagnosis.
Downloads
Metrics
References
Wang Lulu. Early diagnosis of breast cancer. Sensors (Basel, Switzerland). 2017; 17. https://doi.org/ 10.3390/s17071572 PMID: 28678153
Ghassemi Marzyeh, Naumann Tristan, Schulam Peter, Andrew L. Beam, Irene Y. Chen, and Rajesh Ranganath. Opportunities in machine learning for healthcare. arxiv.org. 2018.
Shah VI, Raju U, Chitale D, Deshpande V, Gregory N, Strand V. False-negative core needle biopsies of the breast: an analysis of clinical, radiologic, and pathologic findings in 27 consecutive cases of missed breast cancer. Cancer. 2003 Apr 15;97(8):1824-31. doi: 10.1002/cncr.11278. PMID: 12673707
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.