dc.contributor.author: Bakir-Gungor, Burcu
dc.contributor.author: Ersoz, Nur Sebnem
dc.contributor.author: Yousef, Malik
dc.date.accessioned: 2025-06-17T08:27:12Z
dc.date.available: 2025-06-17T08:27:12Z
dc.date.issued: 2025 [en_US]
dc.identifier.issn: 2076-3417
dc.identifier.uri: https://doi.org/10.3390/app15062940
dc.identifier.uri: https://hdl.handle.net/20.500.12573/2540
dc.description.abstract: Advances in metagenomics have revolutionized our ability to elucidate links between the microbiome and human diseases. Colorectal cancer (CRC), a leading cause of cancer-related mortality worldwide, has been associated with dysbiosis of the gut microbiome. This study aims to develop a method for identifying CRC-associated microbial enzymes by incorporating biological domain knowledge into the feature selection process. Conventional feature selection techniques often evaluate features individually and fail to leverage biological knowledge during metagenomic data analysis. To address this gap, we propose the enzyme commission (EC)-nomenclature-based Grouping-Scoring-Modeling (G-S-M) method, which integrates biological domain knowledge into feature grouping and selection. The proposed method was tested on a CRC-associated metagenomic dataset collected from eight different countries. Community-level relative abundance values of enzymes were considered as features and grouped based on their EC categories to provide biologically informed groupings. Our findings in randomized 10-fold cross-validation experiments imply that glycosidases, CoA-transferases, hydro-lyases, oligo-1,6-glucosidase, crotonobetainyl-CoA hydratase, and citrate CoA-transferase enzymes can be associated with CRC development as part of different molecular pathways. These enzymes are mostly synthesized by Escherichia coli, Salmonella enterica, Klebsiella pneumoniae, Staphylococcus aureus, Streptococcus pneumoniae, and Clostridioides difficile. Comparative evaluation experiments showed that the proposed model consistently outperforms traditional feature selection methods paired with various classifiers. [en_US]
dc.description.sponsorship: We would like to thank The Scientific and Technological Research Council of Türkiye (TÜBİTAK) 2211A BIDEP program for supporting the work of N.S.E. The work of B.B.-G. has also been supported by the Abdullah Gul University Support Foundation (AGUV). B.B.-G. would like to express her gratitude for the L’Oréal-UNESCO Young Women Scientist Award. This research was made possible by the support of the L’Oréal-UNESCO Young Women Scientist Program. The work of M.Y. has been supported by Zefat Academic College. [en_US]
dc.language.iso: eng [en_US]
dc.publisher: MDPI [en_US]
dc.relation.isversionof: 10.3390/app15062940 [en_US]
dc.rights: info:eu-repo/semantics/openAccess [en_US]
dc.subject: Metagenomic analysis of colorectal cancer [en_US]
dc.subject: Machine learning [en_US]
dc.subject: Feature grouping [en_US]
dc.subject: Functional profiling of metagenomes [en_US]
dc.subject: Community-level enzyme commission (EC) abundances [en_US]
dc.title: Integrating Biological Domain Knowledge with Machine Learning for Identifying Colorectal-Cancer-Associated Microbial Enzymes in Metagenomic Data [en_US]
dc.type: article [en_US]
dc.contributor.department: AGÜ, Faculty of Life and Natural Sciences, Department of Molecular Biology and Genetics [en_US]
dc.contributor.authorID: 0000-0002-2272-6270 [en_US]
dc.contributor.institutionauthor: Bakir-Gungor, Burcu
dc.contributor.institutionauthor: Ersoz, Nur Sebnem
dc.identifier.volume: 15 [en_US]
dc.identifier.issue: 6 [en_US]
dc.identifier.startpage: 1 [en_US]
dc.identifier.endpage: 37 [en_US]
dc.relation.journal: APPLIED SCIENCES-BASEL [en_US]
dc.relation.tubitak: 2211A BIDEP
dc.relation.publicationcategory: Article - International Peer-Reviewed Journal - Institutional Faculty Member [en_US]
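
The abstract states that enzyme abundance features are grouped by EC category and that groups are then scored. A minimal sketch of that idea follows, assuming groups are formed from the first three levels of the EC number (so 3.2.1.10, oligo-1,6-glucosidase, falls under 3.2.1, glycosidases). The function names, the toy abundance values, and the scoring rule are all hypothetical: the record does not say how G-S-M scores groups, so a simple between-class difference in mean summed abundance stands in here, and the modeling step is not shown.

```python
from collections import defaultdict
from statistics import mean


def ec_group(ec_number: str, depth: int = 3) -> str:
    """Return the first `depth` levels of an EC number, e.g. '3.2.1.10' -> '3.2.1'."""
    return ".".join(ec_number.split(".")[:depth])


def group_features(ec_numbers, depth: int = 3) -> dict:
    """Map each EC prefix to the list of EC numbers (features) that share it."""
    groups = defaultdict(list)
    for ec in ec_numbers:
        groups[ec_group(ec, depth)].append(ec)
    return dict(groups)


def score_group(samples, labels, members) -> float:
    """Stand-in score (an assumption, not the paper's method): absolute
    difference between classes in the mean summed abundance of the group."""
    totals = {0: [], 1: []}
    for sample, label in zip(samples, labels):
        totals[label].append(sum(sample.get(ec, 0.0) for ec in members))
    return abs(mean(totals[1]) - mean(totals[0]))


# Toy relative abundances, invented purely for illustration (1 = CRC, 0 = control).
samples = [
    {"3.2.1.10": 0.30, "3.2.1.23": 0.10, "2.8.3.10": 0.05, "4.2.1.149": 0.02},
    {"3.2.1.10": 0.28, "3.2.1.23": 0.12, "2.8.3.10": 0.04, "4.2.1.149": 0.03},
    {"3.2.1.10": 0.10, "3.2.1.23": 0.05, "2.8.3.10": 0.06, "4.2.1.149": 0.02},
    {"3.2.1.10": 0.12, "3.2.1.23": 0.03, "2.8.3.10": 0.05, "4.2.1.149": 0.03},
]
labels = [1, 1, 0, 0]

groups = group_features(samples[0].keys())
ranked = sorted(
    groups,
    key=lambda g: score_group(samples, labels, groups[g]),
    reverse=True,
)
print(ranked)
```

Grouping before scoring means that several weakly informative enzymes from the same EC category are evaluated together rather than one feature at a time, which is the gap in conventional feature selection that the abstract points to.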

