PubMed İndeksli Yayınlar Koleksiyonu
Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/397
Browse
14 results
Search Results
Correction Correction: Engineering Novel Features for Diabetes Complication Prediction Using Synthetic Electronic Health Records(Frontiers Media S.A., 2025-08-29) Voskergian, Daniel; Bakir-Gungor, Burcu; Yousef, MalikArticle Citation - WoS: 20Citation - Scopus: 24miRdisNET: Discovering MicroRNA Biomarkers That Are Associated With Diseases Utilizing Biological Knowledge-Based Machine Learning(Frontiers Media S.A., 2023-01-12) Jabeer, Amhar; Temiz, Mustafa; Bakir-Gungor, Burcu; Yousef, MalikDuring recent years, biological experiments and increasing evidence have shown that MicroRNAs play an important role in the diagnosis and treatment of human complex diseases. Therefore, to diagnose and treat human complex diseases, it is necessary to reveal the associations between a specific disease and related miRNAs. Although current computational models based on machine learning attempt to determine miRNA-disease associations, the accuracy of these models need to be improved, and candidate miRNA-disease relations need to be evaluated from a biological perspective. In this paper, we propose a computational model named miRdisNET to predict potential miRNA-disease associations. Specifically, miRdisNET requires two types of data, i.e., miRNA expression profiles and known disease-miRNA associations as input files. First, we generate subsets of specific diseases by applying the grouping component. These subsets contain miRNA expressions with class labels associated with each specific disease. Then, we assign an importance score to each group by using a machine learning method for classification. Finally, we apply a modeling component and obtain outputs. One of the most important outputs of miRdisNET is the performance of miRNA-disease prediction. Compared with the existing methods, miRdisNET obtained the highest AUC value of .9998. Another output of miRdisNET is a list of significant miRNAs for disease under study. The miRNAs identified by miRdisNET are validated via referring to the gold-standard databases which hold information on experimentally verified MicroRNA-disease associations. miRdisNET has been developed to predict candidate miRNAs for new diseases, where miRNA-disease relation is not yet known. In addition, miRdisNET presents candidate disease-disease associations based on shared miRNA knowledge. The miRdisNET tool and other supplementary files are publicly available at: .Article Citation - WoS: 36Citation - Scopus: 35Trail Promotes the Polarization of Human Macrophages Toward a Proinflammatory M1 Phenotype and Is Associated With Increased Survival in Cancer Patients With High Tumor Macrophage Content(Frontiers Media S.A., 2023-09-21) Gunalp, Sinem; Helvaci, Derya Goksu; Oner, Aysenur; Bursali, Ahmet; Conforte, Alessandra; Guener, Hueseyin; Sag, Duygu; Güner, HüseyinBackgroundTNF-related apoptosis-inducing ligand (TRAIL) is a member of the TNF superfamily that can either induce cell death or activate survival pathways after binding to death receptors (DRs) DR4 or DR5. TRAIL is investigated as a therapeutic agent in clinical trials due to its selective toxicity to transformed cells. Macrophages can be polarized into pro-inflammatory/tumor-fighting M1 macrophages or anti-inflammatory/tumor-supportive M2 macrophages and an imbalance between M1 and M2 macrophages can promote diseases. Therefore, identifying modulators that regulate macrophage polarization is important to design effective macrophage-targeted immunotherapies. The impact of TRAIL on macrophage polarization is not known.MethodsPrimary human monocyte-derived macrophages were pre-treated with either TRAIL or with DR4 or DR5-specific ligands and then polarized into M1, M2a, or M2c phenotypes in vitro. The expression of M1 and M2 markers in macrophage subtypes was analyzed by RNA sequencing, qPCR, ELISA, and flow cytometry. Furthermore, the cytotoxicity of the macrophages against U937 AML tumor targets was assessed by flow cytometry. TCGA datasets were also analyzed to correlate TRAIL with M1/M2 markers, and the overall survival of cancer patients.ResultsTRAIL increased the expression of M1 markers at both mRNA and protein levels while decreasing the expression of M2 markers at the mRNA level in human macrophages. TRAIL also shifted M2 macrophages towards an M1 phenotype. Our data showed that both DR4 and DR5 death receptors play a role in macrophage polarization. Furthermore, TRAIL enhanced the cytotoxicity of macrophages against the AML cancer cells in vitro. Finally, TRAIL expression was positively correlated with increased expression of M1 markers in the tumors from ovarian and sarcoma cancer patients and longer overall survival in cases with high, but not low, tumor macrophage content.ConclusionsTRAIL promotes the polarization of human macrophages toward a proinflammatory M1 phenotype via both DR4 and DR5. Our study defines TRAIL as a new regulator of macrophage polarization and suggests that targeting DRs can enhance the anti-tumorigenic response of macrophages in the tumor microenvironment by increasing M1 polarization.Article Citation - WoS: 10Citation - Scopus: 15Textnettopics Pro, a Topic Model-Based Text Classification for Short Text by Integration of Semantic and Document-Topic Distribution Information(Frontiers Media S.A., 2023-10-05) Voskergian, Daniel; Bakir-Gungor, Burcu; Yousef, MalikWith the exponential growth in the daily publication of scientific articles, automatic classification and categorization can assist in assigning articles to a predefined category. Article titles are concise descriptions of the articles' content with valuable information that can be useful in document classification and categorization. However, shortness, data sparseness, limited word occurrences, and the inadequate contextual information of scientific document titles hinder the direct application of conventional text mining and machine learning algorithms on these short texts, making their classification a challenging task. This study firstly explores the performance of our earlier study, TextNetTopics on the short text. Secondly, here we propose an advanced version called TextNetTopics Pro, which is a novel short-text classification framework that utilizes a promising combination of lexical features organized in topics of words and topic distribution extracted by a topic model to alleviate the data-sparseness problem when classifying short texts. We evaluate our proposed approach using nine state-of-the-art short-text topic models on two publicly available datasets of scientific article titles as short-text documents. The first dataset is related to the Biomedical field, and the other one is related to Computer Science publications. Additionally, we comparatively evaluate the predictive performance of the models generated with and without using the abstracts. Finally, we demonstrate the robustness and effectiveness of the proposed approach in handling the imbalanced data, particularly in the classification of Drug-Induced Liver Injury articles as part of the CAMDA challenge. Taking advantage of the semantic information detected by topic models proved to be a reliable way to improve the overall performance of ML classifiers.Article Citation - WoS: 9Citation - Scopus: 15MicroBiomeGSM: The Identification of Taxonomic Biomarkers From Metagenomic Data Using Grouping, Scoring and Modeling (G-S-M) Approach(Frontiers Media S.A., 2023-11-22) Bakir-Gungor, Burcu; Temiz, Mustafa; Jabeer, Amhar; Wu, Di; Yousef, MalikNumerous biological environments have been characterized with the advent of metagenomic sequencing using next generation sequencing which lays out the relative abundance values of microbial taxa. Modeling the human microbiome using machine learning models has the potential to identify microbial biomarkers and aid in the diagnosis of a variety of diseases such as inflammatory bowel disease, diabetes, colorectal cancer, and many others. The goal of this study is to develop an effective classification model for the analysis of metagenomic datasets associated with different diseases. In this way, we aim to identify taxonomic biomarkers associated with these diseases and facilitate disease diagnosis. The microBiomeGSM tool presented in this work incorporates the pre-existing taxonomy information into a machine learning approach and challenges to solve the classification problem in metagenomics disease-associated datasets. Based on the G-S-M (Grouping-Scoring-Modeling) approach, species level information is used as features and classified by relating their taxonomic features at different levels, including genus, family, and order. Using four different disease associated metagenomics datasets, the performance of microBiomeGSM is comparatively evaluated with other feature selection methods such as Fast Correlation Based Filter (FCBF), Select K Best (SKB), Extreme Gradient Boosting (XGB), Conditional Mutual Information Maximization (CMIM), Maximum Likelihood and Minimum Redundancy (MRMR) and Information Gain (IG), also with other classifiers such as AdaBoost, Decision Tree, LogitBoost and Random Forest. microBiomeGSM achieved the highest results with an Area under the curve (AUC) value of 0.98% at the order taxonomic level for IBDMD dataset. Another significant output of microBiomeGSM is the list of taxonomic groups that are identified as important for the disease under study and the names of the species within these groups. The association between the detected species and the disease under investigation is confirmed by previous studies in the literature. The microBiomeGSM tool and other supplementary files are publicly available at: https://github.com/malikyousef/microBiomeGSM.Article Citation - WoS: 16Citation - Scopus: 20Invention of 3Mint for Feature Grouping and Scoring in Multi-Omics(Frontiers Media S.A., 2023-03-15) Yazici, Miray Unlu; Marron, J. S.; Bakir-Gungor, Burcu; Zou, Fei; Yousef, Malik; Unlu Yazici, MirayAdvanced genomic and molecular profiling technologies accelerated the enlightenment of the regulatory mechanisms behind cancer development and progression, and the targeted therapies in patients. Along this line, intense studies with immense amounts of biological information have boosted the discovery of molecular biomarkers. Cancer is one of the leading causes of death around the world in recent years. Elucidation of genomic and epigenetic factors in Breast Cancer (BRCA) can provide a roadmap to uncover the disease mechanisms. Accordingly, unraveling the possible systematic connections between-omics data types and their contribution to BRCA tumor progression is crucial. In this study, we have developed a novel machine learning (ML) based integrative approach for multi-omics data analysis. This integrative approach combines information from gene expression (mRNA), MicroRNA (miRNA) and methylation data. Due to the complexity of cancer, this integrated data is expected to improve the prediction, diagnosis and treatment of disease through patterns only available from the 3-way interactions between these 3-omics datasets. In addition, the proposed method bridges the interpretation gap between the disease mechanisms that drive onset and progression. Our fundamental contribution is the 3 Multi-omics integrative tool (3Mint). This tool aims to perform grouping and scoring of groups using biological knowledge. Another major goal is improved gene selection via detection of novel groups of cross-omics biomarkers. Performance of 3Mint is assessed using different metrics. Our computational performance evaluations showed that the 3Mint classifies the BRCA molecular subtypes with lower number of genes when compared to the miRcorrNet tool which uses miRNA and mRNA gene expression profiles in terms of similar performance metrics (95% Accuracy). The incorporation of methylation data in 3Mint yields a much more focused analysis. The 3Mint tool and all other supplementary files are available at .Article Citation - WoS: 16Citation - Scopus: 17Inhibition of Pathologic Immunoglobulin E in Food Allergy by EBF-2 and Active Compound Berberine Associated With Immunometabolism Regulation(Frontiers Media S.A., 2023-02-07) Yang, Nan; Maskey, Anish R.; Srivastava, Kamal; Kim, Monica; Wang, Zixi; Musa, Ibrahim; Li, Xiu-MinIntroductionFood allergy is a significant public health problem with limited treatment options. As Food Allergy Herbal Formula 2 (FAHF-2) showed potential as a food allergy treatment, we further developed a purified version named EBF-2 and identified active compounds. We investigated the mechanisms of EBF-2 on IgE-mediated peanut (PN) allergy and its active compound, berberine, on IgE production. MethodsIgE plasma cell line U266 cells were cultured with EBF-2 and FAHF-2, and their effects on IgE production were compared. EBF-2 was evaluated in a murine PN allergy model for its effect on PN-specific IgE production, number of IgE(+) plasma cells, and PN anaphylaxis. Effects of berberine on IgE production, the expression of transcription factors, and mitochondrial glucose metabolism in U266 cells were evaluated. ResultsEBF-2 dose-dependently suppressed IgE production and was over 16 times more potent than FAHF-2 in IgE suppression in U266 cells. EBF-2 significantly suppressed PN-specific IgE production (70%, p<0.001) and the number of IgE-producing plasma cells in PN allergic mice, accompanied by 100% inhibition of PN-induced anaphylaxis and plasma histamine release (p<0.001) without affecting IgG1 or IgG2a production. Berberine markedly suppressed IgE production, which was associated with suppression of XBP1, BLIMP1, and STAT6 transcription factors and a reduced rate of mitochondrial oxidation in an IgE-producing plasma cell line. ConclusionsEBF-2 and its active compound berberine are potent IgE suppressors, associated with cellular regulation of immunometabolism on IgE plasma cells, and may be a potential therapy for IgE-mediated food allergy and other allergic disorders.Article Citation - WoS: 15Citation - Scopus: 18Histopathology Image Classification: Highlighting the Gap Between Manual Analysis and AI Automation(Frontiers Media S.A., 2024-01-17) Dogan, Refika Sultan; Yilmaz, BulentThe field of histopathological image analysis has evolved significantly with the advent of digital pathology, leading to the development of automated models capable of classifying tissues and structures within diverse pathological images. Artificial intelligence algorithms, such as convolutional neural networks, have shown remarkable capabilities in pathology image analysis tasks, including tumor identification, metastasis detection, and patient prognosis assessment. However, traditional manual analysis methods have generally shown low accuracy in diagnosing colorectal cancer using histopathological images. This study investigates the use of AI in image classification and image analytics using histopathological images using the histogram of oriented gradients method. The study develops an AI-based architecture for image classification using histopathological images, aiming to achieve high performance with less complexity through specific parameters and layers. In this study, we investigate the complicated state of histopathological image classification, explicitly focusing on categorizing nine distinct tissue types. Our research used open-source multi-centered image datasets that included records of 100.000 non-overlapping images from 86 patients for training and 7180 non-overlapping images from 50 patients for testing. The study compares two distinct approaches, training artificial intelligence-based algorithms and manual machine learning models, to automate tissue classification. This research comprises two primary classification tasks: binary classification, distinguishing between normal and tumor tissues, and multi-classification, encompassing nine tissue types, including adipose, background, debris, stroma, lymphocytes, mucus, smooth muscle, normal colon mucosa, and tumor. Our findings show that artificial intelligence-based systems can achieve 0.91 and 0.97 accuracy in binary and multi-class classifications. In comparison, the histogram of directed gradient features and the Random Forest classifier achieved accuracy rates of 0.75 and 0.44 in binary and multi-class classifications, respectively. Our artificial intelligence-based methods are generalizable, allowing them to be integrated into histopathology diagnostics procedures and improve diagnostic accuracy and efficiency. The CNN model outperforms existing machine learning techniques, demonstrating its potential to improve the precision and effectiveness of histopathology image analysis. This research emphasizes the importance of maintaining data consistency and applying normalization methods during the data preparation stage for analysis. It particularly highlights the potential of artificial intelligence to assess histopathological images.Article Citation - WoS: 16Citation - Scopus: 21GeNetOntology: Identifying Affected Gene Ontology Terms via Grouping, Scoring, and Modeling of Gene Expression Data Utilizing Biological Knowledge-Based Machine Learning(Frontiers Media S.A., 2023-08-21) Ersoz, Nur Sebnem; Bakir-Gungor, Burcu; Yousef, MalikIntroduction: Identifying significant sets of genes that are up/downregulated under specific conditions is vital to understand disease development mechanisms at the molecular level. Along this line, in order to analyze transcriptomic data, several computational feature selection (i.e., gene selection) methods have been proposed. On the other hand, uncovering the core functions of the selected genes provides a deep understanding of diseases. In order to address this problem, biological domain knowledge-based feature selection methods have been proposed. Unlike computational gene selection approaches, these domain knowledge-based methods take the underlying biology into account and integrate knowledge from external biological resources. Gene Ontology (GO) is one such biological resource that provides ontology terms for defining the molecular function, cellular component, and biological process of the gene product.Methods: In this study, we developed a tool named GeNetOntology which performs GO-based feature selection for gene expression data analysis. In the proposed approach, the process of Grouping, Scoring, and Modeling (G-S-M) is used to identify significant GO terms. GO information has been used as the grouping information, which has been embedded into a machine learning (ML) algorithm to select informative ontology terms. The genes annotated with the selected ontology terms have been used in the training part to carry out the classification task of the ML model. The output is an important set of ontologies for the two-class classification task applied to gene expression data for a given phenotype.Results: Our approach has been tested on 11 different gene expression datasets, and the results showed that GeNetOntology successfully identified important disease-related ontology terms to be used in the classification model.Discussion: GeNetOntology will assist geneticists and scientists to identify a range of disease-related genes and ontologies in transcriptomic data analysis, and it will also help doctors design diagnosis platforms and improve patient treatment plans.Article Citation - WoS: 1Citation - Scopus: 3Engineering Novel Features for Diabetes Complication Prediction Using Synthetic Electronic Health Records(Frontiers Media S.A., 2025-04-14) Voskergian, Daniel; Bakir-Gungor, Burcu; Yousef, MalikDiabetes significantly affects millions of people worldwide, leading to substantial morbidity, disability, and mortality rates. Predicting diabetes-related complications from health records is crucial for early prevention and for the development of effective treatment plans. In order to predict four different complications of diabetes mellitus, i.e., retinopathy, chronic kidney disease, ischemic heart disease, and amputations, this study introduces a novel feature engineering approach. While developing the classification models, we utilize XGBoost feature selection method and various supervised machine learning algorithms, including Random Forest, XGBoost, LogitBoost, AdaBoost, and Decision Tree. These models were trained on synthetic electronic health records (EHR) generated by dual-adversarial autoencoders. These EHRs represent nearly 1 million synthetic patients derived from an authentic cohort of 979,308 individuals with diabetes. The variables considered in the models were the age range accompanied by chronic diseases that occur during patient visits starting from the onset of diabetes. Throughout the experiments, XGBoost and Random Forest demonstrated the best overall prediction performance. The final models, which are tailored to each complication and trained using our feature engineering approach, achieved an accuracy between 69% and 77% and an AUC between 77% and 84% using cross-validation, while the partitioned validation approach yielded an accuracy between 59% and 78% and an AUC between 66% and 85%. These findings imply that the performance of our method surpass the performance of the traditional Bag-of-Features approach, highlighting the effectiveness of our approach in enhancing model accuracy and robustness.
