Please use this identifier to cite or link to this item:
http://223.31.159.10:8080/jspui/handle/123456789/1825Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Hamid, Fiza | - |
| dc.contributor.author | Mukherjee, Kanka | - |
| dc.contributor.author | Chaudhary, Sakshi | - |
| dc.contributor.author | Kaushik, Love | - |
| dc.contributor.author | Kumar, Shailesh | - |
| dc.date.accessioned | 2026-06-11T07:09:38Z | - |
| dc.date.available | 2026-06-11T07:09:38Z | - |
| dc.date.issued | 2026 | - |
| dc.identifier.citation | DNA Research, (In Press) | en_US |
| dc.identifier.issn | 1756-1663 | - |
| dc.identifier.other | https://doi.org/10.1093/dnares/dsag005 | - |
| dc.identifier.uri | https://academic.oup.com/dnaresearch/advance-article/doi/10.1093/dnares/dsag005/8704208?login=true | - |
| dc.identifier.uri | http://223.31.159.10:8080/jspui/handle/123456789/1825 | - |
| dc.description | Accepted date: 26 May 2026 | en_US |
| dc.description.abstract | Fusion genes play crucial roles in plant biological processes but remain far less explored than their human counterparts, largely due to limited validated datasets and the absence of plant-specific prediction tools. Existing approaches often produce high false-positive rates, restricting reliable discovery. To address this gap, we developed Plant Fusion Gene Predictor (PFGPred), an ensemble machine learning framework that integrates Random Forest, XGBoost, and long short-term memory (LSTM) models into a meta-classifier for accurate identification of true and false fusion genes from RNA-Seq data. PFGPred was trained on a high-confidence dataset of fusion genes validated by both RNA-Seq and whole-genome sequencing from Arabidopsis thaliana, Oryza sativa, Triticum aestivum, and Zea mays, to predict and rank candidate fusion genes for future functional validation. It outperformed individual baseline models, achieving accuracies of 0.97 on training data and 0.77 on independent test data. When evaluated on human datasets, it achieved 0.71 accuracy with lower sensitivity, reflecting biological differences between plant and human fusion events. Comparative analyses confirmed that PFGPred reliably identifies validated fusions, demonstrating its utility as a cost-effective, plant-specific prediction tool for high-throughput fusion gene screening and functional genomics research. It is freely available as a web server at http://www.nipgr.ac.in/PFGPred. | en_US |
| dc.description.sponsorship | The authors gratefully acknowledge the BRIC-National Institute of Plant Genome Research (NIPGR), New Delhi, for providing research support. The authors extend their gratitude to the DBT e-Library Consortium (DeLCON) for providing access to e-material and the Computational 8 Biology & Bioinformatics Facility (CBBF) of the NIPGR for their support. | en_US |
| dc.language.iso | en_US | en_US |
| dc.publisher | Oxford University Press | en_US |
| dc.subject | Fusion Transcripts | en_US |
| dc.subject | Gene Fusion | en_US |
| dc.subject | Machine Learning | en_US |
| dc.subject | Plant Fusion Gene | en_US |
| dc.subject | RNA Sequencing | en_US |
| dc.subject | Whole-Genome Sequencing | en_US |
| dc.title | PFGPred: A stack ensemble classifier for the identification of fusion genes in plants | en_US |
| dc.type | Article | en_US |
| Appears in Collections: | Institutional Publications | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Kumar Shai_2026_6.pdf Restricted Access | 714.71 kB | Adobe PDF | View/Open Request a copy |
Items in IR@NIPGR are protected by copyright, with all rights reserved, unless otherwise indicated.