0 36

Cited 0 times in

RelCurator: a text mining-based curation system for extracting gene-phenotype relationships specific to neurodegenerative disorders

DC Field Value Language
dc.contributor.author김윤중-
dc.date.accessioned2024-03-22T06:11:14Z-
dc.date.available2024-03-22T06:11:14Z-
dc.date.issued2023-08-
dc.identifier.issn1976-9571-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/198444-
dc.description.abstractBackground The identification of gene–phenotype relationships is important in medical genetics as it serves as a basis for precision medicine. However, most of the gene-phenotype relationship data are buried in the biomedical literature in textual form. Objective We propose RelCurator, a curation system that extracts sentences including both gene and phenotype entities related to specific disease categories from PubMed articles, provides rich additional information such as entity taggings, and predictions of gene–phenotype relationships. Methods We targeted neurodegenerative disorders and developed a deep learning model using Bidirectional Gated Recurrent Unit (BiGRU) networks and BioWordVec word embeddings for predicting gene–phenotype relationships from biomedical texts. The prediction model is trained with more than 130,000 labeled PubMed sentences including gene and phenotype entities, which are related to or unrelated to neurodegenerative disorders. Results We compared the performance of our deep learning model with those of Bidirectional Encoder Representations from Transformers (BERT), Support Vector Machine (SVM), and simple Recurrent Neural Network (simple RNN) models. Our model performed better with an F1-score of 0.96. Furthermore, the evaluation done using a few curation cases in the real scenario showed the effectiveness of our work. Therefore, we conclude that RelCurator can identify not only new causative genes, but also new genes associated with neurodegenerative disorders’ phenotype. Conclusion RelCurator is a user-friendly method for accessing deep learning-based supporting information and a concise web interface to assist curators while browsing the PubMed articles. Our curation process represents an important and broadly applicable improvement to the state of the art for the curation of gene–phenotype relationships.-
dc.description.statementOfResponsibilityrestriction-
dc.languageEnglish-
dc.publisherGenetics Society of Korea-
dc.relation.isPartOfGENES & GENOMICS-
dc.rightsCC BY-NC-ND 2.0 KR-
dc.subject.MESHData Mining* / methods-
dc.subject.MESHHumans-
dc.subject.MESHNeural Networks, Computer-
dc.subject.MESHNeurodegenerative Diseases* / genetics-
dc.titleRelCurator: a text mining-based curation system for extracting gene-phenotype relationships specific to neurodegenerative disorders-
dc.typeArticle-
dc.contributor.collegeCollege of Medicine (의과대학)-
dc.contributor.departmentDept. of Neurology (신경과학교실)-
dc.contributor.googleauthorHeonwoo Lee-
dc.contributor.googleauthorJunbeom Jeon-
dc.contributor.googleauthorDawoon Jung-
dc.contributor.googleauthorJung-Im Won-
dc.contributor.googleauthorKiyong Kim-
dc.contributor.googleauthorYun Joong Kim-
dc.contributor.googleauthorJeehee Yoon-
dc.identifier.doi10.1007/s13258-023-01405-6-
dc.contributor.localIdA00796-
dc.relation.journalcodeJ00928-
dc.identifier.eissn2092-9293-
dc.identifier.pmid37300788-
dc.identifier.urlhttps://link.springer.com/article/10.1007/s13258-023-01405-6-
dc.subject.keywordCuration system-
dc.subject.keywordDeep learning-
dc.subject.keywordGene–phenotype relationship-
dc.subject.keywordNeurodegenerative disorders-
dc.contributor.alternativeNameKim, Yun Joong-
dc.contributor.affiliatedAuthor김윤중-
dc.citation.volume45-
dc.citation.number8-
dc.citation.startPage1025-
dc.citation.endPage1036-
dc.identifier.bibliographicCitationGENES & GENOMICS, Vol.45(8) : 1025-1036, 2023-08-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Neurology (신경과학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.