164 287

Cited 0 times in

A Framework (SOCRATex) for Hierarchical Annotation of Unstructured Electronic Health Records and Integration Into a Standardized Medical Database: Development and Usability Study

DC Field Value Language
dc.contributor.author유승찬-
dc.date.accessioned2021-12-28T16:57:48Z-
dc.date.available2021-12-28T16:57:48Z-
dc.date.issued2021-03-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/186872-
dc.description.abstractBackground: Although electronic health records (EHRs) have been widely used in secondary assessments, clinical documents are relatively less utilized owing to the lack of standardized clinical text frameworks across different institutions. Objective: This study aimed to develop a framework for processing unstructured clinical documents of EHRs and integration with standardized structured data. Methods: We developed a framework known as Staged Optimization of Curation, Regularization, and Annotation of clinical text (SOCRATex). SOCRATex has the following four aspects: (1) extracting clinical notes for the target population and preprocessing the data, (2) defining the annotation schema with a hierarchical structure, (3) performing document-level hierarchical annotation using the annotation schema, and (4) indexing annotations for a search engine system. To test the usability of the proposed framework, proof-of-concept studies were performed on EHRs. We defined three distinctive patient groups and extracted their clinical documents (ie, pathology reports, radiology reports, and admission notes). The documents were annotated and integrated into the Observational Medical Outcomes Partnership (OMOP)-common data model (CDM) database. The annotations were used for creating Cox proportional hazard models with different settings of clinical analyses to measure (1) all-cause mortality, (2) thyroid cancer recurrence, and (3) 30-day hospital readmission. Results: Overall, 1055 clinical documents of 953 patients were extracted and annotated using the defined annotation schemas. The generated annotations were indexed into an unstructured textual data repository. Using the annotations of pathology reports, we identified that node metastasis and lymphovascular tumor invasion were associated with all-cause mortality among colon and rectum cancer patients (both P=.02). The other analyses involving measuring thyroid cancer recurrence using radiology reports and 30-day hospital readmission using admission notes in depressive disorder patients also showed results consistent with previous findings. Conclusions: We propose a framework for hierarchical annotation of textual data and integration into a standardized OMOP-CDM medical database. The proof-of-concept studies demonstrated that our framework can effectively process and integrate diverse clinical documents with standardized structured data for clinical research.-
dc.description.statementOfResponsibilityopen-
dc.languageEnglish-
dc.publisherJMIR Publications-
dc.relation.isPartOfJMIR MEDICAL INFORMATICS-
dc.rightsCC BY-NC-ND 2.0 KR-
dc.titleA Framework (SOCRATex) for Hierarchical Annotation of Unstructured Electronic Health Records and Integration Into a Standardized Medical Database: Development and Usability Study-
dc.typeArticle-
dc.contributor.collegeCollege of Medicine (의과대학)-
dc.contributor.departmentDept. of Preventive Medicine and Public Health (예방의학교실)-
dc.contributor.googleauthorJimyung Park-
dc.contributor.googleauthorSeng Chan You-
dc.contributor.googleauthorEugene Jeong-
dc.contributor.googleauthorChunhua Weng-
dc.contributor.googleauthorDongsu Park-
dc.contributor.googleauthorJin Roh-
dc.contributor.googleauthorDong Yun Lee-
dc.contributor.googleauthorJae Youn Cheong-
dc.contributor.googleauthorJin Wook Choi-
dc.contributor.googleauthorMira Kang-
dc.contributor.googleauthorRae Woong Park-
dc.identifier.doi10.2196/23983-
dc.contributor.localIdA02478-
dc.relation.journalcodeJ03664-
dc.identifier.eissn2291-9694-
dc.identifier.pmid33783361-
dc.subject.keywordcommon data model-
dc.subject.keyworddata curation-
dc.subject.keyworddata management-
dc.subject.keywordnatural language processing-
dc.subject.keywordsearch engine-
dc.contributor.alternativeNameYou, Seng Chan-
dc.contributor.affiliatedAuthor유승찬-
dc.citation.volume9-
dc.citation.number3-
dc.citation.startPagee23983-
dc.identifier.bibliographicCitationJMIR MEDICAL INFORMATICS, Vol.9(3) : e23983, 2021-03-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Preventive Medicine (예방의학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.