Cited 0 times in

Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation

DC Field Value Language
dc.contributor.author김경민-
dc.date.accessioned2025-02-03T09:27:14Z-
dc.date.available2025-02-03T09:27:14Z-
dc.date.issued2024-12-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/202480-
dc.description.abstractTo improve the accuracy and reliability of LiDAR semantic segmentation, previous studies have introduced multi-modal approaches that utilize additional modalities, such as 2D RGB images, to provide complementary information. However, these methods increase the cost of data collection, sensor hardware requirements, power consumption, and computational complexity. We observed that multi-modal approaches improve the semantic alignment of 3D representations. Motivated by this observation, we propose a pseudo multi-modal approach. To this end, we introduce a novel class-label-driven artificial 2D image construction method. By leveraging the close semantic alignment between image and text features of vision-language models, artificial 2D images are synthesized by arranging LiDAR class label text features. During training, the semantic information encoded in the artificial 2D images enriches the 3D features through knowledge distillation. The proposed method significantly reduces the burden of training data collection and facilitates more effective learning of semantic relationships in the 3D backbone network. Extensive experiments on two benchmark datasets demonstrate that the proposed method improves performance by 2.2-3.5 mIoU over the baseline using only LiDAR data, achieving performance comparable to that of real multi-modal approaches.-
dc.description.statementOfResponsibilityopen-
dc.languageEnglish-
dc.publisherMDPI-
dc.relation.isPartOfSENSORS-
dc.rightsCC BY-NC-ND 2.0 KR-
dc.titlePseudo Multi-Modal Approach to LiDAR Semantic Segmentation-
dc.typeArticle-
dc.contributor.collegeCollege of Medicine (의과대학)-
dc.contributor.departmentDept. of Neurology (신경과학교실)-
dc.contributor.googleauthorKyungmin Kim-
dc.identifier.doi10.3390/s24237840-
dc.contributor.localIdA05748-
dc.relation.journalcodeJ03219-
dc.identifier.eissn1424-8220-
dc.identifier.pmid39686377-
dc.subject.keywordLiDAR semantic segmentation-
dc.subject.keywordknowledge distillation-
dc.contributor.alternativeNameKim, Kyung Min-
dc.contributor.affiliatedAuthor김경민-
dc.citation.volume24-
dc.citation.number23-
dc.citation.startPage7840-
dc.identifier.bibliographicCitationSENSORS, Vol.24(23) : 7840, 2024-12-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Neurology (신경과학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.