Cited 0 times in
Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 김경민 | - |
dc.date.accessioned | 2025-02-03T09:27:14Z | - |
dc.date.available | 2025-02-03T09:27:14Z | - |
dc.date.issued | 2024-12 | - |
dc.identifier.uri | https://ir.ymlib.yonsei.ac.kr/handle/22282913/202480 | - |
dc.description.abstract | To improve the accuracy and reliability of LiDAR semantic segmentation, previous studies have introduced multi-modal approaches that utilize additional modalities, such as 2D RGB images, to provide complementary information. However, these methods increase the cost of data collection, sensor hardware requirements, power consumption, and computational complexity. We observed that multi-modal approaches improve the semantic alignment of 3D representations. Motivated by this observation, we propose a pseudo multi-modal approach. To this end, we introduce a novel class-label-driven artificial 2D image construction method. By leveraging the close semantic alignment between image and text features of vision-language models, artificial 2D images are synthesized by arranging LiDAR class label text features. During training, the semantic information encoded in the artificial 2D images enriches the 3D features through knowledge distillation. The proposed method significantly reduces the burden of training data collection and facilitates more effective learning of semantic relationships in the 3D backbone network. Extensive experiments on two benchmark datasets demonstrate that the proposed method improves performance by 2.2-3.5 mIoU over the baseline using only LiDAR data, achieving performance comparable to that of real multi-modal approaches. | - |
dc.description.statementOfResponsibility | open | - |
dc.language | English | - |
dc.publisher | MDPI | - |
dc.relation.isPartOf | SENSORS | - |
dc.rights | CC BY-NC-ND 2.0 KR | - |
dc.title | Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation | - |
dc.type | Article | - |
dc.contributor.college | College of Medicine (의과대학) | - |
dc.contributor.department | Dept. of Neurology (신경과학교실) | - |
dc.contributor.googleauthor | Kyungmin Kim | - |
dc.identifier.doi | 10.3390/s24237840 | - |
dc.contributor.localId | A05748 | - |
dc.relation.journalcode | J03219 | - |
dc.identifier.eissn | 1424-8220 | - |
dc.identifier.pmid | 39686377 | - |
dc.subject.keyword | LiDAR semantic segmentation | - |
dc.subject.keyword | knowledge distillation | - |
dc.contributor.alternativeName | Kim, Kyung Min | - |
dc.contributor.affiliatedAuthor | 김경민 | - |
dc.citation.volume | 24 | - |
dc.citation.number | 23 | - |
dc.citation.startPage | 7840 | - |
dc.identifier.bibliographicCitation | SENSORS, Vol.24(23) : 7840, 2024-12 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.