Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation

Kyungmin Kim

doi:10.3390/s24237840

YUHSpace

BROWSE

12 45

Cited 0 times in

Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation

DC Field	Value	Language
dc.contributor.author	김경민	-
dc.date.accessioned	2025-02-03T09:27:14Z	-
dc.date.available	2025-02-03T09:27:14Z	-
dc.date.issued	2024-12	-
dc.identifier.uri	https://ir.ymlib.yonsei.ac.kr/handle/22282913/202480	-
dc.description.abstract	To improve the accuracy and reliability of LiDAR semantic segmentation, previous studies have introduced multi-modal approaches that utilize additional modalities, such as 2D RGB images, to provide complementary information. However, these methods increase the cost of data collection, sensor hardware requirements, power consumption, and computational complexity. We observed that multi-modal approaches improve the semantic alignment of 3D representations. Motivated by this observation, we propose a pseudo multi-modal approach. To this end, we introduce a novel class-label-driven artificial 2D image construction method. By leveraging the close semantic alignment between image and text features of vision-language models, artificial 2D images are synthesized by arranging LiDAR class label text features. During training, the semantic information encoded in the artificial 2D images enriches the 3D features through knowledge distillation. The proposed method significantly reduces the burden of training data collection and facilitates more effective learning of semantic relationships in the 3D backbone network. Extensive experiments on two benchmark datasets demonstrate that the proposed method improves performance by 2.2-3.5 mIoU over the baseline using only LiDAR data, achieving performance comparable to that of real multi-modal approaches.	-
dc.description.statementOfResponsibility	open	-
dc.language	English	-
dc.publisher	MDPI	-
dc.relation.isPartOf	SENSORS	-
dc.rights	CC BY-NC-ND 2.0 KR	-
dc.title	Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation	-
dc.type	Article	-
dc.contributor.college	College of Medicine (의과대학)	-
dc.contributor.department	Dept. of Neurology (신경과학교실)	-
dc.contributor.googleauthor	Kyungmin Kim	-
dc.identifier.doi	10.3390/s24237840	-
dc.contributor.localId	A05748	-
dc.relation.journalcode	J03219	-
dc.identifier.eissn	1424-8220	-
dc.identifier.pmid	39686377	-
dc.subject.keyword	LiDAR semantic segmentation	-
dc.subject.keyword	knowledge distillation	-
dc.contributor.alternativeName	Kim, Kyung Min	-
dc.contributor.affiliatedAuthor	김경민	-
dc.citation.volume	24	-
dc.citation.number	23	-
dc.citation.startPage	7840	-
dc.identifier.bibliographicCitation	SENSORS, Vol.24(23) : 7840, 2024-12	-

Appears in Collections:: 1. College of Medicine (의과대학) > Dept. of Neurology (신경과학교실) > 1. Journal Papers

Show simple item record Find it @ YMLIB

License

YUHSpace: Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation

YUHSpace

BROWSE

Browse

Links