Cited 0 times in

FPGA-assisted Design Space Exploration of Parameterized AI Accelerators: A Quickloop Approach

DC Field Value Language
dc.contributor.author정재용-
dc.date.accessioned2025-02-03T09:07:59Z-
dc.date.available2025-02-03T09:07:59Z-
dc.date.issued2024-10-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/202217-
dc.description.abstractFPGAs facilitate prototyping and debug, and recently accelerate full-stack simulations due to their rapid turnaround time (TAT). However, this TAT is restrictive in exhaustive design space explorations of parameterized RTL generators, especially DNN accelerators that unleash an explosive full-stack search space. This paper presents Quickloop, an efficient and scalable framework to enable FPGA-accelerated exploration. Quickloop first abstracts away the cumbersome flow of RTL generation, software stack, FPGA toolflow, workload execution and metrics extraction by wrapping these stages into isolated Quicksteps, featuring cascadability, scalability, and replay. Then, we analytically minimize the FPGA toolflow TAT via a novel, data-driven strategy that intelligently utilizes build fragments from previous iterations, enhancing the loop efficiency and simultaneously lowering the toolflow’s compute utilization. Quickloop is built around the OpenAI Gym environment framework and thus supports drop-in regression and reinforcement learning explorations. With a Quickloop around a reference Berkeley’s Gemmini DNN accelerator, we exhaustively explore its parameter space and discover complex performance patterns, based on full-stack simulation of Imagenet benchmarks as a workload. Compared to conventional FPGA toolflow, we further show that Quickloop effectively reduces episodal time by above 30%, as the episode approaches realistic lengths.-
dc.description.statementOfResponsibilityrestriction-
dc.relation.isPartOfJournal of Systems Architecture-
dc.rightsCC BY-NC-ND 2.0 KR-
dc.titleFPGA-assisted Design Space Exploration of Parameterized AI Accelerators: A Quickloop Approach-
dc.typeArticle-
dc.contributor.collegeCollege of Medicine (의과대학)-
dc.contributor.departmentDept. of Pharmacology (약리학교실)-
dc.contributor.googleauthorKashif Inayat-
dc.contributor.googleauthorFahad Bin Muslim-
dc.contributor.googleauthorTayyeb Mahmood-
dc.contributor.googleauthorJaeyong Chung-
dc.identifier.doi10.1016/j.sysarc.2024.103260-
dc.contributor.localIdA03709-
dc.identifier.urlhttps://www.sciencedirect.com/science/article/pii/S1383762124001978-
dc.subject.keywordMachine learning-
dc.subject.keywordAccelerator-
dc.subject.keywordSystolic arrays-
dc.subject.keywordFPGA-
dc.subject.keywordExploration-
dc.contributor.alternativeNameChung, Jae Yong-
dc.contributor.affiliatedAuthor정재용-
dc.citation.volume155-
dc.citation.startPage103260-
dc.identifier.bibliographicCitationJournal of Systems Architecture, Vol.155 : 103260, 2024-10-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Pharmacology (약리학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.