BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/Chicago
X-LIC-LOCATION:America/Chicago
BEGIN:DAYLIGHT
TZOFFSETFROM:-0600
TZOFFSETTO:-0500
TZNAME:CDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0500
TZOFFSETTO:-0600
TZNAME:CST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20230124T171527Z
LOCATION:D220
DTSTART;TZID=America/Chicago:20221113T113000
DTEND;TZID=America/Chicago:20221113T120000
UID:submissions.supercomputing.org_SC22_sess430_ws_llvmf105@linklings.com
SUMMARY:Direct GPU Compilation and Execution for Host Applications with Op
 enMP Parallelism
DESCRIPTION:Workshop\n\nDirect GPU Compilation and Execution for Host Appl
 ications with OpenMP Parallelism\n\nTian, Huber, Parasyris, Chapman, Doefe
 rt\n\nIn this paper, we propose a direct GPU compilation scheme that lever
 ages the portable target offloading interface provided by LLVM/OpenMP. Uti
 lizing this infrastructure allows us to compile an existing host applicati
 on for the GPU and execute it there with only a minimal wrapper layer for 
 the user code, command line arguments, and a compiler provided GPU impleme
 ntation of C/C++ standard library functions. The C/C++ library functions a
 re partially implemented for direct device execution and otherwise fallbac
 k to remote procedure call (RPC) to call host functions transparently. Our
  proposed prototype will allow users to quickly compile for, and test on, 
 the GPU without explicitly handling kernel launches, data mapping, or host
 -device synchronization. We evaluate our implementation using three proxy 
 applications with host OpenMP parallelism and three microbenchmarks to tes
 t the correctness of our prototype GPU compilation.\n\nSession Format: Rec
 orded\n\nRegistration Category: Workshop Reg Pass
END:VEVENT
END:VCALENDAR
