Graphics processing unit (GPU) programming strategies and trends in GPU computing

Abstract

In this article, we have given an overview of hardware and traditional optimization techniques for the GPU. We have furthermore given a step-by-step guide to profile driven development, in which bottlenecks and possible solutions are outlined. The focus is on state-of-the-art hardware with accompanying tools, and we have addressed the most prominent bottlenecks: memory, arithmetics, and latencies.

Language

English

Author(s)

André Rigland Brodtkorb
Trond Runar Hagen
Martin Lilleeng Sætra

Affiliation

SINTEF Digital / Mathematics and Cybernetics
University of Oslo

Date

04.05.2012

Year

2013

Published in

Journal of Parallel and Distributed Computing

ISSN

0743-7315

Publisher

Academic Press

Volume

Issue

Page(s)

4 - 13

External resources

https://babrodtk.at.ifi.uio.no/files/publications/brodtkorb_etal_meta10.pdf

DOI

https://doi.org/10.1016/j.jpdc.2012.04.003

View this publication at Cristin

Contact us

Our services

Career

Sustainability

Management and board

Institutes

Other units

About us

Follow us