000 03221cam a2200373Ii 4500
003 OCoLC
005 20220310161943.0
008 160905s2017 mauad b 001 0 eng d
020 _a0128119861
_qpaperback
020 _a9780128119860
_qpaperback
035 _aGSU03731
_z(OCoLC)957680278
_z(OCoLC)964801939
_z(OCoLC)972618997
_z(OCoLC)972780043
_z(OCoLC)972919162
_z(OCoLC)973148803
_z(OCoLC)978721114
_z(OCoLC)978992806
_z(OCoLC)979387894
040 _aLCC
_beng
_cgsu
_dgsu
_erda
050 4 _aQA76.642
_b.K57 2017
082 0 4 _a004/.35
_223
100 1 _aKirk, David,
_d1960-
_eauthor.
_0http://id.loc.gov/authorities/names/n92010326.
245 1 0 _aProgramming massively parallel processors :
_ba hands-on approach /
_cDavid B. Kirk, Wen-mei W. Hwu.
250 _aThird edition.
264 1 _aCambridge, MA, United States :
_bMorgan Kaufmann,
_c[2017]
300 _axxii, 550 pages :
_billustrations, charts ;
_c24 cm.
336 _atext
_btxt
_2rdacontent.
337 _aunmediated
_bn
_2rdamedia.
338 _avolume
_bnc
_2rdacarrier.
500 _aPrevious edition: 2013.
504 _aIncludes bibliographical references and index.
505 0 _aIntroduction -- Data parallel computing -- Scalable parallel execution -- Memory and data locality - Performance considerations -- Numerical considerations -- Parallel patterns: concolution -- Parallel patterns: prefix sum -- Parallel patterns: parallel histogram computation -- Parallel patterns: sparse matrix computation -- Parallel patterns: merge sort -- Parallel patterns: graph search -- CUDA dynamic parallelism -- Application case study: non-cartesian magnetic resonance imaging -- Application case study: molecular visualization and analysis -- Application case study: machine learning -- Parallel programming and computational thinking -- Programming a heterogeneous computing cluster -- Parallel programming with OpenACC -- More on CUDA and graphics processing unit computing -- Conclusion and outlook.
520 _aThis book shows both student and professional alike the basic concepts of parallel programming and GPU architecture, exploring, in detail, various techniques for constructing parallel programs. Case studies demonstrate the development process, detailing computational thinking and ending with effective and efficient parallel programs. Topics of performance, floating-point format, parallel patterns, and dynamic parallelism are covered in-depth. This edition contains updated coverage of CUDA, including coverage of newer libraries, such as CuDNN, moved content that has become less important to appendices, added two new chapters on parallel patterns, and updated case studies to reflect current industry practices.
650 0 _aParallel programming (Computer science)
_0http://id.loc.gov/authorities/subjects/sh85097827.
650 0 _aParallel processing (Electronic computers)
_0http://id.loc.gov/authorities/subjects/sh85097826.
650 0 _aMultiprocessors.
_0http://id.loc.gov/authorities/subjects/sh85088386.
650 0 _aComputer architecture.
_0http://id.loc.gov/authorities/subjects/sh85029479.
700 1 _aHwu, Wen-mei,
_eauthor.
_0http://id.loc.gov/authorities/names/n2009077213.
942 _2lcc
_cBK
_n0
999 _c217
_d217