000 | 03221cam a2200373Ii 4500 | ||
---|---|---|---|
003 | OCoLC | ||
005 | 20220310161943.0 | ||
008 | 160905s2017 mauad b 001 0 eng d | ||
020 |
_a0128119861 _qpaperback |
||
020 |
_a9780128119860 _qpaperback |
||
035 |
_aGSU03731 _z(OCoLC)957680278 _z(OCoLC)964801939 _z(OCoLC)972618997 _z(OCoLC)972780043 _z(OCoLC)972919162 _z(OCoLC)973148803 _z(OCoLC)978721114 _z(OCoLC)978992806 _z(OCoLC)979387894 |
||
040 |
_aLCC _beng _cgsu _dgsu _erda |
||
050 | 4 |
_aQA76.642 _b.K57 2017 |
|
082 | 0 | 4 |
_a004/.35 _223 |
100 | 1 |
_aKirk, David, _d1960- _eauthor. _0http://id.loc.gov/authorities/names/n92010326. |
|
245 | 1 | 0 |
_aProgramming massively parallel processors : _ba hands-on approach / _cDavid B. Kirk, Wen-mei W. Hwu. |
250 | _aThird edition. | ||
264 | 1 |
_aCambridge, MA, United States : _bMorgan Kaufmann, _c[2017] |
|
300 |
_axxii, 550 pages : _billustrations, charts ; _c24 cm. |
||
336 |
_atext _btxt _2rdacontent. |
||
337 |
_aunmediated _bn _2rdamedia. |
||
338 |
_avolume _bnc _2rdacarrier. |
||
500 | _aPrevious edition: 2013. | ||
504 | _aIncludes bibliographical references and index. | ||
505 | 0 | _aIntroduction -- Data parallel computing -- Scalable parallel execution -- Memory and data locality - Performance considerations -- Numerical considerations -- Parallel patterns: concolution -- Parallel patterns: prefix sum -- Parallel patterns: parallel histogram computation -- Parallel patterns: sparse matrix computation -- Parallel patterns: merge sort -- Parallel patterns: graph search -- CUDA dynamic parallelism -- Application case study: non-cartesian magnetic resonance imaging -- Application case study: molecular visualization and analysis -- Application case study: machine learning -- Parallel programming and computational thinking -- Programming a heterogeneous computing cluster -- Parallel programming with OpenACC -- More on CUDA and graphics processing unit computing -- Conclusion and outlook. | |
520 | _aThis book shows both student and professional alike the basic concepts of parallel programming and GPU architecture, exploring, in detail, various techniques for constructing parallel programs. Case studies demonstrate the development process, detailing computational thinking and ending with effective and efficient parallel programs. Topics of performance, floating-point format, parallel patterns, and dynamic parallelism are covered in-depth. This edition contains updated coverage of CUDA, including coverage of newer libraries, such as CuDNN, moved content that has become less important to appendices, added two new chapters on parallel patterns, and updated case studies to reflect current industry practices. | ||
650 | 0 |
_aParallel programming (Computer science) _0http://id.loc.gov/authorities/subjects/sh85097827. |
|
650 | 0 |
_aParallel processing (Electronic computers) _0http://id.loc.gov/authorities/subjects/sh85097826. |
|
650 | 0 |
_aMultiprocessors. _0http://id.loc.gov/authorities/subjects/sh85088386. |
|
650 | 0 |
_aComputer architecture. _0http://id.loc.gov/authorities/subjects/sh85029479. |
|
700 | 1 |
_aHwu, Wen-mei, _eauthor. _0http://id.loc.gov/authorities/names/n2009077213. |
|
942 |
_2lcc _cBK _n0 |
||
999 |
_c217 _d217 |