Staff View: OpenMP: Heterogenous Execution and Data Movements

OpenMP: Heterogenous Execution and Data Movements 11th International Workshop on OpenMP, IWOMP 2015, Aachen, Germany, October 1-2, 2015, Proceedings /

This book constitutes the refereed proceedings of the 11th International Workshop on OpenMP, held in Aachen, Germany, in October 2015. The 19 technical full papers presented were carefully reviewed and selected from 22 submissions. The papers are organized in topical sections on applications, accele...

Full description

Corporate Authors:	IWOMP (Conference) Aachen, Germany)
Other Authors:	IWOMP (Conference), Terboven, Christian,, De Supinski, Bronis R.,, Reble, Pablo,, Chapman, Barbara, 1954-, Müller, Matthias S.,, SpringerLink (Online service)
Format:	eBook
Language:	English
Published:	Cham : Springer, 2015.
Physical Description:	1 online resource (xi, 274 pages) : illustrations.
Series:	Lecture notes in computer science ; 9342. LNCS sublibrary. Programming and software engineering.
Subjects:	OpenMP (Application program interface) -- Congresses. Computer science. Computers. Microprocessors. Computer system failures. Software engineering. Programming languages (Electronic computers) Algorithms.


LEADER	14492cam a2201597Mi 4500
001	932169396
003	OCoLC
005	20240223121953.0
006	m o d
007	cr nn\|008mamaa
008	150930s2015 sz a od 100 0 eng d
015			\|a GBB8N3096 \|2 bnb
016	7		\|a 019148330 \|2 Uk
019			\|a 961849692 \|a 964878710 \|a 1066651671 \|a 1066693220 \|a 1086458505 \|a 1259142845
020			\|a 9783319245959 \|q (electronic bk.)
020			\|a 3319245953 \|q (electronic bk.)
020			\|a 3319245945
020			\|a 9783319245942
020			\|z 9783319245942
024	7		\|a 10.1007/978-3-319-24595-9 \|2 doi
035			\|a (OCoLC)932169396 \|z (OCoLC)961849692 \|z (OCoLC)964878710 \|z (OCoLC)1066651671 \|z (OCoLC)1066693220 \|z (OCoLC)1086458505 \|z (OCoLC)1259142845
037			\|a com.springer.onix.9783319245959 \|b Springer Nature
040			\|a NUI \|b eng \|e pn \|c NUI \|d OCLCO \|d ORU \|d GW5XE \|d OHI \|d OCLCQ \|d IAS \|d IAO \|d IAD \|d KSU \|d JBG \|d ICN \|d SOI \|d VT2 \|d OCLCQ \|d ILO \|d OCLCF \|d IAD \|d OCL \|d ESU \|d OCLCQ \|d IOG \|d OCLCA \|d U3W \|d OCL \|d BUF \|d OCLCQ \|d OCLCO \|d CEF \|d OCLCQ \|d OCLCO \|d TFW \|d WYU \|d UKMGB \|d OCLCQ \|d OCLCO \|d UKAHL \|d QGK \|d OCLCO \|d COM \|d OCLCO \|d OCL \|d OCLCO \|d OCLCL
049			\|a COM6
050		4	\|a TK7895.M5
072		7	\|a UYF \|2 bicssc
072		7	\|a COM011000 \|2 bisacsh
082	0	4	\|a 004.1 \|2 23
111	2		\|a IWOMP (Conference) \|n (11th : \|d 2015 : \|c Aachen, Germany)
245	1	0	\|a OpenMP: Heterogenous Execution and Data Movements : \|b 11th International Workshop on OpenMP, IWOMP 2015, Aachen, Germany, October 1-2, 2015, Proceedings / \|c edited by Christian Terboven, Bronis R. de Supinski, Pablo Reble, Barbara M. Chapman, Matthias S. Müller.
246	3		\|a IWOMP 2015.
264		1	\|a Cham : \|b Springer, \|c 2015.
300			\|a 1 online resource (xi, 274 pages) : \|b illustrations.
336			\|a text \|b txt \|2 rdacontent.
337			\|a computer \|b c \|2 rdamedia.
338			\|a online resource \|b cr \|2 rdacarrier.
347			\|a text file \|b PDF \|2 rda.
490	1		\|a Lecture Notes in Computer Science, \|x 0302-9743 ; \|v 9342.
490	1		\|a LNCS sublibrary. SL 2, Programming and software engineering.
520			\|a This book constitutes the refereed proceedings of the 11th International Workshop on OpenMP, held in Aachen, Germany, in October 2015. The 19 technical full papers presented were carefully reviewed and selected from 22 submissions. The papers are organized in topical sections on applications, accelerator applications, tools, extensions, compiler and runtime, and energy.
504			\|a Includes bibliographical references and index.
505	0		\|a Intro -- Preface -- Organization -- Contents -- Applications -- PAGANtec: OpenMP Parallel Error Correction for Next-Generation Sequencing Data -- 1 Introduction -- 2 Related Work -- 2.1 k-mer Graph and Error Correction -- 2.2 Parallelization Options -- 3 PAGANtec Architecture -- 3.1 Graph Structure -- 3.2 Correction Strategies -- 3.3 Correcting Errors -- 4 Parallelization -- 4.1 Performance Analysis -- 5 Conclusion -- References -- Composing Low-Overhead Scheduling Strategies for Improving Performance of Scientific Applications -- 1 Introduction -- 2 Scheduling Strategies -- 3 Techniques for Composing Scheduling Strategies -- 3.1 uSched -- 3.2 slackSched -- 3.3 vSched -- 3.4 ComboSched -- 4 Code Transformation -- 5 Results -- 6 Related Work -- 7 Conclusions -- References -- Exploiting Fine- and Coarse-Grained Parallelism Using a Directive Based Approach -- 1 Introduction -- 2 Related Work -- 3 Background: OpenMP Accelerator Model -- 4 An Offloading Model for a Cluster -- 4.1 Definitions -- 4.2 Execution Model -- 4.3 Memory Model -- 5 Implementation -- 5.1 Runtime Support -- 6 Preliminary Results -- 7 Discussion -- 8 Conclusions -- References -- Accelerator Applications -- Experiences of Using the OpenMP Accelerator Model to Port DOE Stencil Applications -- 1 Introduction -- 2 OpenMP 4.0's Accelerator Support -- 3 Applications -- 4 Porting to GPUs -- 4.1 Baseline Performance on CPU and GPU -- 4.2 Increasing Parallelism -- 4.3 Loop Scheduling -- 4.4 Exploiting Memory Hierarchy -- 4.5 Reducing Memory Movement Between Host and Device -- 4.6 Manual Tuning for GPU Performance -- 4.7 Productivity -- 5 Related Work -- 6 Discussion and Future Work -- References -- Evaluating the Impact of OpenMP 4.0 Extensions on Relevant Parallel Workloads -- 1 Introduction and Motivation -- 2 Application Parallelization -- 2.1 Facesim -- 2.2 Fluidanimate.
505	8		\|a 2.3 Streamcluster -- 3 Evaluation -- 3.1 Performance Evaluation -- 3.2 Programmability -- 4 Related Work -- 5 Conclusions -- References -- First Experiences Porting a Parallel Application to a Hybrid Supercomputer with OpenMP 4.0 Device Constructs -- 1 Introduction -- 2 OpenMP Device Constructs -- 2.1 Data Regions -- 3 A High Level View of the Porting Method -- 3.1 Fusing Local Data Regions -- 4 Porting NekBone -- 5 Conclusions -- References -- Tools -- Lessons Learned from Implementing OMPD: A Debugging Interface for OpenMP -- 1 Introduction -- 2 Prior Work -- 3 The OpenMP Debugging Interface -- 3.1 OMPT: A Runtime Interface for OpenMP Tools -- 3.2 Why Distinguish OMPD from OMPT? -- 3.3 The OMPD Architecture -- 4 Use Cases of OMPD -- 4.1 OpenMP-Aware Stack Trace -- 4.2 Stepping in and Out of a Parallel Region -- 5 OMPD Callback Interface -- 5.1 Functions for Operating System Interaction -- 5.2 Resolving Structures for Target Architecture -- 5.3 Access Application Memory -- 5.4 Debugger's Context Argument -- 6 OMPD API Function Specifications -- 6.1 Providing Information on Compatible Runtime Library -- 6.2 API Specification for Breakpoints -- 6.3 Missing Function to Identify Master -- 7 Future Challenges -- 7.1 Context Pointer for Accelerators -- 7.2 Addressing Accelerator Threads -- 7.3 Return Codes -- 8 Conclusions -- References -- False Sharing Detection in OpenMP Applications Using OMPT API -- 1 Introduction -- 2 Motivation -- 3 Related Work -- 4 OMPT- Application Programming Interface for Tools -- 5 Our Approach -- 5.1 OMPT for Capturing Unique Patterns -- 5.2 Hardware Performance Information -- 5.3 Binary Classifier for False Sharing Detection -- 5.4 Feature Selection -- 6 Experimentation and Results -- 6.1 Training Phase -- 6.2 Validation of the Approach -- 7 Conclusion and Future Work -- References.
505	8		\|a Exception Handling with OpenMP in Object-Oriented Languages -- 1 Introduction -- 2 Related Work -- 3 Problem Overview -- 3.1 Current Situation -- 3.2 Problem Definition -- 4 Cancellations -- 5 Exception Handling -- 5.1 Overview of Categorization -- 5.2 Local Exception Handling -- 5.3 Global Exception Handling -- 6 Implementation -- 6.1 Adaptable Synchronization Barrier -- 6.2 Dynamic Work Redistribution -- 6.3 Exception from Synchronization Regions -- 6.4 Global Exception Throwing -- 7 Evaluation -- 7.1 Usability -- 7.2 Performance -- 8 Conclusion -- References -- Extensions -- On the Algorithmic Aspects of Using OpenMP Synchronization Mechanisms II: User-Guided Speculative Locks -- 1 Introduction -- 2 Related Work -- 3 User-Guided Locking API with TSX -- 3.1 Intel Transactional Synchronization Extensions -- 3.2 Using the User-Guided Locking API -- 4 Applying Intel TSX to the Test Code -- 4.1 A Brief Review of the Algorithm -- 4.2 The Role of TSX -- 5 Experimental Results -- 5.1 Convergence -- 5.2 Transactional Memory Statistics -- 5.3 Performance Measurement -- 6 Conclusions and Future Work -- References -- Using Transactional Memory to Avoid Blocking in OpenMP Synchronization Directives -- 1 Introduction -- 2 Avoiding Blocking in OpenMP -- 2.1 Critical Sections -- 2.2 Barrier/Taskwait -- 3 Evaluation -- 3.1 Experimental Setup -- 3.2 Results -- 4 Limitations and Related Work -- 5 Conclusion -- References -- A Case Study of OpenMP Applied to Map/Reduce-Style Computations -- 1 Introduction -- 2 Related Work -- 3 Map-Reduce Programming Model -- 3.1 Phoenix++ Implementation -- 3.2 OpenMP Facilities for Map/Reduce-Style Computations -- 4 OpenMP Implementations -- 4.1 Histogram -- 4.2 Linear Regression -- 4.3 K-Means Clustering -- 4.4 Word Count -- 4.5 String Match -- 4.6 Matrix Multiply -- 4.7 Principal Component Analysis -- 5 Evaluation -- 5.1 Analysis.
505	8		\|a 5.2 Coding Style Comparison -- 5.3 Implications to OpenMP -- 6 Conclusion -- References -- Compiler and Runtime -- Enabling Region Merging Optimizations in OpenMP -- 1 Introduction -- 2 Region Merging and Control -- 2.1 Region Merging Validity in OpenMP -- 2.2 Syntax Extensions to Support Merging -- 3 Results and Evaluation -- 3.1 Back to Back Regions -- 3.2 Parallel Regions with Intervening Serial Regions -- 3.3 Lulesh -- 4 Related Work -- 5 Conclusion -- References -- Towards Task-Parallel Reductions in OpenMP -- 1 Introduction -- 2 Related Work -- 3 Discussion -- 3.1 Updates of a Reduction Variable Outside a Reduction Context -- 3.2 Over-Specifying the Reduction Identifier -- 3.3 Supporting Untied Tasks -- 3.4 Supporting Nested Taskgroups -- 3.5 Cancellation, Dependencies and Merged Tasks -- 4 Syntax Additions -- 5 Evaluation -- 5.1 System Environment -- 5.2 Benchmark Descriptions -- 5.3 Performance Results on Intel Xeon Processors -- 5.4 Performance Results on Intel Xeon Phi Coprocessors -- 6 Conclusions and Future Work -- References -- OpenMP 4.0 Device Support in the OMPi Compiler -- 1 Introduction -- 2 Background -- 3 Compiler Transformations -- 3.1 Target Data -- 3.2 Target -- 3.3 Declare Target -- 4 Runtime Support -- 4.1 Data Environment Handling -- 5 The Epiphany Accelerator as a Device -- 5.1 Runtime Organization -- 5.2 Experiments -- 6 Discussion and Current Status -- References -- Energy -- Application-Level Energy Awareness for OpenMP -- 1 Introduction -- 2 Motivation -- 3 OpenMPE -- 4 Compilation and Runtime System -- 5 Evaluation -- 6 Related Work -- 7 Conclusion and Future Work -- References -- Evaluating the Energy Consumption of OpenMP Applications on Haswell Processors -- 1 Introduction -- 2 Related Works -- 3 Basic Characteristics -- 3.1 Energy-Saving Features of Haswell -- 3.2 Load-Dependent Behavior -- 4 Optimization Steps.
546			\|a English.
630	0	0	\|a OpenMP (Application program interface) \|v Congresses.
630	0	7	\|a OpenMP (Application program interface) \|2 fast.
650		0	\|a Computer science.
650		0	\|a Computers.
650		0	\|a Microprocessors.
650		0	\|a Computer system failures.
650		0	\|a Software engineering.
650		0	\|a Programming languages (Electronic computers)
650		0	\|a Algorithms.
650		2	\|a Electronic Data Processing.
650		2	\|a Computers.
650		2	\|a Algorithms.
650		6	\|a Informatique.
650		6	\|a Ordinateurs.
650		6	\|a Pannes système (Informatique)
650		6	\|a Génie logiciel.
650		6	\|a Algorithmes.
650		7	\|a computers. \|2 aat.
650		7	\|a algorithms. \|2 aat.
650		7	\|a Programming & scripting languages: general. \|2 bicssc.
650		7	\|a Systems analysis & design. \|2 bicssc.
650		7	\|a Computer hardware. \|2 bicssc.
650		7	\|a Algorithms & data structures. \|2 bicssc.
650		7	\|a Software Engineering. \|2 bicssc.
650		7	\|a Computer architecture & logic design. \|2 bicssc.
650		7	\|a Computers \|x Programming Languages \|x General. \|2 bisacsh.
650		7	\|a Computers \|x Hardware \|x Handheld Devices. \|2 bisacsh.
650		7	\|a Computers \|x Hardware \|x General. \|2 bisacsh.
650		7	\|a Computers \|x Programming \|x Algorithms. \|2 bisacsh.
650		7	\|a Computers \|x Software Development & Engineering \|x General. \|2 bisacsh.
650		7	\|a Computers \|x Systems Architecture \|x General. \|2 bisacsh.
650		7	\|a Algorithms. \|2 fast.
650		7	\|a Computers. \|2 fast.
650		7	\|a Computer science. \|2 fast.
650		7	\|a Computer system failures. \|2 fast.
650		7	\|a Microprocessors. \|2 fast.
650		7	\|a Programming languages (Electronic computers) \|2 fast.
650		7	\|a Software engineering. \|2 fast.
655		2	\|a Dictionary.
655		2	\|a Congress.
655		7	\|a dictionaries. \|2 aat.
655		7	\|a proceedings (reports) \|2 aat.
655		7	\|a Dictionaries. \|2 fast.
655		7	\|a Conference papers and proceedings. \|2 fast.
655		7	\|a Dictionaries. \|2 lcgft.
655		7	\|a Conference papers and proceedings. \|2 lcgft.
655		7	\|a Dictionnaires. \|2 rvmgf.
655		7	\|a Actes de congrès. \|2 rvmgf.
700	1		\|a Terboven, Christian, \|e editor.
700	1		\|a De Supinski, Bronis R., \|e editor.
700	1		\|a Reble, Pablo, \|e editor.
700	1		\|a Chapman, Barbara, \|d 1954- \|1 https://id.oclc.org/worldcat/entity/E39PCjBRY8JdWBWgpWcW6QkQjd, \|e editor.
700	1		\|a Müller, Matthias S., \|e editor.
710	2		\|a SpringerLink (Online service)
776	0	8	\|i Printed edition: \|z 9783319245942.
830		0	\|a Lecture notes in computer science ; \|v 9342. \|x 0302-9743.
830		0	\|a LNCS sublibrary. \|n SL 2, \|p Programming and software engineering.
907			\|a .b50280752 \|b multi \|c - \|d 160404 \|e 240401
998			\|a (3)cue \|a cu \|b 240227 \|c m \|d z \|e - \|f eng \|g sz \|h 0 \|i 2
948			\|a MARCIVE Overnight, in 2024.03
948			\|a MARCIVE Comp, in 2023.01
948			\|a MARCIVE Over, 07/2021
948			\|a MARCIVE Comp, 2019.12
948			\|a MARCIVE Q2, 2018
948			\|a MARCIVE Comp, 2018.05
948			\|a MARCIVE Comp, 2017.10
948			\|a MARCIVE August, 2017
948			\|a MARCIVE extract Aug 5, 2017
933			\|a Marcive found issue: "700 1 \|a Reble, Pablo, \|e editor."
994			\|a 92 \|b COM
995			\|a Loaded with m2btab.ltiac in 2024.03
995			\|a Loaded with m2btab.elec in 2024.02
995			\|a Loaded with m2btab.ltiac in 2023.01
995			\|a Loaded with m2btab.ltiac in 2021.07
995			\|a Loaded with m2btab.elec in 2021.06
995			\|a Loaded with m2btab.ltiac in 2019.12
995			\|a Loaded with m2btab.ltiac in 2018.08
995			\|a Loaded with m2btab.ltiac in 2018.06
995	0	0	\|a OCLC offline update by CMU and loaded with m2btab.elec in 2018.04
995			\|a Loaded with m2btab.ltiac in 2017.10
995			\|a Loaded with m2btab.ltiac in 2017.09
995			\|a Loaded with m2btab.elec in 2016
995			\|a Loaded with m2btab.elec in 2016
995			\|a Loaded with m2btab.elec in 2016
995			\|a OCLC offline update by CMU
995			\|a Loaded with m2btab.auth in 2021.07
995			\|a Loaded with m2btab.auth in 2024.03
999			\|e z
999			\|a cue
989			\|d cueme \|e - - \|f - - \|g - \|h 0 \|i 0 \|j 200 \|k 240227 \|l $0.00 \|m \|n - - \|o - \|p 0 \|q 0 \|t 0 \|x 0 \|w SpringerLink \|1 .i150443146 \|u http://ezproxy.coloradomesa.edu/login?url=https://link.springer.com/10.1007/978-3-319-24595-9 \|3 SpringerLink \|z Click here for access

OpenMP: Heterogenous Execution and Data Movements 11th International Workshop on OpenMP, IWOMP 2015, Aachen, Germany, October 1-2, 2015, Proceedings /

In Prospector

Similar Items