E-Book, Englisch, Band 12247, 646 Seiten, eBook
Malawski / Rzadca Euro-Par 2020: Parallel Processing
1. Auflage 2020
ISBN: 978-3-030-57675-2
Verlag: Springer International Publishing
Format: PDF
Kopierschutz: 1 - PDF Watermark
26th International Conference on Parallel and Distributed Computing, Warsaw, Poland, August 24–28, 2020, Proceedings
E-Book, Englisch, Band 12247, 646 Seiten, eBook
Reihe: Lecture Notes in Computer Science
ISBN: 978-3-030-57675-2
Verlag: Springer International Publishing
Format: PDF
Kopierschutz: 1 - PDF Watermark
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
Support Tools and Environments.- Skipping Non-essential Instructions Makes Data-dependence Profiling Faster.- A Toolchain to Verify the Parallelization of OmpSs-2 Applications.- Performance and Power Modeling, Prediction and Evaluation.- Towards a Model to Estimate the Reliability of Large-scale Hybrid Supercomputers.- A Learning-Based Approach for Evaluating the Capacity of Data Processing Pipelines.- Operation-Aware Power Capping.- A Comparison of the Scalability of OpenMP Implementations.- Evaluating the Effectiveness of a Vector-Length-Agnostic Instruction Set.- Scheduling and Load Balancing.- Parallel Scheduling of Data-Intensive Tasks.- A Makespan Lower Bound for the Scheduling of the Tiled Cholesky Factorization Based on ALAP Scheduling.- Optimal GPU-CPU Offloading Strategies for Deep Neural Network Training.- Improving mapping for sparse direct solvers: A trade-off between data locality and load balancing.- High Performance Architectures and Compilers.- Modelling Standard and Randomized Slimmed Folded Clos Networks.- OmpMemOpti: Optimized Memory Movement for Heterogeneous Computing.- Data Management, Analytics and Machine Learning.- Accelerating Deep Learning Inference with Cross-Layer Data Reuse on GPUs.- Distributed Fine-Grained Traffic Speed Prediction for Large-Scale Transportation Networks based on Automatic LSTM Customization and Sharing.- Optimizing FFT-based convolution on ARMv8 multi-core CPUs.- Maximizing I/O Bandwidth for Reverse Time Migration on Heterogeneous Large-Scale Systems.- Cluster, Cloud and Edge Computing.- TorqueDB: Distributed Querying of Time-series Data from Edge-local Storage.- Data-Centric Distributed Computing on Networks Mobile Devices.- WPSP: a multi-correlated weighted policy for VM selection and migration for Cloud computing.- Theory and Algorithms for Parallel and Distributed Processing.- LCP-Aware Parallel String Sorting.- Mobile RAM and Shape Formation by Programmable Particles.- Approximation Algorithm for Estimating Distances in Distributed Virtual Environments.- On the Power of Randomization in Distributed Algorithms in Dynamic Networks with Adaptive Adversaries.- 3D Coded SUMMA: Communication-Efficient and Robust Parallel Matrix Multiplication.- Parallel and Distributed Programming, Interfaces, and Languages.- Managing Failures in task-based parallel workflows in distributed computing environments.- Accelerating Nested Data Parallelism: Preserving Regularity.- Using Dynamic Broadcasts to improve Task-Based Runtime Performances.- A Compression-Based Design for Higher Throughput in a Lock-Free Hash Map.- Multicore and Manycore Parallelism.- NVPhTM: An Efficient Phase-Based Transactional System for Non-Volatile Memory.- Enhancing Resource Management through Prediction-based Policies.- Accelerating Overlapping Community Detection: Performance Tuning a Stochastic Gradient Markov Chain Monte Carlo Algorithm.- Parallel Numerical Methods andApplications.- A Prediction Framework for Fast Sparse Triangular Solves.- Multiprecision block-Jacobi for Iterative Triangular Solves.- Efficient Ephemeris Models for Spacecraft Trajectory Simulations on GPUs.- Parallel Finite Cell Method with Adaptive Geometric Multigrid.- Accelerator Computing.- cuDTW++: Ultra-Fast Dynamic Time Warping on CUDA-enabled GPUs.- Heterogeneous CPU+iGPU Processing for Efficient Epistasis Detection.- SYCL-Bench: A Versatile Single-Source Benchmark Suite for Heterogeneous Computing.