lib A Coordinated Tiling and Batching Framework for Efficient GEMM on GPUs
lib A Pattern Based Algorithmic Autotuner for Graph Processing on GPUs
lib A Round-Efficient Distributed Betweenness Centrality Algorithm
lib Adaptive Sparse Matrix-Matrix Multiplication on the GPU
lib Checking Linearizability Using Hitting Families
lib Corrected trees for reliable group communication
lib Efficient Race Detection with Futures
lib Incremental Flattening for Nested Data Parallelism
lib Leveraging Hardware TM in Haskell
lib Lightweight Hardware Transactional Memory Profiling
lib Proactive Work Stealing for Futures
lib QTLS: high-performance TLS asynchronous offload framework with IntelĀ® QuickAssist technology
lib Semantics-Aware Scheduling Policies for Synchronization Determinism
lib SEP-Graph: Finding Shortest Execution Paths for Graph Processing under a Hybrid Framework on GPU
lib Stretching the capacity of Hardware Transactional Memory in IBM POWER architectures