Document Actions

Publications

by John Mellor-Crummey last modified 2008-04-15 08:53
  • Agarwal, S, Barik, R, Bonachea, D, Sarkar, V, Shyamasundar, R, and Yelick, K (2007). Deadlock-Free Scheduling of X10 Computations with Bounded Resources In: Symposium on Parallel Algorithms and Architecture (SPAA), pp. 229–240, San Diego, California, ACM. [PDF]
  • Arnold, DC, Ahn, DH, Supinski, BR, Lee, G, Miller, BP, and Schulz, M (2007). Stack Trace Analysis for Large Scale Debugging In: Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 07), Long Beach, California, IEEE.
  • Bordelon, A (2007). Developing a Scalable, Extensible Parallel Performance Analysis Toolkit Master thesis, Rice University, Department of Computer Science.
  • Budlimic, Z, Zhang, R, and Scherer, W (2007). Runtime Tuning of STM Validation Techniques In: ACM Symposium on the Principles and Practice of Parallel Programming, ACM. Buttari, A, Dongarra, J, Husbands, P, Kurzak, J, and Yelick, K (2007). Multithreading for Synchronization Tolerance in Matrix Factorization In: Proceedings of the SciDAC 2007 Conference, Boston, Massachusetts, Journal of Physics: Conference Series. Buttari, A, Langou, J, Kurzak, J, and Dongarra, J (2007). A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures Parallel Computing. Buttari, A, Langou, J, Kurzak, J, and Dongarra, J (2007). Parallel Tiled QR Factorization for Multicore Architectures Concurrency and Computation: Practice and Experience. Chen, W (2007). Optimizing Partitioned Global Address Space Programs for Cluster Architectures PhD thesis, University of California-Berkeley, Computer Science Division. Chen, W, Bonachea, D, Iancu, C, and Yelick, K (2007). Automatic Nonblocking Communication for Partitioned Global Address Space Programs In: Proceedings of the International Conference on Supercomputing (ICS), pp. 158–167, Seattle, Washington, ACM. Coarfa, C, Mellor-Crummey, J, Froyd, N, and Dotsenko, Y (2007). Scalability Analysis of SPMD Codes Using Expectations In: Proceedings of the International Conference on Supercomputing, pp. 13–22, Seattle, Washington, ACM. Husbands, P and Yelick, K (2007). Multithreading and One-Sided Communication in Parallel LU Factorization In: Proceedings of Supercomputing (SC07), Reno, Nevada, ACM. Kamil, A and Yelick, K (2007). Hierarchical Pointer Analysis for Distributed Programs In: Static Analysis Symposium (SAS), pp. 281–297, Kongens Lyngby, Denmark, Springer Berlin / Heidelberg. Kurzak, J and Dongarra, J (2007). Implementation of Mixed Precision in Solving Systems of Linear Equations on the Cell Processor Concurrency and Computation: Practice and Experience., 19(10):1371–1385. Kurzak, J, Buttari, A, and Dongarra, J (2007). Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization IEEE Transactions on Parallel and Distributed Systems. Lee, GL, Ahn, DH, Arnold, DC, Supinski, BR, Miller, BP, and Schulz, M (2007). Benchmarking the Stack Trace Analysis Tool for BlueGene/L In: Parallel Computing 2007 (Parco): Minisymposium on Scalability and Usability of HPC Programming Tools, Parco2007. Lusk, E and Yelick, K (2007). Languages for High-Productivity Computing: The DARPA HPCS Language Project Parallel Processing Letters, 17(1):89–102. Marin, G and Mellor-Crummey, J (2007). Application Insight Through Performance Modeling In: Proceedings of the 26th IEEE International Performance, Computing, and Communications Conference, pp. 65–74, New Orleans, Louisiana, IEEE. Marin, G and Mellor-Crummey, J (2007). Understanding Unfulfilled Memory Reuse Potential in Scientific Applications Rice University, technical report(TR07-6). Mellor-Crummey, J (2007). Harnessing the Power of Emerging Petascale Platforms In: Proceedings of the SciDAC Conference 2007, Journal of Physics: Conference Series 78. Mellor-Crummey, J, Beckman, P, Cooper, K, Dongarra, J, Gropp, W, Lusk, E, Miller, B, and Yelick, K (2007). Creating Software Tools and Libraries for Leadership Computing Webpublished. Mellor-Crummey, J, Beckman, P, Dongarra, J, Miller, B, and Yelick, K (2007). Software Technology for Leadership-Class Computing SciDAC Review:36–45. Qasem, A (2007). Automatic Tuning of Scientific Applications PhD thesis, Rice University, Department of Computer Science. Su, J and Yelick, K (2007). Automatic Performance Debugging in Partitioned Global Address Space Programs In: 20th International Workshop on Languages and Compilers for Parallel Computing (LCPC), Urbana, Illinois, Springer Lecture Notes in Computer Science. Wen, T, Su, J, Colella, P, Yelick, K, and Keen, N (2007). An Adaptive Mesh Refinement Benchmark for Modern Parallel Programming Languages In: Proceedings of Supercomputing (SC07), Reno, Nevada, ACM. Williams, S, Oliker, L, Vuduc, R, Demmel, J, and Yelick, K (2007). Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Programs In: Proceedings of Supercomputing (SC07), Reno, Nevada, ACM.
« April 2018 »
Su Mo Tu We Th Fr Sa
1234567
891011121314
15161718192021
22232425262728
2930
 

Powered by Plone

CScADS Collaborators include:

Rice University ANL UCB UTK WISC