Page 1 |
Save page Remove page | Previous | 1 of 138 | Next |
|
small (250x250 max)
medium (500x500 max)
large ( > 500x500)
Full Resolution
All (PDF)
|
This page
All
Subset |
MAPPING SPARSE MATRIX SCIENTIFIC APPLICATIONS ONTO
FPGA-AUGMENTED RECONFIGURABLE SUPERCOMPUTERS
by
Gerald R. Morris
A Dissertation Presented to the
FACULTY OF THE GRADUATE SCHOOL
UNIVERSITY OF SOUTHERN CALIFORNIA
In Partial Fulfillment of the
Requirements for the Degree
DOCTOR OF PHILOSOPHY
(ELECTRICAL ENGINEERING)
December 2006
Copyright 2006 Gerald R. Morris
Object Description
| Title | Mapping sparse matrix scientific applications onto FPGA-augmented reconfigurable supercomputers |
| Author | Morris, Gerald Roger |
| Author email | grm@usc.edu |
| Degree | Doctor of Philosophy |
| Document type | Dissertation |
| Degree program | Electrical Engineering |
| School | Viterbi School of Engineering |
| Date defended/completed | 2006-10-13 |
| Date submitted | 2006 |
| Restricted until | Unrestricted |
| Date published | 2006-10-30 |
| Advisor (committee chair) | Prasanna, Viktor K. |
| Advisor (committee member) |
Nakano, Aiichiro Dubois, Michel |
| Abstract | The large capacity of field programmable gate arrays (FPGAs) has prompted researchers to map computational kernels onto FPGAs. In some instances, these kernels achieve significant speedups over their software-only counterparts running on general-purpose processors. The success of these efforts has spurred supercomputer companies to develop reconfigurable computers (RCs) that allow the FPGAs to become, in effect, application-specific coprocessors. In concert with the RCs are high-level language-to-hardware description language (HLL-to-HDL) compilers that facilitate development of FPGA-based kernels using HLL-based programming rather than HDL-based hardware design. In theory, these technologies allow end-users to create high-performance custom computing architectures. In practice, acceleration of floating-point scientific kernels is still problematic. Sequential vector reductions like accumulation are difficult because the pipelined floating-point units introduce loop carried dependences that prevent the hardware from being fully pipelined. This has a profound impact on fundamental scientific kernels such as matrix vector multiply. Without pipelining, the potential performance advantage of FPGA-based kernels is eliminated.; This dissertation develops algorithms and architectures for time and area efficient software and hardware implementation of scientific kernels on RCs. In particular, it deals with the problem of mapping IEEE Standard 754 double-precision floating-point sparse matrix computations onto FPGA-augmented RCs using an HLL-to-HDL compiler.; The major contributions of this research are firstly, a novel algorithm and architecture that facilitates HLL-based reduction of multiple, sequentially delivered floating-point vectors without pipeline stalls or buffer overflows, and secondly, the demonstration of how to speedup an important class of scientific applications, that is, sparse matrix solvers, by mapping them onto reconfigurable supercomputers. Optimized software version of two classic iterative solvers, the Jacobi method, and conjugate gradient are used as a baseline for comparison. Using heuristics and techniques presented in the dissertation, these two solvers are accelerated using FPGA-based kernels. To ensure a fair comparison, both versions of each solver are developed using the same software baseline, and both versions are run on the same platform using the same set of sparse linear equations. The FPGA-augmented versions have a measured speedup of over two on a current-generation RC and an estimated speedup of over six on a next-generation RC. |
| Keyword | reconfigurable computer; sparse matrix; Jacobi method; FPGA; conjugate gradient; vector reduction |
| Language | English |
| Part of collection | University of Southern California dissertations and theses |
| Publisher (of the original version) | University of Southern California |
| Place of publication (of the original version) | Los Angeles, California |
| Publisher (of the digital version) | University of Southern California. Libraries |
| Type | texts |
| Legacy record ID | usctheses-m116 |
| Rights | Morris, Gerald Roger |
| Repository name | Libraries, University of Southern California |
| Repository address | Los Angeles, California |
| Repository email | http://www.usc.edu/isd/libraries/services/ask_a_librarian/email/ |
| Filename | etd-Morris-20061030 |
| Archival file | uscthesesreloadpub_Volume35/etd-Morris-20061030.pdf |
Description
| Title | Page 1 |
| Full text | MAPPING SPARSE MATRIX SCIENTIFIC APPLICATIONS ONTO FPGA-AUGMENTED RECONFIGURABLE SUPERCOMPUTERS by Gerald R. Morris A Dissertation Presented to the FACULTY OF THE GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Fulfillment of the Requirements for the Degree DOCTOR OF PHILOSOPHY (ELECTRICAL ENGINEERING) December 2006 Copyright 2006 Gerald R. Morris |
Comments
Post a Comment for Page 1

