Page 1 |
Save page Remove page | Previous | 1 of 120 | Next |
|
small (250x250 max)
medium (500x500 max)
Large (1000x1000 max)
Extra Large
large ( > 500x500)
Full Resolution
All (PDF)
|
This page
All
|
MISSING COVARIATES AND HIGH-DIMENSIONAL VARIABLE SELECTION IN ADDITIVE HAZARDS REGRESSION by Wei Lin A Dissertation Presented to the FACULTY OF THE USC GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Fulfillment of the Requirements for the Degree DOCTOR OF PHILOSOPHY (APPLIED MATHEMATICS) August 2011 Copyright 2011 Wei Lin
Object Description
Title | Missing covariates and high-dimensional variable selection in additive hazards regression |
Author | Lin, Wei |
Author email | linwei@usc.edu;weiwaynelin@gmail.com |
Degree | Doctor of Philosophy |
Document type | Dissertation |
Degree program | Applied Mathematics |
School | College of Letters, Arts And Sciences |
Date defended/completed | 2011-05-31 |
Date submitted | 2011-08-01 |
Date approved | 2011-08-02 |
Restricted until | 2011-08-02 |
Date published | 2011-08-02 |
Advisor (committee chair) | Goldstein, Larry |
Advisor (committee member) |
Alexander, Kenneth S. Langholz, Bryan Lv, Jinchi Sun, Fengzhu Z. |
Abstract | This dissertation addresses two challenging problems arising in inference with censored failure time data. The additive hazards model provides a unified framework for these problems to be discussed, not only because it is a useful alternative to the well-known Cox model and has significant practical implications, but also because its simple yet elegant structure allows one to explore some fundamental aspects of these problems. ❧ In the first part of this dissertation, we consider the estimation problem in additive hazards regression with missing covariates. We are interested in both the case where the observation probabilities are known and the case where they are unknown but can be parametrically modeled and estimated. By modifying the pseudoscore function with full data, we introduce some weighted estimators for the regression coefficients and the cumulative baseline hazard function. The proposed estimators are then shown to be consistent and asymptotically normal under mild conditions, with asymptotic variances that can be easily estimated. Our theoretical results and simulation studies indicate that using estimated weights in the simple weighted estimators may yield important efficiency gain and that the augmented weighted estimators are even more efficient. The proposed methods are further illustrated by a mouse leukemia data example. ❧ In the second part, we turn to the variable selection problem in the additive hazards model. Motivated by linking high-throughput genomic data to survival outcomes, we are particularly interested in the high-dimensional setting where the dimension of covariates may grow fast, possibly nonpolynomially, with the sample size. We propose to perform variable selection and estimation simultaneously by using a class of regularized estimators with a general family of concave penalties, including several popular choices such as the lasso, SCAD, MCP, and SICA. In a nonasymptotic framework where the model dimensions are allowed to vary freely, we rigorously investigate the weak oracle properties and oracle properties of the proposed estimators. Our theoretical results are essentially different from those in the existing literature, and provide new insight into the model selection properties of regularized estimators for survival models. We illustrate the proposed method by simulation studies and application to a diffuse large B-cell lymphoma data set. ❧ A common theme underlying the theoretical development in this dissertation is the use of modern empirical process theory. Indeed, we rely on the language of empirical process theory to establish our theoretical results for both problems considered here, and they serve as excellent examples for demonstration of the power and elegance of this mathematical tool, especially in the context of survival analysis. |
Keyword | survival analysis; missing covariates; variable selection; semiparametric inference; empirical processes |
Language | English |
Part of collection | University of Southern California dissertations and theses |
Publisher (of the original version) | University of Southern California |
Place of publication (of the original version) | Los Angeles, California |
Publisher (of the digital version) | University of Southern California. Libraries |
Provenance | Electronically uploaded by the author |
Type | texts |
Legacy record ID | usctheses-m |
Contributing entity | University of Southern California |
Rights | Lin, Wei |
Physical access | The author retains rights to his/her dissertation, thesis or other graduate work according to U.S. copyright law. Electronic access is being provided by the USC Libraries in agreement with the author, as the original true and official version of the work, but does not grant the reader permission to use the work if the desired use is covered by copyright. It is the author, as rights holder, who must provide use permission if such use is covered by copyright. The original signature page accompanying the original submission of the work to the USC Libraries is retained by the USC Libraries and a copy of it may be obtained by authorized requesters contacting the repository e-mail address given. |
Repository name | University of Southern California Digital Library |
Repository address | USC Digital Library, University of Southern California, University Park Campus MC 7002, 106 University Village, Los Angeles, California 90089-7002, USA |
Repository email | cisadmin@lib.usc.edu |
Archival file | uscthesesreloadpub_Volume71/etd-LinWei-231.pdf |
Description
Title | Page 1 |
Contributing entity | University of Southern California |
Repository email | cisadmin@lib.usc.edu |
Full text | MISSING COVARIATES AND HIGH-DIMENSIONAL VARIABLE SELECTION IN ADDITIVE HAZARDS REGRESSION by Wei Lin A Dissertation Presented to the FACULTY OF THE USC GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Fulfillment of the Requirements for the Degree DOCTOR OF PHILOSOPHY (APPLIED MATHEMATICS) August 2011 Copyright 2011 Wei Lin |