Slide 1

Slide 1 text

1 FUNCTION-ON-SCALAR MODELS Jeff Goldsmith, Ph D Department of Biostatistics

Slide 2

Slide 2 text

2 Motivation J. A. Schrack, V. Zipunnikov, J. Goldsmith, J. Bai, E. M. Simonsick, C. M. Crainiceanu, L. Ferrucci (2014). Assessing the “Physical Cliff”: Detailed Quantification of Aging and Physical Activity. Journal of Gerontology: Medical Sciences, 69 973-979.

Slide 3

Slide 3 text

3 • Association ? • Confounding ? • Significance? Questions to address

Slide 4

Slide 4 text

4 • Activity may depend on covariate s • Associations may be different at different times of da y • Shift from functions as predictors to functions as outcome s • “Function-on-scalar” regression Functions as outcomes

Slide 5

Slide 5 text

5 • The function-on-scalar model that is analogous to simple linear regression i s • Functional response y • Scalar predictor x • Functional covariate is of interes t • Linear mode l –Most common approach Simple linear regression

Slide 6

Slide 6 text

6 • The MLR equivalent i s • Functional response y • Scalar predictor x • Functional covariates are of interes t • Linear mode l – Most common approac h Linear FoSR

Slide 7

Slide 7 text

7 • The functional coefficients are usually expanded in terms of a basis : • Several basis options are possibl e –FP C –Splines (my preference ) –Wavelet s –Fourier Basis expansion

Slide 8

Slide 8 text

8 • For response data on a common finite grid, the model can be expresse d • Y is the matrix of row-stacked response s • X is the usual design matri x • is the matrix of basis functions evaluated over the common gri d • B is the matrix of basis coefficient s • E is the matrix of row-stacked errors Basis expansion

Slide 9

Slide 9 text

9 • By vectorizing the response and the linear predictor, we obtain the equivalent model formulatio n • vec() concatenates the columns of the matrix argumen t • is the kronecker produc t • This reformulates function-on-scalar regression as a usual least-squares proble m • Goal is to estimate the columns of B or, equivalently, the elements of vec(B) Recast model

Slide 10

Slide 10 text

10 • Errors are correlated within a subject, in a way that’s comple x • Three possible approaches : – Ignore this issu e – Use GLS in place of OLS by “pre-whitening” the left and right side of the matrix formulation of the model : •I.e. define where is the error covariance matrix, and similarly modify the RHS – Jointly model the coefficient vector and the residual covarianc e •Easiest in a Bayesian setting Correlated errors

Slide 11

Slide 11 text

11 • The preceding does not include smoothness constraints on estimated coefficient s • Such constraints often take the form of a penalt y • Can be expressed in terms of a penalty on the basis coefficient s • Alternatively, one can be careful about the dimension of the basis Smoothness constraints

Slide 12

Slide 12 text

12 • “Global” test: under the null, there is no association between the predictor and outcome at any tim e • “Local” test: under the null, there is no association between the predictor and outcome at time t • Both can be implemented by focusing on basis coefficient s • The former avoids some multiple comparisons issue s • Analogous to F and t tests in MLR Testing

Slide 13

Slide 13 text

13 • Integrating over t for each subject gives the average activity for a subjec t • Can similarly average over coefficients in FoSR to approximate estimates in ML R • Will differ slightly from an MLR fit to average activity count s • Illustrates the connection (and difference) between approaches Connection to MLR

Slide 14

Slide 14 text

14 Switch to code

Slide 15

Slide 15 text

15 Code

Slide 16

Slide 16 text

16 Code

Slide 17

Slide 17 text

17 Code

Slide 18

Slide 18 text

18 Code

Slide 19

Slide 19 text

19 Code

Slide 20

Slide 20 text

20 Code

Slide 21

Slide 21 text

21 Code

Slide 22

Slide 22 text

22 Code

Slide 23

Slide 23 text

23 Code