Department of Statistics & OR College of Science - King Saud University STAT- 627: Generalized Linear Models Final Examination - First Semester 1427/1428 -------------------------------------------------------------------------------------------------------Time allowed is 3 hours. SECTION (A): Attempt one question. Question A-1: (i) Suppose that f() is a function of p variables, where =(1, …, p)t. Show that the value of optimizing f() can be numerically obtained by the iterative formula: 1 θ m θ m1 H (θ m1 ) f (θ m1 ) ; m 1, 2, , where H (θ m 1 ) and f (θ m 1 ) are the Hessian matrix and the gradient vector of f(), respectively, evaluated at θ m 1 , and θ 0 is an initial value of . (ii) Suppose that U 1 and U 2 are two independent random variables such that U 1 ~ 2 (n1 , 1 ) and U 2 ~ 2 (n2 , 2 ) . Show that U1 U 2 ~ 2 (n1 n2 , 1 2 ) . Question A-2: Consider the general linear model Y=X+ where ~Nn(0,2In) ), is a vector of p unknown parameters, and X is an np matrix of full rank; i.e., r(X)=p (p<n). Suppose that βˆ X Y is the least squares estimate of , X is the generalized inverse of X, ˆ is the vector of residuals. and r Y Y 1. Show that Cov(r ) , the variance-covariance matrix of r , is non-negative definite. 2. Show that Cov(r ) is not positive definite. 3. Let n i 1 i r be the sum of the residuals. Show that n i 1 i r 0 if and only if j C (X) , where j (1,1, ,1)' and C (X) is the column space of X (i.e., C (X) = the vector space spanned (or generated) by the columns of X ). SECTION (B): Attempt 3 questions. Question B-1: Suppose that Y1, Y2, …, YN are independent random variables satisfying the properties of a generalized linear model with: 1. Yi ~ f ( yi ; i ) e y i b( i ) c( i ) d ( y i ) (i=1, 2, …, N). 2. E(Yi) = i , Var(Yi)= i2 (i=1, 2, …, N). 3. g(i)= xit = i (i=1, 2, …, N); g is monotone and differentiable. l (β) Let U j be the score statistic corresponding to j (j=1, …, p), where l() is j the log likelihood function. N (Y ) (i) Show that Uj = i 2 i xij i . i i i 1 N x x i 2 ij ik ( ) . (ii) Show that jk Cov(U j ,U k ) i i2 i 1 Question B-2: Let l() be the log likelihood function where =(1,…,p)t is the vector of parameters -1- STAT- 627 Final Examination - First Semester 1427/1428 ------------------------------------------------------------------------------------------------------- l (β) of interest. Let U(β) be the vector of score statistics. Suppose that U(b) is β the vector of scores evaluated at β b , and (b) Cov(U(β)) evaluated at β b . 1 (i) Show that l (β) l (b) (β b) t U(b) (β b) t (b) (β b) . 2 (ii) Show that U(β) U(b) (b) (β b) . (iii) Show that W (β b) t (b) (β b) ~ 2 ( p) asymptotically. Hint: You may use that fact that U(b) ~ N p (0, (b)) asymptotically. Question B-3: Suppose that Y1, Y2, …, YN are independent random variables satisfying the properties of a generalized linear model where g(i)= xit (i=1, 2, …, N) and E(Yi)= i. Derive the asymptotic sampling distribution of the deviance D 2 ln( ) under H O : the model of interest is adequate, where L(b m ) / L(b) , b m is the MLE of β m for the saturated model, b is the MLE of β for the model of interest, and L is the likelihood function. Suppose that m is the maximum number of parameters that can be estimated by the saturated model and p is the number of parameters of the model of interest ( p m) . Hint: You may use the fact that (β b) t (b) (β b) ~ 2 ( p) asymptotically. Question B-4: Suppose that Y1 , Y2 , , YN are independent random variables such that Yi~Binomial(ni,i), i=1, 2, …, N. Consider modeling the probabilities i by the logistic model: logit ( i ) 1 2 xi ; i 1, 2,, N where x i is the i-th covariate pattern of the explanatory variables and is the vector of parameters. Suppose that ̂ i is the MLE of i based on this model and ŷi is the fitted value of yi . (i) Show that the deviance of this model can be given by: N y n yi D 2[ yi ln( i ) (ni yi ) ln( i )] yˆ i ni yˆ i i 1 (ii) Show that the deviance, D , is asymptotically equivalent to N ( y ni ˆ i ) 2 . Xˆ 2 i i 1 ni ˆ i (1 ˆ i ) (iii) Suppose that the MLEs of 1 and 2 based on the data given in the table below are b1 2.9 and b2 0.19 . i ni y i x i 1 10 2 5 2 10 3 15 3 10 8 20 4 10 9 25 (a) Calculate fitted values ŷi ’s. (b) Calculate the value of the deviance, D . (c) Is the model, logit ( i ) 1 2 xi , adequate to described the data? Use =0.1. -2- STAT- 627 Final Examination - First Semester 1427/1428 -------------------------------------------------------------------------------------------------------SECTION (C): Attempt 2 questions. Question C-1: Suppose that Y1 , Y2 , , Y10 are independent random variables such that Yi~Binomial(ni,i), i=1, 2, …, 10. Consider modeling the probabilities i by the following 3 logistic models: Model (1): logit ( i ) 0 1 xi 2 xi2 Model (2): logit ( i ) 0 1 xi Model (3): logit ( i ) 0 Suppose that there are only 5 different values (patterns) of the explanatory variable X. Based on the following information: Model maximum of log-likelihood function Saturated 30.55 1 31.95 2 32.05 3 104.55 (i) Is Model (1) adequate to describe the data? Use =0.05. (ii) Test H O : 2 0 against H1 : 2 0 . Use =0.05. Test H O : 1 2 0 against H1 : 1 0 or 2 0 . Use =0.05. Does the explanatory variable, X, has an effect on the proportions (response)? Use =0.05. (v) Is Model (2) adequate to describe the data? Use =0.05. (vi) Which model do you prefer among the three models? Justify your answer using =0.05. Question C-2: Suppose that Y1 , Y2 , , Y10 are independent random variables such that Yi~Binomial(ni,i), i=1, 2, …, 10. Consider modeling the probabilities i by the following 4 logistic models: Model (1): logit ( i ) 0 1 xi 2 zi 3 xi zi Model (2): logit ( i ) 0 1 xi 2 zi Model (3): logit ( i ) 0 1 xi Model (4): logit ( i ) 0 Suppose that all patterns of the explanatory variables are different and both of the explanatory variables X and Z are continuous. Based on the following information: Model maximum of log-likelihood function Saturated 47.023 1 48.723 2 249.078 3 265.035 4 270.311 (i) Is Model (1) adequate to describe the data? Use =0.05. (ii) Test H O : there is no interaction between X and Z against H1 : X and Z interact. Use =0.05. (iii) Does the explanatory variable Z have an effect on the proportions (iii) (iv) -3- STAT- 627 Final Examination - First Semester 1427/1428 -------------------------------------------------------------------------------------------------------(response)? Use =0.05. (iv) Test H O : both explanatory variables have no effect on the response against H1 : at least one explanatory variable affects the response. Use =0.05. (v) Which model do you prefer among the four models? Justify your answer using =0.05. Question C-3: Suppose that we are interested in studying the effect of the gender (j=1 for male, j=2 for female) on the employment opportunity within a year after graduation for the graduates of the college of science. Another explanatory variable that might affect the proportions (response) of graduates who employed during the first year of graduation is the grand point average (X) with 7 different levels ( x1 , x2 , x3 , x4 , x5 , x6 , x7 ). Define the following: n jk = no. of graduates with gender j and GPA x k . y jk = no. of graduates employed during the first year of graduation among the n jk graduates. Suppose that y jk ’s are independent and y jk ~ Binomial (n jk , jk ) . Consider modeling the probabilities jk (j=1, 2; and k=1, 2, …, 7) by the following 4 logistic models: Model (1): logit ( jk ) j j xk Model (2): logit ( jk ) j xk Model (3): logit ( jk ) xk Model (4): logit ( jk ) Based on the following information: Model Deviance 1 4.05 2 5.15 3 7.10 4 40.50 (i) Determine the maximum number of parameters that can be estimated by the saturated model? (ii) Is Model (1) adequate to describe the data? Use =0.05. (iii) Test H O : 1 2 against H1 : 1 2 . Use =0.05. (iv) Test H O : 1 2 and 1 2 against H1 : 1 2 or 1 2 . Use =0.05. (v) Does the gender affect the response (proportions of graduates employed during the first year of graduation). Use =0.05. (vi) Which model do you prefer among the four models? Justify your answer using =0.05. -4-
© Copyright 2026 Paperzz