Homework # 5     -     due Wednesday, 5th April 2006 at 4.15pm

Problems students are to do -
    - UG students - please do problems 1 and 2 below
    - G students -  please do problems 1 - 4 below

As always, list necessary assumptions, and include respective p-values in parentheses next to
your conclusion - e.g., one might conclude, "the data suggested a difference in the treatments
(p = 0.0013)."

1.  Huet, Bouvier, et al (Statistical Tools for Nonlinear Regression, p.2) use the Pasture Regrowth
    data from Ratkowsky (Nonlinear Regression, p.88) to fit a four-parameter sigmoidal growth
    model.  In the dataset, Y = pasture regrowth (since last grazing), and X = time, and for our present
    purposes, we can assume that the data are independent measurements.  The model function that
    these authors used to fit the data is rather complicated, and coming up with starting values for the
    model parameters is not easy, and can only come after we understand the roles they play.
    (a) Look at this SAS Program/Output and write down the assumed model function and discuss the roles
      of the model parameters which play a role in determining the upper and lower asymptotes of the
      model function; assume that q4 is positive. Next, look only at the plot to obtain a guess of
      these asymptotes, and discuss how to get good starting values for q1 and q2. Next, and most
      challenging, get a starting value for q3 and q4 by showing how the fitted linear regression is
      related to the original model function.
    (b) Look at the NLIN output and comment on the estimated upper and lower asymptotes (using
       the parameter estimates). Also, based on the output, do a (two-tailed) Wald test that q4 = 3 using
      a = 1%.   Redo this (Wald) test using a = 5%.  Clearly report your conclusions in each case.
    (c) Repeat your tests in part (b) using LR tests.
    (d) Examine the residuals, and comment on your findings. If the NLIN were to be rerun with
      the third point (x = 21) removed, would the estimate of the lower asymptote increase or decrease?
      Why?

2.  In Nonlinear Regression Analysis and its Applications (1988, p.269), Bates and Watts report data
    from Treloar (1974) regarding the "velocity" of an enzymatic reaction.  The number of counts per minute
    of radioactive product from the reaction was measured as a function of substrate concentration (ppm),
    and from these counts the initial rate, or "velocity," of the reaction was calculated (counts/min2).  The
    experiment was conducted once with the enzyme treated with Puromycin (treated = "yes") and once
    with the enzyme untreated (treated = "no").  The velocity is assumed to depend on the substrate
    concentration according to the Michaelis-Menton (MM2) equation.  It has been hypothesized that
    the "ultimate velocity parameter" (q1) should be affected by introduction of the Puromycin,
    but not necessarily the "half-velocity parameter" (q2). This SAS program/output may help us to
    answer these queries.
    (a) Give estimates for the MM2 model parameters for both the treated and untreated curves; display
       these in a 2x2 box.
    (b) Using Wald hypothesis tests, test the relevant hypotheses, reporting test statistics, p-values and
        conclusions.  Approximate p-values as best you can here and in the next part.
    (c) Using a full-and-reduced (likelihood-based) F-test, test whether the half-velocity parameters
        are the same, reporting the test statistic, p-value and conclusion.

3.  In "Calibration and assay development using the four-parameter logistic curve" (Chem. Intell.
    Lab. Systems, 1993, p.97), O'Connell et al fit the LL4 (four-parameter log-logistic) model
    function to radioimmunoassay (RIA) data. Click here for the data and analysis in SAS. This
    program first uses PROC NLIN and then three PROC NLMIXEDs.
    (a) Comment on the necessary assumptions and the roles of the parameter for the model
      fit in the NLIN. Look at the residual plot and comment on whether all necessary
      assumptions (of the NLIN) appear to be met.  The first PROC NLMIXED just fits the
      same homoskedastic (assumed constant variance) model.
    (b) Explain what is being done in the second PROC NLMIXED in terms of the new "model"
      and the roles of the parameters, and note the value of -2LL.  Perform a likelihood-based test of
      whether the extra parameter (r) in the variance is need, writing out your hypotheses, test
      statistic, p-value,  and conclusion.
    (c) It turns out that the third PROC NLMIXED involves another, more appropriate, way of
      modelling the variance.  Comparing this latter NLMIXED with the first NLMIXED, perform
      a likelihood-based hypothesis test testing for homoskedasticity, again writing out your hypotheses,
      test statistic, p-value,  and conclusion.  Finally, compare the parameter estimates and SEs for
      this latter (third) and the first NLMIXED - what has changed?

4. Examine this SAS program/output and determine what model is being fit. Write down the
    model function. What is the relevance of the parameter named PHI?  Write down it's estimate, do
    a (WALD) t-test that the true value of PHI is equal to -0.40 (using a = 5%), and write down the 95 %
    Wald Confidence Interval (WCI) for this parameter. Next, do the likelihood test of whether PHI is
    equal to -0.40 (using a = 5%).  It turns out that the true 95% confidence interval for PHI here is very
    different from the WCI; in your opinion is the true CI shifted to the right or to the left of the WCI? Why?