date: Tue Sep 20 16:15:00 2005
from: Tim Osborn <t.osborn@uea.ac.uk>
subject: Re: optimal fingerprinting
to: Gerard van der Schrier <schrier@knmi.nl>

   At 08:17 20/09/2005, you wrote:

     The correlation between 'best-guess' amplitude and 'observed' is ca. 0.7, not too bad
     and obviously not produced by any rescale. Your observation that there was some
     overshoot in the best-guess was very valid: the results in the little report have been
     rescaled.

   yes, I see now the 0.26 scaling factor listed in the conclusions.

     In the optimal-fingerprinting algorithm of Allen and Tett is a scaling argument which I
     don't completely understand. Also, I use a NAG-routine to compute the prinicpal
     components, which applies a scaling. A quick calculation shows that if I made a mistake
     here, it would produce results which may not be needing this ad-hoc rescaling.

   There are a few different "conventions" regarding the scaling of EOFs/PCs.  The one that I
   prefer is to scaling the EOF patterns so that they are unit vectors, and applying the
   opposite scaling to the PCs so that their variance is equal to the eigenvalue (I think).
   But other conventions are to scale the EOFs so that their "length" is equal to
   sqrt(eigenvalue) and the variance of the PC becomes equal to the eigenvalue-squared, or to
   scale EOFs so that their "length" is equal to 1/sqrt(eigenvalue) and the PCs have unit
   variance.  I don't know which of these options the NAG-routines apply, nor which the
   Allen/Tett algorithm requires.  I can ask Nathan if you first check what the NAG-routines
   do (if it isn't documented, then just calculate the "length" of each EOF vector and see if
   it is 1, sqrt(e), 1/sqrt(e) or something else!).

     I'm now a little more excited about this fingerprinting. Do you think it would make a
     nice RAPID paper if we applied the fingerprinting tehnique on actual SSH measurements?
     The TOPEX/Poseidon data are available and they show a strong trend over the 90s. In the
     RAPID annual meeting, we've seen 3 estimates of the decrease in THC (or MHT) strength
     over this period. Ours would be a fourth complementary way, and the first time optimal
     fingerprinting is applied in an oceanographic context.
     I realize that this would mean a further alienation of the original idea to couple
     *proxy* data to ocean circulation.

   Despite this difference to the original idea, such a paper would be worthwhile.  But the
   biggest problem is likely to be distinguishing the MHT-trend from the GHG-warming-trend,
   both of which will influence SSH.  The GHG-warming signal in SSH is not well known, being
   very different between models and also already incorporating a combination of GHG-warming
   plus MHT-weakening in some models.  Perhaps GHG-warming without any ocean circulation
   response would produce a more uniform pattern of SSH increase?  In which case, using the
   deviations in SSH from the spatial-mean increase might help?  Also, reviewers might
   complain that we only used HadCM3 to estimate the SSH signal pattern - they might ask
   whether other models would yield very different patterns?  That might be avoided by calling
   this a first attempt, allowing multi-model comparisons to be left until later work (by us
   or others)?

     I realize that. I've downloaded the DAI-precipitation, so a quick comparison should not
     be too difficult.

   Yes - a quick comparison to see if differences in precipitation data explain the
   differences in recent PDSI trend should be sufficient.

     I have done that already. The results are not very spectacular. I did the trick with
     replacing actual temperatures for the climatological temperatures in a paper on the
     ALP-IMP data.  A huge impact of higher surface temperatures on the areal extent of
     drought was found. For the US, no such thing happens. Phil recently send me an email,
     predicting this result! He also wrote that this result would indicate that there is
     nothing wrong with the CRU-temperatures. I don't quite understand this remark, so I will
     have to get back on that.

   I don't understand Phil's remark either.  But the result itself could be mentioned in the
   paper, because it is interesting to know that the temperature changes aren't causing much
   trend in PDSI.

     Ken Kunkel replied to my email (I'll forward it.) His datasets are available and seem to
     be well-documented. Do we really want to get into the trouble of making a second scPDSI
     dataset with his data? After all, we focus on (sc)PDSI, rather than precipitation. I
     guess your first reaction was to avoid it, if possible.

   Hmm.  I still want to avoid much extra work.  These are all daily data, so would need to be
   made into monthly totals.  Then you would need to locate which 0.5deg boxes each station
   was in, take the 0.5deg monthly temperatures and the Kunkel station precipitation together
   to compute PDSI and compare that with the 0.5deg box PDSI that you already computed.
   Sounds like a lot of work to do for all 0.5deg boxes with Kunkel stations in them.  Should
   I ask Keith - he's back from his holiday tomorrow (Wednesday).
   Cheers
   Tim