moyhu: Global anomaly spatial sampling error

Friday, January 27, 2017

Global anomaly spatial sampling error - and why use anomalies?

In this post I want to bring together two things that I seem to be talking a lot about, especially in the wake of our run of record high temperatures. They are

What is the main component of the error that is quoted on global anomaly average for some period (month, year)? and
Why use anomalies? (an old perennial, see also GISS, NOAA)

I'll use the USHCN V2.5 dataset as a worked example, since I'm planning to write a bit more about some recent misuse of that. In particular I'll use the adjusted USHCN for 2011.

Using anomalies

I have been finding it necessary to go over some essentials of using anomalies. The basic arithmetic is

Compute some "normal" (usually a 30-year period time average for each month) for each station in the network,
Form local anomalies by subtracting the relevant normal from each reading
Average the anomalies (usually area-weighted)

People tend to think that you get the aanomaly average just by averaging, then subtracting an offset. That is quite wrong; they must be formed before averaging. Afterward you can shift to a different anomaly base by offsetting the mean.

Coverage error - spatial sampling error for the mean.

Indices like GISS and HADCRUT usually quote a monthly or annual mean with an uncertainty of up to 0.1°C. In recent years contrarians have seized on this to say that maybe it isn't a record at all - a "statistical tie" is a pet phrase, for those whose head hurts thinking about statistics. But what very few people understand is what that uncertainty means. I'll quote here from something I wrote at WUWT:

The way to think about stated uncertainties is that they represent the range of results that could have been obtained if things had been done differently. And so the question is, which "things". This concept is made explicit in the HADCRUT ensemble approach, where they do 100 repeated runs, looking at each stage in which an estimated number is used, and choosing other estimates from a distribution. Then the actual spread of results gives the uncertainty. Brohan et al 2006 lists some of the things that are varied.

The underlying concept is sampling error. Suppose you conduct a poll, asking 1000 people if they will vote for A or B. You find 52% for A. The uncertainty comes from, what if you had asked different people? For temperature, I'll list three sources of error important in various ways:

1. Measurement error. This is what many people think uncertainties refer to, but it usually isn't. measurement errors become insignificant because of the huge number of data that is averaged. measurement error estimates what could happen if you had used different observers or instruments to make the same observation, same time, same place.

2. Location uncertainty. Ths is dominant for global annual and monthly averages.You measured in sampled locations - what if the sample changed? You measured in different places around the earth? Same time, different places.

3. Trend uncertainty, what we are talking about above. You get trend from a statistical model, in which the residuals are assumed to come from a random distribution, representing unpredictable aspects (weather). The trend uncertainty is calculated on the basis of, what if you sampled differently from that distribution? Had different weather? This is important for deciding if your trend is something that might happen again in the future. If it is a rare event, maybe. But it is not a test of whether it really happened. We know how the weather turned out.

So here I'm talking about location uncertainty. What if you had sampled in different places. And in this exercise I'll do just that. I'll choose subsets of 500 of the USHCN and see what answers we see. That is why USHCN is chosen - there is surplus information from the dense coverage.

Why use anomaly?

We'll see. What I want to show is that it dramatically reduces location sampling error. The reason is that the anomaly set is much more homogeneous, since the expected value everywhere is more or less zero. So there is less variation in switching stations in and out. So I'll measure the error with and without anomaly formation.

USHCN example

So I'll look at the data for the 1218 stations in 2010, with an anomaly relative to the 1981-2010 average. In a Monte Carlo style, I make 1000 choices of 500 random stations, and find the average for 2011, first by just averaging station temperatures, and then the anomalies. The results (in °C) are:

Base 1981-2010, unweighted ..	Mean of means ..	s.d. of means
Temperatures	11.863	0.201
Anomalies	0.191	0.025

So the spatial error is reduced by a factor of 8, to an acceptable value. The error of temperature alone, at 0.201, was quite unacceptable. But anomalies perform even better with area-weighting, which should always be used. Here I calculate state averages and then area-weight the states (as USHCN used to do):

Update: I had implemented the area-weighting incorrectly when I posted about an hour ago. Now I think it is right, and the sd's are further reduced, although now the absolute improves by slightly more than the anomalies.

Base 1981-2010, area-weighted ..	Mean of means ..	s.d. of means
Temperatures	12.102	0.137
Anomalies	0.101	0.016

For both absolute T and anomalies, the mean has gone up, but the SD has reduced. In fact T improves by a slightly greater factor, but is still rather too high. The anomaly sd is now very good.

Does the anomaly base matter? A little, which is why WMO recommends the latest 3 decade period. I'll repeat the last table with the 1951-90 base:

Base 1951-80, area-weighted ..	Mean of means ..	s.d. of means
Temperatures	12.103	0.138
Anomalies	0.620	0.021

The T average is little changed, as expected. The small change reflects the fact that sampling 1000 makes the results almost independent of that random choice. But the anomaly mean is higher, reflecting warming. And the sd is a little higher, showing that subtracting a slightly worse estimate of the 2011 value (the older base) makes a less homogeneous set.

So what to make of spatial sampling error?

It is significant (with 500 station subsets) for anomaly, and the reason why large datasets are sought. In terms of record hot years, I think there is a case for omitting it. It is the error if between 2015 and 2016 the set of stations had been changed, and that happened only to a very small extent. I don't think the theoretical possibility of juggling the station set between years is an appropriate consideration for such a record.

Conclusion

Spatial sampling, or coverage error for anomalies is significant for ConUS. Reducing this error is why a lot of stations are used. It would be an order of magnitude greater without the use of anomalies, because of the much greater inhomogeneity, which is why one should never average raw temperatures spatially.

29 comments:

Bryan - oz4casterJanuary 28, 2017 at 9:41 AM
Nick, an interesting exercise. It definitely reaffirms the appropriateness of using temperature anomalies to construct regional or global surface temperature trends. I agree that spatial sampling is one of the larger error (uncertainty) sources for estimating these trends. However, as I'm sure you know, there are a lot of additional problems that increase the uncertainties, especially over longer time periods of 50 to 100 years or more. I touched on a few of them for land stations here, and I'm sure I missed some. I'm not sure that the typical uncertainty estimates offered with various global temperature trend estimates include all the important factors. I recognize that many sources of random error tend to cancel out over time with large numbers of samples. However, there are some types of error that can introduce false trends, such as station moves, and changes in the microscale environment at a station over time. And of course, we don't have many fixed stations in the oceans that cover ~70% of the globe. I'd love to see a global CRN based on the USCRN as a model, but also including strategically placed fixed ocean platforms. Some stations outside the US may already qualify, but we need a lot more for improved future assessments.
ReplyDelete
Replies
AnonymousJanuary 31, 2017 at 12:55 PM
"So here I'm talking about location uncertainty. What if you had sampled in different places."

Your entire analysis rests on the locations we have measurements for. However as per Cowtan and Way, much of the warming is coming from the locations we have no measurements and so your analysis of uncertainty doesn't apply because we never had measurements to include or exclude.

So bearing that in mind, why doesn't the global temperature anomaly uncertainty take into account those areas where we make up data? Surely that must have uncertainty beyond what you describe above.
ReplyDelete
Replies
AnonymousFebruary 2, 2017 at 12:19 PM
"You can observe the variations in anomaly in the thousands of points that are measured, and you can look at that variability over various scales. It seems to follow a pattern."

I think you're missing the point. Its these unmeasured areas that DONT follow the pattern. They're the areas where there is supposed to be extra warming although we're not actually measuring it.
ReplyDelete
Replies
Geoff SheringtonFebruary 2, 2017 at 5:56 PM
Thank you for yuor essay, Nick.
It raises more questions. Here is but one. You write -
"•Compute some "normal" (usually a 30-year period time average for each month) for each station in the network,
•Form local anomalies by subtracting the relevant normal from each reading
•Average the anomalies (usually area-weighted)."
But does not (or can not, and usually does)step 3 produce a new 'normal'? What error is associated with its production? What error is involved in the processes of calculating and subtracting your first 'normal'? Entropy type thoughts say no pain, no gain. You can get a different normal each time you use daily, monthly, seasonal, annual data for your 30 year period; then in GISS style, by frequent later adjustments to data, you really have to calculate a new anomaly with new look at a data set, also if it has been adjusted from the starting set. This is easily missed and it can lead to more errors.
I'm not used to assignment of separate error sources except for their final combination into overall. If it is done, it seems best to calculate the largest plausible overall error, then try to identify the largest isolated contributor, then see if that can be reduced. In a well managed data set, the main errors are in the unworked originals and in a bad one, the main errors might arise from adjustment.
There is still no argument to convince me that the present error bounds on land station T data are at all realistic. I write this partly because I feel that the final error bounds should cover at least the 'raw' and adjusted sets if there are both, since both at times can be used as valid for particular applications. Or conversely, time might not be wasted on some new applications if the reality of the wide spread of error bounds suspected, is there.
Geoff.
ReplyDelete
Replies

Add comment

An interactive topic index for all Moyhu posts.
Latest Ice and Temperature data
Climate Data Portals
A gallery of Javascript-enhanced graphics
Temperature trend viewer
Climate Plotter V2
Google Maps and GHCN
WebGL map of past GHCN/SST station temperatures
WebGL map of GHCN/SST station temperature trends
HiRes NOAA OI SST with WebGL and Movie
Regional Hi-Res SST movies
WebGL Facility
TempLS Guide
More pages, and blog glossary

moyhu

Friday, January 27, 2017

Global anomaly spatial sampling error - and why use anomalies?

Global anomaly spatial sampling error - and why use anomalies?

Using anomalies

Coverage error - spatial sampling error for the mean.

Why use anomaly?

USHCN example

So what to make of spatial sampling error?

Conclusion

29 comments:

Maintained Pages

Search This Blog

Recent Comments

Blogroll

Blog Archive

Translate

Resources

About Me

moyhu

Friday, January 27, 2017

Global anomaly spatial sampling error - and why use anomalies?

Global anomaly spatial sampling error - and why use anomalies?

Using anomalies

Coverage error - spatial sampling error for the mean.

Why use anomaly?

USHCN example

So what to make of spatial sampling error?

Conclusion

29 comments:

Maintained Pages

Search This Blog

Recent Comments

Blogroll

Subscribe To

Blog Archive

Translate

Resources

About Me