Information is now being written to another page.

Contents

Useful links

Terminolgy

Let say:

For each field we want to know
FIELD_QC
Not doneBadGood
|FIELD_DIFF|>tolerance DN (different-not done)DB (different-bad) DG (different-good)
|FIELD_DIFF|≤tolerance SN (same-not done)SB (same-bad) SG (same-good)
where

Strategy for creating a combined file with all the data

QC hierachy

The QC hierachy is

  1. Time
  2. Lat-lon. If we get this far, we can test
    1. PRESS_QC_PRF
    2. TMP_QC_PRF
    3. SAL_QC_PRF
  3. Depth. If we get this far, we can test
    1. TMP_QC_LVL
    2. SAL_QC_LVL

Matching Jim's flags with the DACs

BMRC

The profile data (looking at a file, 4901152.2010.nc, when Data Mode is "A" throughout - A is `real time with adjusted values').

What I'd like to know

I want to be sure that the DACs aren't simply throwing out data that is clearly wrong and not storing it in their files. If so, we're in danger of not giving them credit for throwing out bad data.

Julian days

YearStart in Julian days
200419,724
200520,089
200620,454
200720,819
200821,184
200921,550
201021,915

Notes

I'm only looking at data where all 5 Data Centres are defined and we have delayed mode QC that is 100% accepted or 100% rejected.

Excluding -999, the profiles take the following values

Data centres are

Before 2009, coriolis_tmp_qc is always 9.

Using all the data

Using all the profiles defined by both the delayed mode and the Data Centres. Only considering delayed mode profile where all the levels are either accepted or rejected.
Bad-reject Bad-accept Good-reject Good-accept Total ETS Bias FoM
BMRC 2103 (1.106%) 2442 (1.284%) 833 (0.438%) 184823 (97.172%) 190201 0.3830 1.55 0.649
FNMOC 2243 (0.861%) 31875 (12.229%) 4732 (1.815%) 221798 (85.095%) 260648 0.0351 4.89
MEDS 1195 (0.551%) 12792 (5.900%) 4092 (1.887%) 198743 (91.662%) 216822 0.0481 2.65
UKMO 115 (0.128%) 6288 (7.025%) 256 (0.286%) 82844 (92.560%) 89503 0.0133 17.26

Using all the levels defined by both the delayed mode and the Data Centres
Bad-reject Bad-accept Good-reject Good-accept Total ETS Bias FoM
BMRC 32,767 (0.237%) 156,742 (1.136%) 49,444 (0.358%) 13,560,006 (98.268%) 13,798,959 0.1330 2.31 0.546
Coriolis 285,257 (1.674%) 1,659,197 (9.735%) 88,162 (0.517%) 15,011,658 (88.074%) 17,044,274 0.1219 5.21
FNMOC 55,623 (0.309%) 1,981,459 (10.997%) 183,518 (1.018%) 15,797,903 (87.676%) 18,018,503 0.0130 8.52
MEDS 10,325 (0.069%) 877,361 (5.883%) 7,614 (0.051%) 14,019,228 (93.997%) 14,914,528 0.0104 49.48
UKMO 24,648 (0.330%) 407,955 (5.466%) 1,883,760 (25.240%) 5,146,907 (68.963%) 7,463,270 -0.0390 0.23

Using only the data when BMRC, Coriolis, FNMOC and MEDS are defined

For all levels
Bad-reject Bad-accept Good-reject Good-accept Total ETS Bias
bmrc 4,356 (0.037%) 89,960 (0.771%) 23,796 (0.204%) 11,548,575 (98.988%) 11,666,687 0.0350 3.35
coriolis 11029 (0.095%) 83411 (0.715%) 44980 (0.386%) 11527018 (98.805%) 11666438 0.0761 1.69
fnmoc 8540 (0.073%) 86086 (0.738%) 150541 (1.290%) 11423068 (97.899%) 11668235 0.0297 0.59
meds 3459 (0.030%) 91167 (0.781%) 5135 (0.044%) 11568471 (99.145%) 11668232 0.0340 11.01

Ranking the QC flags

BMRC

Profiles
QC flag (Jim's class)% of bad% of good
7 (accept)0.7799.23
8 (accept)7.792.3
11 (accept)2278
10 (accept, only 71 values)8020
9 (reject, only 197 values)6535
12 (reject, only 45 values)7525
According to this table, we should switch the QC for 9 and 10 around. However, since a QC of 9 is suppose to be reject and QC of 10 is suppose to be accept, I've put 10 above 9 and assumed the small sample for these has an influence.

However, putting a QC of 10 into reject does improve the skill
QC=10 is acceptQC=10 is reject
ETS0.3830.430
Bias1.551.34
FoM0.6490.680

Ranking for levels
QC flag (Jim's class)% of bad% of good
2 (accept)0.8999.11
5 (accept, 53,096 values)65.934.1
4 (reject, 82,211 values)39.960.1
This is weird! Supposedly, a QC of 5 - which we should accept according to Jim - has more bad data than the QC of 4 - which we should reject according to Jim. Not surprisingly making a QC of 5 a rejection improves the skill scores
QC=5 is acceptQC=5 is reject
ETS0.1330.258
Bias2.311.40
FoM0.5460.608

Coriolis

For levels
QC flag (Jim's class)% of bad% of good
1 (accept)1090
3 (reject)7525
4 (reject)7921
We can't improve the skill because the Bias is much greater than 1, yet we only have one flag (QC = 1) for accepting data.

FNMOC

For profiles
QC flag (Jim's class)% of bad% of good
94 (accept)17.582.5
95 (accept)19.880.2
96 (reject)20.479.6
97 (reject)21.578.5
98 (reject)27.372.7
99 (reject)29.170.9
100 (reject)43.856.3

Trying to improve the skill scores
QC > 95 is rejected QC > 94 is rejected QC > 93 is rejected QC > 91 is rejected QC > 90 is rejected QC > 89 is rejected
ETS0.03510.03620.0366 0.03700.03720.0368
Bias4.894.293.82 3.132.852.61
FoM0.5120.5120.513 0.5140.5140.514
Best ETS score is for rejecting all QC > 90.

For levels
QC flag (Jim's class)% of bad% of good
98 (accept)10.489.6
99 (accept)11.588.5
100 (reject)23.376.7
There are 10 values where the QC is greater than 100 (all for ArgoId 5901503 on 20821, which also one of two argos to give delayed mode QC above 100) - so looks like dodgy data.

Despite the bias being very high (8.52, so accepting too much data), we can't improve this (ETS and FoM are decrease if we reject QC of 99).

MEDS

Profiles
QC flag (Jim's class)% of bad% of good
5 (accept)5.4194.6
1 (accept)6.0593.95
4 (reject)24.375.7
3 (reject)15.085.0
Changing a QC of 1 to reject will not improve this.

Levels
QC flag (Jim's class)% of bad% of good
1 (accept)5.8994.1
3 (reject)45.754.3
4 (reject)59.240.8
Only 30 values above 4 and these are for the same time (juld=20139) and probably the same ArgoId. Changing what is accepting and rejected isn't going to help here.

Summary

For the four data centres where we have a lot of data, the bias values are all greater than 1 - indicating that they're accepting more data than there is good data. This suggests that we might be able to improve the skill scores for the observation centres if they rejected more data. The obvious data to reject would be those data where the QC flags suggests the data is only sufficiently OK to keep, and is considered doubtful.

This has been possbile for BMRC where the skill scores have been significantly increased by rejecting profiles with a QC of 10 and rejecting levels with a QC of 5. A slight improvement to the skill scores for the profiles for FNMOC can be made by increasing the amount of data that is rejected. But changing the accept/reject criteria can't improve the skill scores for Coriolis and MEDS.

Where the data comes from