| 1 | = Data Assimilation Diagnostics = |
| 2 | |
| 3 | [[PageOutline]] |
| 4 | |
| 5 | With version 1.12 of PDAF, we introduce a first set of routines to compute diagnostics about the ensembles. The diagnostics can be used to assess the quality of the ensemble. |
| 6 | |
| 7 | For now there are three routines: |
| 8 | |
| 9 | == PDAF_diag_effsample == |
| 10 | |
| 11 | This routine compute the effective sample size as used in particle filters. The effective sample size is define as the inverse of the sum of the squared particle weights: '''n_eff = 1 / sum[(w_i)^2^]'''. The effective sample size can range between one - if a single particle has the maximum weight and all other particles have zero weight - and |
| 12 | the actual sample size - if all samples have the same weight. |
| 13 | |
| 14 | For a documention on `PDAF_diag_effsample` see the [wiki:PDAF_diag_effsample detail page on PDAF_diag_effsample]. The routine is used in the NETF and LNETF filter methods of PDAF. |
| 15 | |
| 16 | |
| 17 | == PDAF_diag_histogram == |
| 18 | |
| 19 | Rank histograms are frequently used to assess the distribution of an ensemble around an observation or, in twin experiments, the true state. The histograms use bins computed form the ensemble distribution and count how frequent e.g. the observation falls into which bin. A flat histogram typically indicates a good ensemble. A concave (U-shaped) histogram indicates too little ensemble spread, while a convex histogram is obtained when the ensemble spread is too large. Further, a sloped histogram indicates bias. (A discussion on the interpretation of rank histograms can be found in Hamill, Monthly Weather Review, 129 (2001) 550-560) |
| 20 | |
| 21 | For a documention on `PDAF_diag_histogram` see the [wiki:PDAF_diag_histogram detail page on PDAF_diag_histogram]. The routine is used in the LOrenz-96 example in `testsuite/src/lorenz96/compute_truerms.F90`. |
| 22 | |
| 23 | |
| 24 | == PDAF_diag_ensstats == |
| 25 | |
| 26 | Ensemble Kalman filters assume that the ensemble is Gaussian distributed. In this case the distribution is symmetric and only the first and second moments of the distribution (the mean and standard deviation) are non-zero. The routine `PDAF_diag_ensstats` allows a data assimilation program to check the values of the third (skewness) and fourth (kurtosis) moment of the distribution. As there are different definition of the kurtosis, please note that PDAF uses the definition used by Lawson and Hansen, Mon. Wea. Rev. 132 (2004) 1966. |
| 27 | |
| 28 | For a documention on `PDAF_diag_ensstats` see the [wiki:PDAF_diag_ensstats detail page on PDAF_diag_ensstats]. The routine is used in the LOrenz-96 example in `testsuite/src/lorenz96/compute_truerms.F90`. |