Context Navigation

Changes between Version 16 and Version 17 of FirstSteps

Timestamp:: May 31, 2025, 7:02:03 PM (7 weeks ago)
Author:: lnerger
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

FirstSteps

-              v16
+              v17
 demonstrating the implementation and application of PDAF with a simple 2-dimensional example.
+Here we describe the steps for a Linux-based computer.
 == First Test Case - A Single Analysis Step ==
 In this test case, a data assimilation program is used to read an ensemble of model fields from files and compute an analysis step. This is the so-called offline coupling mode of PDAF. A full description of this test case is provided in the [wiki:PdafTutorial implementation tutorial for the offline mode of PDAF].
+In this test case, a data assimilation program is used to read an ensemble of model fields from files and compute an analysis step. This is the so-called ''offline coupled mode'' of PDAF. A full description of this test case is provided in the [wiki:PdafTutorial implementation tutorial for the offline mode of PDAF].
 === Compiling ===
 We recommend to first look at the tutorial offline_2D_serial, which is a
+We recommend to first look at the tutorial `offline_2D_serial`, which is a
 single analysis step acting on an ensemble of 2D model fields without any
 parallelization.
 …
 and run
 {{{
 make PDAF_ARCH=linux_gfortran_openmpi
+make PDAF_ARCH=linux_gfortran
 }}}
 This compiles the assimilation program including PDAF. The compilation
 …
 ./PDAF_offline
 }}}
+The program reads ensemble files and a file holding the observations from the directory `tutorial/inputs_offline/`. Then it computes a single analysis step of the ensemble Kalman filter ESTKF. Running the program should not take more than a second. The program generates the
+output files
+`state_ana.txt` (The analysis state)
+and
+`ens_01_ana.txt` to `ens_09_ana.txt` (the analysis ensemble).
+The screen output shows the progress of the program. For example the ensemble standard deviation before and after the analysis step and final timing and memory information are shown. Lines starting with 'PDAF' are outputs from the code part of PDAF other lines are from the user routines.
+The program reads ensemble files and a file holding the observations from the directory `tutorial/inputs_offline/`. Then, it computes a single analysis step of the ensemble Kalman filter ESTKF. Running the program should only take seconds. The program generates the output files
+* `state_ana.txt` (the analysis state)
+* `ens_01_ana.txt` to `ens_09_ana.txt` (the analysis ensemble).
+The screen output shows the progress of the program. For example, the ensemble standard deviation before and after the analysis step and final timing and memory information are shown. Lines starting with 'PDAF' are outputs from the core part of PDAF; other lines are from the user routines.
 === Plotting ===
+To plot, e.g. the analysis field you can use Matlab or Octave and do
+{{{
+load state_ana.txt
+pcolor(state_ana)
+}}}
+You can also plot the initial ensemble mean field by
+{{{
+cd ../inputs_offline
+load state_ini.txt
+pcolor(state_ini)
+}}}
+Analogously you can plot the observations (`obs.txt`) and the true state (`true.txt`) from which the observations have been generated. In the
+observation file, only 28 grid points are observed, while non-observed grid points have the value -999.0. To get a meaningful plot, you can specify the limits for the color map by
+{{{
+set(gca,'clim',[-2 2])
+}}}
+Alternatively, you can plot using Python with e.g.
+You can plot using Python with, for example
 {{{
 import numpy as np
 …
 }}}
+Analogously you can plot the observations (`obs.txt`) and the true state (`true.txt`) from which the observations have been generated. These files are in the input directory. In the observation file, only 28 grid points are observed, while non-observed grid points have the value -999.0. To get a meaningful plot, you can specify the color limits by
+{{{
+plt.clim([-1.5, 1.5])
+}}}
+before showing the plot.
 === Assimilation Options ===
+There are various options you can set to modify the assimilation,
+There are various options you can set on the command line to modify the assimilation.
 For example you can run
 {{{
 ./PDAF_offline -filtertype 7
 }}}
 With this setting, the localized filter LESTKF instead of the global ESTKF.
 Without further settings, the localization radius is set to 0 so that only
+to apply the localized filter LESTKF instead of the global ESTKF.
+Without further settings, the localization radius is set to 0.0 so that only
 the observed grid points are changed by the assimilation.
 You can further set the localization cut-offg radius with
+You can further set the localization cut-off radius with
 {{{
 ./PDAF_offline -filtertype 7 -cradius 5.0
 }}}
 Now the LESTKF is used with a localization radius of 5 grid points. This
 localization still uses a constant weight of the observation. So you will
+Now, the LESTKF is used with a localization radius of 5 grid points. This
+localization uses a constant weight of the observation. So you will
 see steps in the analysis fields around each observation. To add a
 tapering so that observations get less influence for increasing distance,
 …
 }}}
 Now, the filter is applied with the 5th-order polynomial function by
 Gaspari and Cohn. As a result you get a smoothly changing analysis field.
+Gaspari and Cohn (1999). As a result you get a smoothly varying analysis field.
 You can also change the ensemble size, e.g. running
 {{{
 …
 }}}
 to run with an ensemble of 5 model states. (For this test case we only
 prepared 9, so only dim_obs<=9 is possible to run here.)
 The standard deviation (RMS error) of the observation is set to 0.5 in the program. To change it to, e.g. 2.0, would would run
+prepared 9 ensemble files, so only dim_ens<=9 is possible to run here. Please note that such ensemble size is usually too low for real cases)
+The standard deviation (RMS error) of the observation is set to 0.5 in the program. To change it to, e.g. 2.0, you can run
 {{{
 ./PDAF_offline -rms_obs 2.0
 }}}
 Also the inflation can be specified on the command line. PDAF uses the so-called forgetting factor, which is a positive value <=1 (the ensemble variance is influted by the inverse of the forgetting factor). One can specify the forgetting factor as
+Also the inflation can be specified on the command line. PDAF uses the so-called forgetting factor, which is a positive value <=1.0 (the ensemble variance is inflated by the inverse of the forgetting factor). One can specify the forgetting factor as
 {{{
 ./PDAF_offline -forget 0.9
 }}}
 All the different options can be combined. For a complete list of possible options, see the file `init_pdaf_offline.F90`, which is the source code file in which the default values of options are specified.
+All the different options can be combined. For a complete list of possible options, see the file `mod_assimilate.F90`, which is the source code file in which the default values of options are declared and explained.
 …
 == Second Test Case - A Sequence of Analysis Steps ==
 As a second test case, we recommend to look at the tutorial online_2D_serialmodel.
+As a second test case, we recommend to look at the tutorial `online_2D_serialmodel`.
 This case is again a simple 2D model field, but now coupled to PDAF with
 time stepping. This is the so-called online-coupling of PDAF, in which the model code is augmented with data assimilation functionality provided by PDAF. A full description of this test case is provided in the [wiki:PdafTutorial implementation tutorial for the online mode of PDAF].
+time stepping. This is the so-called ''online coupling'' of PDAF, in which the model code is augmented with data assimilation functionality provided by PDAF. A full description of this test case is provided in the [wiki:PdafTutorial implementation tutorial for the online mode of PDAF].
 === Compiling ===
 …
 }}}
 and run
-{{{
-make cleanall PDAF_ARCH=linux_gfortran_openmpi
-}}}
-and then
 {{{
 make model_pdaf PDAF_ARCH=linux_gfortran_openmpi
 …
 This compiles the assimilation program including PDAF. The compilation
 should work for computers running Linux, but it requires that OpenMPI
+is installed on the computer. If it's not installed, please install it
+using the Linux package providing it. If the compilation still fails,
+is installed on the computer. If it is not installed, please install the Linux package providing it. If the compilation still fails,
 please see below the section [#CompilationProblems Compilation Problems].
 …
 Having compiled the program, you can just run it by executing
 {{{
+mpirun -np 9 ./model_pdaf -dim_ens 9
+}}}
+The program computes a sequence of 9 analysis steps with two model time
+steps in between subsequent analysis steps. The initial ensemble are read from the directory `tutorial/inputs_online/`, where also the observation files are stored. The assimilation uses of the
+mpirun -np 5 ./model_pdaf -dim_ens 5
+}}}
+The program computes a sequence of 9 analysis steps with of forecase phase of 2 time steps (thus two model time steps are computed in between subsequent analysis steps). The initial ensemble files are read from the directory `tutorial/inputs_online/`, where also the observation files are stored. The assimilation uses of the
 ensemble Kalman filter ESTKF. It should not take more than a few seconds.
+The program generates the
+output files
+`state_stepX_ana.txt` (The analysis state at time step X)
+and
+`ens_01_stepX_ana.txt` to `ens_09_stepX_ana.txt` (the analysis ensemble at time step X)
+`ens_01_stepX_for.txt to `ens_09_stepX_for.txt` (the forecast ensemble at time step X)
+The program generates the output files
+* `state_stepX_ana.txt` (the analysis state at time step X)
+* `ens_01_stepX_ana.txt` to `ens_09_stepX_ana.txt` (the analysis ensemble at time step X)
+* `ens_01_stepX_for.txt` to `ens_09_stepX_for.txt` (the forecast ensemble at time step X)
 === Plotting ===
 …
 inputs_online/
+To plot the analysis field at time step 10, you can do
+{{{
+load state_step10_ana.txt
+pcolor(state_step10_ana)
+}}}
+You can also plot the initial model field by
+{{{
+cd ../inputs_online
+load state_ini.txt
+pcolor(state_ini)
+}}}
+The directory `inputs_online/` also contains files for the true state at time steps 1 to 18.
+For example, you can plot the true state at time step 15, with
+{{{
+load true_step15.txt
+pcolor(true_step15)
+}}}
+Analogously you can plot the observations (`obs_stepX.txt`) with time step X. In the
+observation file, observation gaps are indiced by the value -999.0. So
+to get a meaningful plot, you can specify the limits for the color map by
+{{{
+set(gca,'clim',[-2 2])
+}}}
+You can plot using Python with, for example
+{{{
+import numpy as np
+import matplotlib.pyplot as plt
+file = 'state_step10_ana.txt'
+field = np.loadtxt(file)
+plt.pcolor(field)
+plt.show()
+}}}
+Analogously, you can also plot the initial model field `state_ini.txt`, the true state (`true_stepX.txt`), or observations (`obs_stepX.txt`) with X the time step.
 === Assimilation Options ===
+The same options as for the first test case can be used here, too. In addition, one can specify the
+forecast length (number of time steps between two analysis steps by
+The same options as for the first test case can be used here, too.
+In addition, one can specify the
+forecast length (number of time steps between two analysis steps) by
 {{{
 mpirun -np 9 ./model_pdaf -dim_ens 9 -delt_obs 6