3.5. Troubleshooting

Check the FAQ: https://github.com/CICE-Consortium/CICE/wiki

3.5.1. Initial setup

If there are problems, you can manually edit the env, Macros, and cice.run files in the case directory until things are working properly. Then you can copy the env and Macros files back to configuration/scripts/machines.

Changes made directly in the run directory, e.g. to the namelist file, will be overwritten if scripts in the case directory are run again later.

If changes are needed in the cice.run.setup.csh script, it must be manually modified.

Ensure that the block sizes block_size_x and block_size_y, and the maximum number of blocks max_blocks, are compatible with processor_shape and the other domain options in ice_in.
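
As a rough worked example (the grid size, block sizes, and task count below are hypothetical and only illustrate the arithmetic):

    ! Illustrative numbers only, not a recommended configuration:
    ! a 320 x 384 global grid with block_size_x = 40 and block_size_y = 48
    ! gives 320/40 = 8 blocks in x and 384/48 = 8 blocks in y, i.e. 64 blocks.
    ! With 32 MPI tasks and a cartesian distribution, each task owns
    ! 64/32 = 2 blocks, so max_blocks must be at least 2; rake or spacecurve
    ! distributions may assign more blocks to some tasks and need a larger value.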

If using the rake or space-filling curve algorithms for block distribution (distribution_type in ice_in), the code will abort if max_blocks is not large enough; the correct value is provided in the diagnostic output. Also, the spacecurve setting can only be used with block sizes that result in the numbers of blocks in the x and y directions being factorable entirely into 2s, 3s, and 5s.
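
For instance (an illustrative factorization check, not drawn from any particular configuration):

    ! spacecurve requires the block counts in x and y to factor into 2s, 3s, and 5s:
    !    8 blocks = 2*2*2   -> allowed
    !   15 blocks = 3*5     -> allowed
    !   14 blocks = 2*7     -> not allowed (7 is not one of 2, 3, 5)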

If starting from a restart file, ensure that kcatbound is the same as that used to create the file (kcatbound = 0 for the files included in this code distribution). Other configuration parameters, such as NICELYR, must also be consistent between runs.

For stand-alone runs, check that -Dcoupled is not set in the Macros.* file.

For coupled runs, check that -Dcoupled and other coupled-model-specific preprocessing options (e.g., CESM, popcice, or hadgem) are set in the Macros.* file.
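
As a rough illustration of why these flags matter (the guarded routine below is hypothetical; the actual CPP blocks differ between versions and couplers), coupler-specific source code is only compiled when the corresponding preprocessor flag is defined:

    #ifdef coupled
          ! compiled only when -Dcoupled is set in the Macros.* file;
          ! omitted entirely from stand-alone builds
          call coupler_exchange_fluxes   ! hypothetical coupled-only call
    #endif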

Set ICE_CLEANBUILD to true to clean before rebuilding.

3.5.2. Restarts

Manual restart tests require that the path to the restart file be included in the namelist file ice_in.

Ensure that kcatbound is the same as that used to create the restart file. Other configuration parameters, such as nilyr, must also be consistent between runs.

CICE v5 and later use a model configuration that makes restarting from older simulations difficult. In particular, the number of ice categories, the category boundaries, and the number of vertical layers within each category must be the same in the restart file and in the run restarting from that file. Moreover, significant differences in the physics, such as the salinity profile, may cause the code to fail upon restart. Therefore, new model configurations may need to be started using runtype = ‘initial’. Binary restart files that were provided with CICE v4.1 were made using the BL99 thermodynamics with 4 layers and 5 thickness categories (kcatbound = 0) and therefore cannot be used for the default CICE v5 and later configuration (7 layers). In addition, CICE’s default restart file format is now netCDF instead of binary.

Restarting a run using runtype = ‘continue’ requires restart data for all tracers used in the new run. If tracer restart data is not available, use runtype = ‘initial’, setting ice_ic to the name of the core restart file and setting the namelist restart flag to true for each tracer for which restart data is available. The unavailable tracers will be initialized to their default settings.
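
A hedged sketch of the corresponding ice_in settings is shown below; the group and flag names follow typical CICE namelists but should be confirmed against the ice_in for your version, and the restart file name is made up:

    &setup_nml
      runtype = 'initial'
      ice_ic  = './restart/iced.2005-01-01-00000.nc'   ! hypothetical core restart file
    /
    &tracer_nml
      tr_pond_lvl      = .true.
      restart_pond_lvl = .true.    ! pond restart data is available in the restart files
      tr_aero          = .true.
      restart_aero     = .false.   ! no aerosol restart data; initialized to defaults
    /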

On tripole grids, use restart_ext = true when using either binary or regular (non-PIO) netCDF.

Provided that the same number of ice layers (default: 4) will be used for the new runs, it is possible to convert v4.1 restart files to the new file structure and then to netCDF format. If the same physical parameterizations are used, the code should be able to execute from these files. However, if different physics is used (for instance, mushy thermodynamics instead of BL99), the code may still fail. To convert a v4.1 restart file, consult section 5.2 in the CICE v5 documentation.

If restart files are taking a long time to be written serially (i.e., not using PIO), see the next section.

3.5.3. Slow execution

On some architectures, underflows (\(10^{-300}\) for example) are not flushed to zero automatically. Usually a compiler flag is available to do this (for example, -ftz with Intel compilers), but if not, try uncommenting the block of code at the end of subroutine stress in ice_dyn_evp.F90 or ice_dyn_eap.F90. You will take a hit for the extra computations, but it will not be as bad as running with the underflows.
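
If you do uncomment it, the block does something along the following lines (a paraphrased sketch, not the exact code; the stress variable names and the small constants puny and c0 are as defined in the CICE source and may differ between versions):

          ! flush tiny stresses to zero so subsequent operations avoid underflows
          if (abs(stressp_1 (i,j)) < puny) stressp_1 (i,j) = c0
          if (abs(stressm_1 (i,j)) < puny) stressm_1 (i,j) = c0
          if (abs(stress12_1(i,j)) < puny) stress12_1(i,j) = c0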

3.5.4. Debugging hints

Several utilities are available that can be helpful when debugging the code. Not all of these will work everywhere in the code, due to possible conflicts in module dependencies.

debug_ice (CICE.F90)

A wrapper for print_state that is easily called from numerous points during the timestepping loop (see CICE_RunMod.F90_debug, which can be substituted for CICE_RunMod.F90).

print_state (ice_diagnostics.F90)

Print the ice state and forcing fields for a given grid cell.

dbug = true (ice_in)

Print numerous diagnostic quantities.

print_global (ice_in)

If true, compute and print numerous global sums for energy and mass balance analysis. This option can significantly degrade code efficiency.

print_points (ice_in)

If true, print numerous diagnostic quantities for two grid cells, one near the north pole and one in the Weddell Sea. This utility also provides the local grid indices and block and processor numbers (ip, jp, iblkp, mtask) for these points, which can be used in conjunction with check_step to call print_state (a call sketch follows this list). These flags are set in ice_diagnostics.F90. This option can be fairly slow, due to gathering data from multiple processors.

global_minval, global_maxval, global_sum (ice_global_reductions.F90)

Compute and print the minimum and maximum values for an individual real array, or its global sum.
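
The sketch below illustrates how print_state might be called at a chosen timestep using the point information reported by print_points; the module and variable names follow typical CICE versions, and the call signature should be verified against ice_diagnostics.F90 before use:

    ! Hedged illustration only; check the actual print_state interface in
    ! ice_diagnostics.F90 for your CICE version.
    use ice_diagnostics, only: print_state

    ! ip, jp, iblkp and mtask are the values reported by print_points;
    ! check_step (set in ice_diagnostics.F90) is the timestep at which to start
    if (istep1 >= check_step .and. my_task == mtask) &
       call print_state ('after column physics', ip, jp, iblkp)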

3.5.5. Known bugs

  • Fluxes sent to the CESM coupler may have incorrect values in grid cells that change from an ice-free state to having ice during the given time step, or vice versa, due to scaling by the ice area. The authors of the CESM flux coupler insist on the area scaling so that the ice and land models are treated consistently in the coupler (but note that the land area does not suddenly become zero in a grid cell, as does the ice area).

  • With the old CCSM radiative scheme (shortwave = ‘default’ or ‘ccsm3’), a sizable fraction (more than 10%) of the total shortwave radiation is absorbed at the surface but should be penetrating into the ice interior instead. This is due to use of the aggregated, effective albedo rather than the bare ice albedo when snowpatch \(< 1\).

  • The date-of-onset diagnostic variables, melt_onset and frz_onset, are not included in the core restart file, and therefore may be incorrect for the current year if the run is restarted after Jan 1. Also, these variables were implemented with the Arctic in mind and may be incorrect for the Antarctic.

  • The single-processor system_clock time may give erratic results on some architectures.

  • History files that contain time averaged data (hist_avg = true in ice_in) will be incorrect if restarting from midway through an averaging period.

  • In stand-alone runs, restarts from the end of ycycle will not be exact.

  • Using the same frequency twice in histfreq will have unexpected consequences and cause the code to abort.

  • Latitude and longitude fields in the history output may be wrong when using padding.

  • History and restart files will not be written on the first timestep in some cases.

3.5.6. Interpretation of albedos

More information about interpretation of albedos can be found in the Icepack documentation.

3.5.7. Proliferating subprocess parameterizations

With the addition of several alternative parameterizations for sea ice processes, a number of subprocesses now appear in multiple parts of the code with differing descriptions. For instance, sea ice porosity and permeability, along with associated flushing and flooding, are calculated separately for mushy thermodynamics, topo and level-ice melt ponds, and for the brine height tracer, each employing its own equations. Likewise, the BL99 and mushy thermodynamics compute freeboard and snow–ice formation differently, and the topo and level-ice melt pond schemes both allow fresh ice to grow atop melt ponds, using slightly different formulations for Stefan freezing. These various process parameterizations will be compared and their subprocess descriptions possibly unified in the future.