3.1. Implementation

CICE is written in Fortran90 and runs on platforms using UNIX, LINUX, and other operating systems. The current coding standard is Fortran 2003, with use of the Fortran 2008 CONTIGUOUS attribute in the 1D EVP solver. The code is based on a two-dimensional horizontal orthogonal grid that is broken into two-dimensional horizontal blocks and parallelized over blocks with MPI and OpenMP threads. The code also includes some optimizations for vector architectures.

CICE consists of source code under the cicecore/ directory that supports model dynamics and top-level control. The column physics source code is under the icepack/ directory and is implemented as a git submodule that points to a separate repository (Icepack). There is also a configuration/ directory that includes scripts for configuring CICE cases.

3.1.1. Directory structure

The present code distribution includes source code and scripts. Forcing data is available from the ftp site. The directory structure of CICE is as follows:

LICENSE.pdf

license for using and sharing the code

DistributionPolicy.pdf

policy for using and sharing the code

README.md

basic information and pointers

icepack/

the Icepack module. The icepack subdirectory includes Icepack specific scripts, drivers, and documentation. CICE only uses the columnphysics source code under icepack/columnphysics/.

cicecore/

CICE source code

cicecore/cicedyn/

routines associated with the dynamics core

cicecore/drivers/

top-level CICE drivers and coupling layers

cicecore/shared/

CICE source code that is independent of the dynamical core

cicecore/version.txt

file that indicates the CICE model version.

configuration/scripts/

support scripts, see Scripts

doc/

documentation

cice.setup

main CICE script for creating cases

dot files

various files that begin with . and store information about the git repository or other tools.

A case (compile) directory is created upon initial execution of the script cice.setup at the user-specified location provided after the -c flag. Executing the command ./cice.setup -h provides helpful information for this tool.

3.1.2. Grid, boundary conditions and masks

The spatial discretization of the original implementation is specialized for a generalized orthogonal B-grid as in [42] or [55]. Figure Schematic of CICE B-grid is a schematic of the CICE B-grid. The cell with the tracer point \(t(i,j)\) in the middle is referred to as a T-cell. The ice and snow area, volume and energy are given at the t-point. The velocity \({\bf u}(i,j)\) associated with \(t(i,j)\) is defined in the northeast (NE) corner. The other corners of the T-cell are northwest (NW), southwest (SW) and southeast (SE). The lengths of the four edges of the T-cell are HTN, HTW, HTS and HTE for the northern, western, southern and eastern edges, respectively. The lengths of the T-cell through the middle are dxT and dyT along the x and y axes, respectively.

We also occasionally refer to “U-cells,” which are centered on the northeast corner of the corresponding T-cells and have velocity in the center of each. The velocity components are aligned along grid lines.

The internal ice stress tensor takes four different values within a grid cell with the B-grid implementation; bilinear approximations are used for the stress tensor and the ice velocity across the cell, as described in [22]. This tends to avoid the grid decoupling problems associated with the B-grid.

Figure: Schematic of CICE B-grid.

The ability to solve on the C and CD grids was added later. With the C-grid, the u velocity points are located on the E edges and the v velocity points are located on the N edges of the T cell rather than at the T cell corners. On the CD-grid, the u and v velocity points are located on both the N and E edges. To support this capability, N and E grids were added to the existing T and U grids; the N and E grids are defined at the northern and eastern edges of the T cell. This is shown in Figure Schematic of CICE CD-grid.

Figure: Schematic of CICE CD-grid.

The user has several ways to initialize the grid: popgrid reads grid lengths and other parameters for a nonuniform grid (including tripole and regional grids), and rectgrid creates a regular rectangular grid. The input files global_gx3.grid and global_gx3.kmt contain the \(\left<3^\circ\right>\) POP grid and land mask; global_gx1.grid and global_gx1.kmt contain the \(\left<1^\circ\right>\) grid and land mask, and global_tx1.grid and global_tx1.kmt contain the \(\left<1^\circ\right>\) POP tripole grid and land mask. These are binary unformatted, direct access, Big Endian files.

The input grid file for the B-grid and CD-grid is identical. That file contains each cell's HTN, HTE, ULON, ULAT, and kmt value. From those variables, the longitude, latitude, grid lengths (dx and dy), areas, and masks can be derived for all grids. Table Primary CICE Prognostic Grid Variable Names lists the primary prognostic grid variable names on the different grids.

Primary CICE Prognostic Grid Variable Names

  variable         T       U       N       E
  ---------------  ------  ------  ------  ------
  longitude        TLON    ULON    NLON    ELON
  latitude         TLAT    ULAT    NLAT    ELAT
  dx               dxT     dxU     dxN     dxE
  dy               dyT     dyU     dyN     dyE
  area             tarea   uarea   narea   earea
  mask (logical)   tmask   umask   nmask   emask
  mask (real)      hm      uvm     npm     epm

In CESM, the sea ice model may exchange coupling fluxes using a different grid than the computational grid. This functionality is activated using the namelist variable gridcpl_file.

3.1.2.1. Grid domains and blocks

In general, the global gridded domain is nx_global \(\times\)ny_global, while the subdomains used in the block distribution are nx_block \(\times\)ny_block. The physical portion of a subdomain is indexed as [ilo:ihi, jlo:jhi], with nghost “ghost” or “halo” cells outside the domain used for boundary conditions. These parameters are illustrated, in one dimension, in Figure Grid parameters. The routines global_scatter and global_gather distribute information from the global domain to the local domains and back, respectively. If MPI is not being used for grid decomposition in the ice model, these routines simply adjust the indexing on the global domain to the single, local domain index coordinates. Although we recommend that the user choose the local domains so that the global domain is evenly divided, if this is not possible then the easternmost and/or northernmost blocks will contain nonphysical points (“padding”). These points are excluded from the computational domain and have little effect on model performance. nghost is a hardcoded parameter in ice_blocks.F90. While the halo code has been implemented to support arbitrarily sized halos, nghost is set to 1 and larger halos have not been formally tested.

Figure: Grid parameters

Figure Grid parameters shows the grid parameters for a sample one-dimensional, 20-cell global domain decomposed into four local subdomains. Each local domain has one ghost (halo) cell on each side, and the physical portion of the local domains are labeled ilo:ihi. The parameter nx_block is the total number of cells in the local domain, including ghost cells, and the same numbering system is applied to each of the four subdomains.

The user sets the NTASKS and NTHRDS settings in cice.settings and chooses a block size block_size_x \(\times\)block_size_y, max_blocks, and decomposition information distribution_type, processor_shape, and distribution_wght in ice_in. That information is used to determine how the blocks are distributed across the processors, and how the processors are distributed across the grid domain. The model is parallelized over blocks for both MPI and OpenMP. Some suggested combinations of these parameters for best performance are given in Section Performance. The script cice.setup computes some default decompositions and layouts, but the user can override the defaults by manually changing the values in ice_in. At runtime, the model will print decomposition information to the log file, and if the block size or max blocks is inconsistent with the task and thread size, the model will abort. The code will also print a warning if the maximum number of blocks is too large. Although this is not fatal, it does use extra memory. If max_blocks is set to -1, the code will compute a tentative max_blocks on the fly.
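As a sketch, the decomposition-related entries in ice_in might look like the following; the variable names are those described above, but the specific values are placeholders that must be matched to the grid size and to the NTASKS and NTHRDS settings in cice.settings.

&domain_nml
  block_size_x      = 16            ! gridcells per block in x
  block_size_y      = 16            ! gridcells per block in y
  max_blocks        = -1            ! -1 lets the model estimate max_blocks
  distribution_type = 'cartesian'   ! block-to-processor distribution
  processor_shape   = 'slenderX2'   ! used with the cartesian distribution
  distribution_wght = 'latitude'    ! work-per-block weighting
/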

A loop at the end of routine create_blocks in module ice_blocks.F90 will print the locations for all of the blocks on the global grid if the namelist variable debug_blocks is set to be true. Likewise, a similar loop at the end of routine create_local_block_ids in module ice_distribution.F90 will print the processor and local block number for each block. With this information, the grid decomposition into processors and blocks can be ascertained. This debug_blocks variable should be used carefully as there may be hundreds or thousands of blocks to print and this information should be needed only rarely. debug_blocks can be set to true using the debugblocks option with cice.setup. This information is much easier to look at using a debugger such as Totalview. There is also an output field that can be activated in icefields_nml, f_blkmask, that prints out the variable blkmask to the history file and which labels the blocks in the grid decomposition according to blkmask = my_task + iblk/100.

The namelist add_mpi_barriers can be set to .true. to help throttle communication for communication intensive configurations. This may slow the code down a bit. These barriers have been added to a few select locations, but it’s possible others may be needed. As a general rule, add_mpi_barriers should be .false..

3.1.2.2. Tripole grids

The tripole grid is a device for constructing a global grid with a normal south pole and southern boundary condition, which avoids placing a physical boundary or grid singularity in the Arctic Ocean. Instead of a single north pole, it has two “poles” in the north, both located on land, with a line of grid points between them. This line of points is called the “fold,” and it is the “top row” of the physical grid. One pole is at the left-hand end of the top row, and the other is in the middle of the row. The grid is constructed by “folding” the top row, so that the left-hand half and the right-hand half of it coincide. Two choices for constructing the tripole grid are available. The one first introduced to CICE is called “U-fold”, which means that the poles and the grid cells between them are U-cells on the grid. Alternatively the poles and the cells between them can be grid T-cells, making a “T-fold.” Both of these options are also supported by the OPA/NEMO ocean model, which calls the U-fold an “f-fold” (because it uses the Arakawa C-grid in which U-cells are on T-rows). The choice of tripole grid is given by the namelist variable ns_boundary_type, ‘tripole’ for the U-fold and ‘tripoleT’ for the T-fold grid.

In the U-fold tripole grid, the poles have U-index \(nx\_global/2\) and \(nx\_global\) on the top U-row of the physical grid, and points with U-index \(i\) and \(nx\_global-i\) are coincident. Let the fold have U-row index \(n\) on the global grid; this will also be the T-row index of the T-row to the south of the fold. There are ghost (halo) T- and U-rows to the north, beyond the fold, on the logical grid. The point with index i along the ghost T-row of index \(n+1\) physically coincides with point \(nx\_global-i+1\) on the T-row of index \(n\). The ghost U-row of index \(n+1\) physically coincides with the U-row of index \(n-1\). In the schematics below, symbols A-H represent grid points from 1:nx_global at a given j index and the setup of the tripole seam is depicted within a few rows of the seam.

Tripole (u-fold) Grid Schematic

  global j index   grid point IDs (i index)   global j index source
  ny_global+2      H G F E D C B A            ny_global-1
  ny_global+1      H G F E D C B A            ny_global
  ny_global        A B C D E F G H
  ny_global-1      A B C D E F G H

In the T-fold tripole grid, the poles have T-index \(1\) and \(nx\_global/2+1\) on the top T-row of the physical grid, and points with T-index \(i\) and \(nx\_global-i+2\) are coincident. Let the fold have T-row index \(n\) on the global grid. It is usual for the northernmost row of the physical domain to be a U-row, but in the case of the T-fold, the U-row of index \(n\) is “beyond” the fold; although it is not a ghost row, it is not physically independent, because it coincides with U-row \(n-1\), and it therefore has to be treated like a ghost row. Point \(i\) on U-row \(n\) coincides with point \(nx\_global-i+1\) on U-row \(n-1\). There are still ghost T- and U-rows \(n+1\) to the north of U-row \(n\). Ghost T-row \(n+1\) coincides with T-row \(n-1\), and ghost U-row \(n+1\) coincides with U-row \(n-2\).

TripoleT (t-fold) Grid Schematic

  global j index   grid point IDs (i index)        global j index source
  ny_global+2      H  G  F  E  D  C  B  A          ny_global-2
  ny_global+1      H  G  F  E  D  C  B  A          ny_global-1
  ny_global        A  BH CG DF E  FD GC HB         ny_global-1
  ny_global-1      A  B  C  D  E  F  G  H
  ny_global-2      A  B  C  D  E  F  G  H

The tripole grid thus requires two special kinds of treatment for certain rows, arranged by the halo-update routines. First, within rows along the fold, coincident points must always have the same value. This is achieved by averaging them in pairs. Second, values for ghost rows and the “quasi-ghost” U-row on the T-fold grid are reflected copies of the coincident physical rows. Both operations involve the tripole buffer, which is used to assemble the data for the affected rows. Special treatment is also required in the scattering routine, and when computing global sums one of each pair of coincident points has to be excluded. Halos of center, east, north, and northeast points are supported, and each requires slightly different halo indexing across the tripole seam.

3.1.2.3. Rectangular grids

Rectangular test grids can be defined for CICE. They are generated internally and defined by several namelist settings, including grid_type = rectangular, nx_global, ny_global, dx_rect, dy_rect, lonrefrect, and latrefrect. Forcing and initial conditions can be set via the namelists atm_data_type, ocn_data_type, ice_data_type, ice_data_conc, and ice_data_dist. Variable grid spacing is also supported with the namelist settings scale_dxdy, which turns on the option, and dxscale and dyscale, which set the variable grid scaling factor. Values of 1.0 produce constant grid spacing. For rectangular grids, lonrefrect and latrefrect define the lower left longitude and latitude values of the grid, dx_rect and dy_rect define the base grid spacing, and dxscale and dyscale provide the grid spacing scaling. The base spacing is set in the center of the rectangular domain and the scaling is applied symmetrically outward as a multiplicative factor in the x and y directions.
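As a minimal sketch, the relevant ice_in entries might be set roughly as follows; the namelist group placement is omitted here and the numeric values are purely illustrative.

grid_type  = 'rectangular'
nx_global  = 80           ! number of gridcells in x
ny_global  = 80           ! number of gridcells in y
dx_rect    = 16.e5        ! base grid spacing in x (model units)
dy_rect    = 16.e5        ! base grid spacing in y (model units)
lonrefrect = -180.0       ! lower left longitude of the grid
latrefrect = 65.0         ! lower left latitude of the grid
scale_dxdy = .true.       ! turn on variable grid spacing
dxscale    = 1.0          ! 1.0 gives constant spacing in x
dyscale    = 1.0          ! 1.0 gives constant spacing in y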

Several predefined rectangular grids are available in CICE with cice.setup --grid, including gbox12, gbox80, gbox128, and gbox180, where 12, 80, 128, and 180 are the number of gridcells in each direction. Several predefined options also exist, set with cice.setup --set, to establish varied idealized configurations of box tests including box2001, boxadv, boxchan, boxchan1e, boxchan1n, boxnodyn, boxrestore, boxslotcyl, boxopen, boxclosed, and boxforcee. See cice.setup --help for a current list of supported settings.

3.1.2.4. Vertical Grids

The sea ice physics described in a single column or grid cell is contained in the Icepack submodule, which can be run independently of the CICE model. Icepack includes a vertical grid for the physics and a “bio-grid” for biogeochemistry, described in the Icepack Documentation. History variables available for column output are ice and snow temperature, Tinz and Tsnz, and the ice salinity profile, Sinz. These variables also include thickness category as a fourth dimension.

3.1.2.5. Boundary conditions

Much of the infrastructure used in CICE, including the boundary routines, is adopted from POP. The boundary routines perform boundary communications among processors when MPI is in use and among blocks whenever there is more than one block per processor.

Boundary conditions are defined by the ns_boundary_type and ew_boundary_type namelist inputs. Valid values are open and cyclic. In addition, tripole and tripoleT are options for ns_boundary_type. Closed boundary conditions are not supported currently. The domain can be physically closed with the close_boundaries namelist, which forces a land mask two gridcells deep along the boundary. Where the boundary is land, the boundary_type settings play no role. For example, in the displaced-pole grids, at least one row of grid cells along the north and south boundaries is land. Along the east/west domain boundaries not masked by land, periodic conditions wrap the domain around the globe. In this example, the appropriate namelist settings are ns_boundary_type = open, ew_boundary_type = cyclic, and close_boundaries = .false..
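Written as namelist entries, that example is simply the following (the group placement is omitted; the quoting convention follows the other namelist examples in this document):

ns_boundary_type = 'open'
ew_boundary_type = 'cyclic'
close_boundaries = .false.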

CICE can be run on regional grids with open boundary conditions; except for variables describing grid lengths, non-land halo cells along the grid edge must be filled by restoring them to specified values. The namelist variable restore_ice turns this functionality on and off; the restoring timescale trestore may be used (it is also used for restoring ocean sea surface temperature in stand-alone ice runs). This implementation is only intended to provide the “hooks” for a more sophisticated treatment; the rectangular grid option can be used to test this configuration. The ‘displaced_pole’ grid option should not be used unless the regional grid contains land all along the north and south boundaries. The current form of the boundary condition routines does not allow Neumann boundary conditions, which must be set explicitly. This has been done in an unreleased branch of the code; contact Elizabeth for more information.

For exact restarts using restoring, set restart_ext = true in namelist to use the extended-grid subroutines.

On tripole grids, the order of operations used for calculating elements of the stress tensor can differ on either side of the fold, leading to round-off differences. Although restarts using the extended grid routines are exact for a given run, the solution will differ from another run in which restarts are written at different times. For this reason, explicit halo updates of the stress tensor are implemented for the tripole grid, both within the dynamics calculation and for restarts. This has not been implemented yet for tripoleT grids, pending further testing.

3.1.2.6. Masks

A land mask hm (\(M_h\)) is specified in the cell centers (on the T-grid), with 0 representing land and 1 representing ocean cells. Corresponding masks for the U, N, and E grids are given by

\[M_u(i,j)=\min\{M_h(l),\,l=(i,j),\,(i+1,j),\,(i,j+1),\,(i+1,j+1)\}.\]
\[M_n(i,j)=\min\{M_h(l),\,l=(i,j),\,(i,j+1)\}.\]
\[M_e(i,j)=\min\{M_h(l),\,l=(i,j),\,(i+1,j)\}.\]

The logical masks tmask, umask, nmask, and emask (which correspond to the real masks hm, uvm, npm, and epm respectively) are useful in conditional statements.
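As an illustration of how the real and logical masks relate, a minimal Fortran sketch following the formula for \(M_u\) above (this is not the actual CICE source; the loop bounds, the kind parameter, and the 0.5 threshold are assumptions):

! derive the real U-cell mask from the T-cell land mask hm
! (0 = land, 1 = ocean) and form the corresponding logical mask
do j = jlo, jhi
do i = ilo, ihi
   uvm(i,j)   = min(hm(i,j), hm(i+1,j), hm(i,j+1), hm(i+1,j+1))
   umask(i,j) = uvm(i,j) > 0.5_dbl_kind
end do
end do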

In addition to the land masks, two other masks are implemented in dyn_prep in order to reduce the dynamics component’s work on a global grid. At each time step the logical masks iceTmask and iceUmask are determined from the current ice extent, such that they have the value “true” wherever ice exists. They also include a border of cells around the ice pack for numerical purposes. These masks are used in the dynamics component to prevent unnecessary calculations on grid points where there is no ice. They are not used in the thermodynamics component, so that ice may form in previously ice-free cells. Like the land masks hm and uvm, the ice extent masks iceTmask and iceUmask are for T-cells and U-cells, respectively. Note that the ice extent masks iceEmask and iceNmask are also defined when using the C or CD grid.

Improved parallel performance may result from utilizing halo masks for boundary updates of the full ice state, incremental remapping transport, or for EVP or EAP dynamics. These options are accessed through the logical namelist flags maskhalo_bound, maskhalo_remap, and maskhalo_dyn, respectively. Only the halo cells containing needed information are communicated.

Two additional masks are created for the user’s convenience: lmask_n and lmask_s can be used to compute or write data only for the northern or southern hemispheres, respectively. Special constants (spval and spval_dbl, each equal to \(10^{30}\)) are used to indicate land points in the history files and diagnostics.

3.1.2.7. Interpolating between grids

Fields in CICE are generally defined at particular grid locations, such as T cell centers, U corners, or N or E edges. These are assigned internally in CICE based on the grid_ice namelist variable. Forcing/coupling fields are also associated with a specific set of grid locations that may or may not be the same as on the internal CICE model grid. The namelist variables grid_atm and grid_ocn define the forcing/coupling grids. The grid_ice, grid_atm, and grid_ocn variables are independent and take values like A, B, C, or CD consistent with the Arakawa grid convention [2]. The relationship between the grid system and the internal grids is shown in Grid System and Type Definitions.

Grid System and Type Definitions

  grid system   thermo grid   u dynamic grid   v dynamic grid
  A             T             T                T
  B             T             U                U
  C             T             E                N
  CD            T             N+E              N+E

For all grid systems, thermodynamic variables are always defined on the T grid for the model and model forcing/coupling fields. However, the dynamics u and v fields vary. In the CD grid, there are twice as many u and v fields as on the other grids. Within the CICE model, the variables grid_ice_thrm, grid_ice_dynu, grid_ice_dynv, grid_atm_thrm, grid_atm_dynu, grid_atm_dynv, grid_ocn_thrm, grid_ocn_dynu, and grid_ocn_dynv are character strings (T, U, N, E, NE) derived from the grid_ice, grid_atm, and grid_ocn namelist values.

The CICE model has several internal methods that will interpolate (a.k.a. map or average) fields on (T, U, N, E, NE) grids to (T, U, N, E). An interpolation to an identical grid results in a field copy. The generic interface to this method is grid_average_X2Y, and there are several forms.

subroutine grid_average_X2Y(type,work1,grid1,work2,grid2)
  character(len=*)    , intent(in)  :: type           ! mapping type (S, A, F)
  real (kind=dbl_kind), intent(in)  :: work1(:,:,:)   ! input field(nx_block, ny_block, max_blocks)
  character(len=*)    , intent(in)  :: grid1          ! work1 grid (T, U, N, E)
  real (kind=dbl_kind), intent(out) :: work2(:,:,:)   ! output field(nx_block, ny_block, max_blocks)
  character(len=*)    , intent(in)  :: grid2          ! work2 grid (T, U, N, E)

where type is an interpolation type with the following valid values,

type = S is a normalized, masked, area-weighted interpolation

\[work2 = \frac{\sum_{i=1}^{n} (M_{1i}A_{1i}work1_{i})} {\sum_{i=1}^{n} (M_{1i}A_{1i})}\]

type = A is a normalized, unmasked, area-weighted interpolation

\[work2 = \frac{\sum_{i=1}^{n} (A_{1i}work1_{i})} {\sum_{i=1}^{n} (A_{1i})}\]

type = F is a normalized, unmasked, conservative flux interpolation

\[work2 = \frac{\sum_{i=1}^{n} (A_{1i}work1_{i})} {n*A_{2}}\]

with A defined as the appropriate gridcell area and M as the gridcell mask. Another form of the grid_average_X2Y is

subroutine grid_average_X2Y(type,work1,grid1,wght1,mask1,work2,grid2)
  character(len=*)    , intent(in)  :: type           ! mapping type (S, A, F)
  real (kind=dbl_kind), intent(in)  :: work1(:,:,:)   ! input field(nx_block, ny_block, max_blocks)
  real (kind=dbl_kind), intent(in)  :: wght1(:,:,:)   ! input weight(nx_block, ny_block, max_blocks)
  real (kind=dbl_kind), intent(in)  :: mask1(:,:,:)   ! input mask(nx_block, ny_block, max_blocks)
  character(len=*)    , intent(in)  :: grid1          ! work1 grid (T, U, N, E)
  real (kind=dbl_kind), intent(out) :: work2(:,:,:)   ! output field(nx_block, ny_block, max_blocks)
  character(len=*)    , intent(in)  :: grid2          ! work2 grid (T, U, N, E)

In this case, the input arrays wght1 and mask1 are used in the interpolation equations instead of gridcell area and mask. This version allows the user to define the weights and mask explicitly. This implementation is supported only for type = S or A interpolations.
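For example, a hypothetical call of this form that averages a U-grid field to the T grid, using the U-cell area and the real U-cell mask as weights, might look like the following; uocnT is a placeholder output array introduced here for illustration, while uocn, uarea, and uvm are the field, area, and mask names used elsewhere in this document.

call grid_average_X2Y('S', uocn, 'U', uarea, uvm, uocnT, 'T')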

A final form of the grid_average_X2Y interface is

subroutine grid_average_X2Y(type,work1a,grid1a,work1b,grid1b,work2,grid2)
  character(len=*)    , intent(in)  :: type           ! mapping type (S, A, F)
  real (kind=dbl_kind), intent(in)  :: work1a(:,:,:)  ! input field(nx_block, ny_block, max_blocks)
  character(len=*)    , intent(in)  :: grid1a         ! work1 grid (N, E)
  real (kind=dbl_kind), intent(in)  :: work1b(:,:,:)  ! input field(nx_block, ny_block, max_blocks)
  character(len=*)    , intent(in)  :: grid1b         ! work1 grid (N, E)
  real (kind=dbl_kind), intent(out) :: work2(:,:,:)   ! output field(nx_block, ny_block, max_blocks)
  character(len=*)    , intent(in)  :: grid2          ! work2 grid (T, U)

This version supports mapping from an NE grid to a T or U grid. In this case, the 1a arguments are for either the N or E field and the 1b arguments are for the complementary field (E or N respectively). At present, only S type mappings are supported with this interface.

In all cases, the work1, wght1, and mask1 input arrays should have correct halo values when called. Examples of usage can be found in the source code, but the following example maps the uocn and vocn fields from their native forcing/coupling grid to the U grid using a masked, area-weighted, average method.

call grid_average_X2Y('S', uocn, grid_ocn_dynu, uocnU, 'U')
call grid_average_X2Y('S', vocn, grid_ocn_dynv, vocnU, 'U')

3.1.2.8. Performance

Namelist options (domain_nml) provide considerable flexibility for finding efficient processor and block configuration. Some of these choices are illustrated in Distribution options. Users have control of many aspects of the decomposition such as the block size (block_size_x, block_size_y), the distribution_type, the distribution_wght, the distribution_wght_file (when distribution_type = wghtfile), and the processor_shape (when distribution_type = cartesian).

The user specifies the total number of tasks and threads in cice.settings and the block size and decomposition in the namelist file. The main trade-off is the relative efficiency of large square blocks versus model internal load balance, as the CICE computation cost is very small for ice-free blocks. The code is parallelized over blocks for both MPI and OpenMP. Smaller, more numerous blocks provide an opportunity for better load balance by allocating each processor both ice-covered and ice-free blocks, but they become less efficient due to the MPI communication associated with halo updates. In practice, blocks should probably not have fewer than about 8 to 10 grid cells in each direction, and more square blocks tend to optimize the volume-to-surface ratio important for communication cost. Often 3 to 8 blocks per processor provide the decomposition flexibility needed to create reasonable load balance configurations.

Like MPI, load balance of blocks across threads is important for efficient performance. Most of the OpenMP threading is implemented with SCHEDULE(runtime), so the OMP_SCHEDULE environment variable can be used to set the OpenMP schedule. The default OMP_SCHEDULE setting is defined by the variable ICE_OMPSCHE in cice.settings. OMP_SCHEDULE values of “STATIC,1” and “DYNAMIC,1” are worth testing. The OpenMP implementation in CICE is constantly under review, but users should validate results and performance on their machine. CICE should be bit-for-bit with different block sizes, different decompositions, different MPI task counts, and different OpenMP threads. Finally, we recommend that the OMP_STACKSIZE environment variable be set to 32M or greater.

The distribution_type options allow standard cartesian distributions of blocks, redistribution via a ‘rake’ algorithm for improved load balancing across processors, and redistribution based on space-filling curves. There are also additional distribution types (‘roundrobin,’ ‘sectrobin,’ ‘sectcart’, and ‘spiralcenter’) that support alternative decompositions and also allow more flexibility in the number of processors used. Finally, there is a ‘wghtfile’ decomposition that generates a decomposition based on weights specified in an input file.

Figure: Distribution options

Figure Distribution options shows distribution of 256 blocks across 16 processors, represented by colors, on the gx1 grid: (a) cartesian, slenderX1, (b) cartesian, slenderX2, (c) cartesian, square-ice (square-pop is equivalent here), (d) rake with block weighting, (e) rake with latitude weighting, (f) spacecurve. Each block consists of 20x24 grid cells, and white blocks consist entirely of land cells.

Figure: Decomposition options

Figure Decomposition options shows sample decompositions for (a) spiral center and (b) wghtfile for an Arctic polar grid. (c) shows the weight field in the input file used to drive the decomposition in (b).

processor_shape is used with the distribution_type cartesian option, and it allocates blocks to processors in various groupings such as tall, thin processor domains (slenderX1 or slenderX2, often better for sea ice simulations on global grids where nearly all of the work is at the top and bottom of the grid with little to do in between) and close-to-square domains (square-pop or square-ice), which maximize the volume to surface ratio (and therefore on-processor computations to message passing, if there were ice in every grid cell). In cases where the number of processors is not a perfect square (4, 9, 16…), the processor_shape namelist variable allows the user to choose how the processors are arranged. Here again, it is better in the sea ice model to have more processors in x than in y, for example, 8 processors arranged 4x2 (square-ice) rather than 2x4 (square-pop). The latter option is offered for direct-communication compatibility with POP, in which this is the default.

distribution_wght chooses how the work-per-block estimates are weighted. The ‘block’ option is the default in POP and it weights each block equally. This is useful in POP which always has work in each block and is written with a lot of array syntax requiring calculations over entire blocks (whether or not land is present). This option is provided in CICE as well for direct-communication compatibility with POP. Blocks that contain 100% land grid cells are eliminated with ‘block’. The ‘blockall’ option is identical to ‘block’ but does not do land block elimination. The ‘latitude’ option weights the blocks based on latitude and the number of ocean grid cells they contain. Many of the non-cartesian decompositions support automatic land block elimination and provide alternative ways to decompose blocks without needing the distribution_wght.

The rake distribution type is initialized as a standard, Cartesian distribution. Using the work-per-block estimates, blocks are “raked” onto neighboring processors as needed to improve load balancing characteristics among processors, first in the x direction and then in y.

Space-filling curves reduce a multi-dimensional space (2D, in our case) to one dimension. The curve is composed of a string of blocks that is snipped into sections, again based on the work per processor, and each piece is placed on a processor for optimal load balancing. This option requires that the block size be chosen such that the number of blocks in the x direction and the number of blocks in the y direction must be factorable as \(2^n 3^m 5^p\) where \(n, m, p\) are integers. For example, a 16x16 array of blocks, each containing 20x24 grid cells, fills the gx1 grid (\(n=4, m=p=0\)). If either of these conditions is not met, the spacecurve decomposition will fail.

While the Cartesian distribution groups sets of blocks by processor, the ‘roundrobin’ distribution loops through the blocks and processors together, putting one block on each processor until the blocks are gone. This provides good load balancing but poor communication characteristics due to the number of neighbors and the amount of data needed to communicate. The ‘sectrobin’ and ‘sectcart’ algorithms loop similarly, but put groups of blocks on each processor to improve the communication characteristics. In the ‘sectcart’ case, the domain is divided into four (east-west,north-south) quarters and the loops are done over each, sequentially.

The wghtfile decomposition drives the decomposition based on weights provided in a weight file. That file should be a netCDF file with a double real field called wght containing the relative weight of each gridcell. Decomposition options (b) and (c) show an example. The weights associated with each gridcell will be summed on a per block basis and normalized to about 10 bins to carry out the distribution of highest to lowest block weights to processors. Scorecard provides an overview of the pros and cons of the various distribution types.

Figure: Scorecard

Figure Scorecard shows the scorecard for block distribution choices in CICE, courtesy T. Craig. For more information, see [9] or http://www.cesm.ucar.edu/events/workshops/ws.2012/presentations/sewg/craig.pdf

The maskhalo options in the namelist improve performance by removing unnecessary halo communications where there is no ice. There is some overhead in setting up the halo masks, which is done during the timestepping procedure as the ice area changes, but this option usually improves timings even for relatively small processor counts. T. Craig has found that performance improved by more than 20% for combinations of updated decompositions and masked haloes, in CESM’s version of CICE.

Throughout the code, (i, j) loops have been combined into a single loop, often over just ocean cells or those containing sea ice. This was done to reduce unnecessary operations and to improve vector performance.

Timings illustrates the CICE v5 computational expense of various options, relative to the total time (excluding initialization) of a 7-layer configuration using BL99 thermodynamics, EVP dynamics, and the ‘ccsm3’ shortwave parameterization on the gx1 grid, run for one year from a no-ice initial condition. The block distribution consisted of 20 \(\times\) 192 blocks spread over 32 processors (‘slenderX2’) with no threads and -O2 optimization. Timings varied by about \(\pm3\)% in identically configured runs due to machine load. Extra time required for tracers has two components, that needed to carry the tracer itself (advection, category conversions) and that needed for the calculations associated with the particular tracer. The age tracers (FY and iage) require very little extra calculation, so their timings represent essentially the time needed just to carry an extra tracer. The topo melt pond scheme is slightly faster than the others because it calculates pond area and volume once per grid cell, while the others calculate it for each thickness category.

Figure: Timings

Figure Timings shows the change in ‘TimeLoop’ timings from the 7-layer configuration using BL99 thermodynamics and EVP dynamics. Timings were made on a nondedicated machine, with variations of about \(\pm3\)% in identically configured runs (light grey). Darker grey indicates the time needed for extra required options; the Delta-Eddington radiation scheme is required for all melt pond schemes and the aerosol tracers, and the level-ice pond parameterization additionally requires the level-ice tracers.

3.1.3. Time Manager and Initialization

The time manager is an important piece of the CICE model.

3.1.3.1. Time Manager

The primary prognostic variables in the time manager are myear, mmonth, mday, and msec. These are integers and identify the current model year, month, day, and second respectively. The model timestep is dt with units of seconds. See Choosing an appropriate time step for additional information about choosing an appropriate timestep. The internal variables istep, istep0, and istep1 keep track of the number of timesteps. istep is the counter for the current run and is set to 0 at the start of each run. istep0 is the step count at the start of a long multi-restart run, and istep1 is the step count of a long multi-restart run and is continuous across model restarts.

In general, the time manager should be advanced by calling advance_timestep. This subroutine in ice_calendar.F90 automatically advances the model time by dt. It also advances the istep numbers and calls subroutine calendar to update additional calendar data.

The namelist variable use_restart_time specifies whether to use the time and step numbers saved on a restart file or whether to set the initial model time to the namelist values defined by year_init, month_init, day_init, and sec_init. Normally, use_restart_time is set to false on the initial run. In continue mode, use_restart_time is ignored and the restart date is always used to initialize the model run. More information about the restart capability can be found in Restart files.

Several different calendars are supported, including noleap (365 days per year), 360-day (twelve 30-day months per year), and gregorian (leap days every 4 years, except every 100 years, except every 400 years). The gregorian calendar in CICE is formally a proleptic gregorian calendar without any discontinuities over time. The calendar is set by specifying days_per_year and use_leap_years in the namelist, and the following combinations are supported:

Supported Calendar Options

  days_per_year   use_leap_years   calendar
  365             false            noleap
  365             true             gregorian
  360             false            360-day

The history (History files) and restart (Restart files) outputs and frequencies are specified in namelist and are computed relative to a reference date defined by the namelist histfreq_base and dumpfreq_base. Valid values for each are zero and init. If set to zero, all output will be relative to the absolute reference year-month-day date, 0000-01-01. This is the default value for histfreq_base, so runs with different initial dates will have identical output. If the histfreq_base or dumpfreq_base are set to init, all frequencies will be relative to the model initial date specified by year_init, month_init, and day_init. sec_init plays no role in setting output frequencies. init is the default for dumpfreq_base and makes it easy to generate restarts 5 or 10 model days after startup as we often do in testing. Both histfreq_base and dumpfreq_base are arrays and can be set for each stream separately.

In general, output is always written at the start of the year, month, day, or hour without any ability to shift the phase. For instance, monthly output is always written on the first of the month. It is not possible, for instance, to write monthly data once a month on the 10th of the month. In the same way, quarterly data for Dec-Jan-Feb vs Jan-Feb-Mar is not easily controlled. A better approach is to create monthly data and then to aggregate to quarters as a post-processing step. The history and restart (histfreq, dumpfreq) setting 1 indicates output at a frequency of timesteps. This is the character 1 as opposed to the integer 1. This frequency output is computed using istep1, the model timestep. This may vary with each run depending on several factors including the model timestep, initial date, and value of istep0.

The model year is limited by some integer math, in particular the calculation of elapsed hours in ice_calendar.F90; the model year cannot exceed the value of myear_max set in that file, currently 200,000 years.

The time manager was updated in early 2021. The standalone model was modified, and some tests were done in a coupled framework after modifications to the high level coupling interface. For some coupled models, the coupling interface may need to be updated when updating CICE with the new time manager. In particular, the old prognostic variable time no longer exists in CICE, year_init only defines the model initial year, and the calendar subroutine is called without any arguments. One can set the namelist variables year_init, month_init, day_init, sec_init, and dt in conjunction with days_per_year and use_leap_years to initialize the model date, timestep, and calendar. To overwrite the default/namelist settings in the coupling layer, set the ice_calendar.F90 variables myear, mmonth, mday, msec and dt after the namelists have been read. Subroutine calendar should then be called to update all the calendar data. Finally, subroutine advance_timestep should be used to advance the model time manager. It advances the step numbers, advances time by dt, and updates the calendar data. The older method of manually advancing the steps and adding dt to time should be deprecated.
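A minimal Fortran sketch of that sequence, as it might appear in a coupling layer (the loop structure, the coupler_done flag, and the specific date and timestep values are placeholders for illustration; the variable and subroutine names are those listed above):

! override the namelist date/timestep from the coupling layer,
! then advance the CICE time manager once per coupled step
use ice_calendar, only: myear, mmonth, mday, msec, dt, &
                        calendar, advance_timestep

myear  = 2005           ! model date provided by the driver
mmonth = 1
mday   = 1
msec   = 0
dt     = 3600.0         ! thermodynamic timestep in seconds
call calendar()         ! refresh derived calendar data

do while (.not. coupler_done)
   ! ... run one CICE timestep ...
   call advance_timestep()   ! advances istep counters, time, and calendar
end do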

3.1.3.2. Initialization and Restarts

The ice model’s parameters and variables are initialized in several steps. Many constants and physical parameters are set in ice_constants.F90. Namelist variables (Tables of Namelist Options), whose values can be altered at run time, are handled in input_data and other initialization routines. These variables are given default values in the code, which may then be changed when the input file ice_in is read. Other physical constants, numerical parameters, and variables are first set in initialization routines for each ice model component or module. Then, if the ice model is being restarted from a previous run, core variables are read and reinitialized in restartfile, while tracer variables needed for specific configurations are read in separate restart routines associated with each tracer or specialized parameterization. Finally, albedo and other quantities dependent on the initial ice state are set. Some of these parameters will be described in more detail in Tables of Namelist Options.

The restart files supplied with the code release include the core variables on the default configuration, that is, with seven vertical layers and the ice thickness distribution defined by kcatbound = 0. Restart information for some tracers is also included in the netCDF restart files.

Three namelist variables generally control model initialization: runtype, ice_ic, and use_restart_time. The valid values for runtype are initial or continue. When runtype = continue, the restart filename is stored in a small text (pointer) file, use_restart_time is forced to true, and ice_ic plays no role. When runtype = initial, ice_ic has three options: none, internal, or a filename. These correspond to a no-ice initial state, a namelist-driven initial condition, and an ice state defined by a file, respectively. If ice_ic is set to internal, the initial state is defined by the namelist values ice_data_type, ice_data_dist, and ice_data_conc. In initial mode, use_restart_time should generally be set to false, and the initial time is then defined by year_init, month_init, day_init, and sec_init. These combinations are summarized in Ice Initialization.

Restart files and initial condition files are generally the same format and can be the same files. They contain the model state from a particular instance in time. In general, that state includes the physical and dynamical state as well as the state of optional tracers. Reading of various tracer groups can be independently controlled by various restart flags. In other words, a restart file can be used to initialize a new configuration where new tracers are used (i.e. bgc). In that case, the physical state of the model will be read, but if bgc tracers don’t exist on the restart file, they can be initialized from scratch.

In continue mode, a pointer file is used to restart the model. In this mode, the CICE model writes out a small text (pointer) file to the run directory that names the most recent restart file. On restart, the model reads the pointer file which defines the name of the restart file. The model then reads that restart file. By having this feature, the ice namelist does not need to be constantly updated with the latest restart filename, and the model can be automatically resubmitted. Manually editing the pointer file in the middle of a run will reset the restart filename and allow the run to continue.

Table Ice Initialization shows the runtype, ice_ic, and use_restart_time namelist combinations for initializing the model. Where the namelist defines the start date, it is set with year_init, month_init, day_init, and sec_init.

Ice Initialization

  runtype    ice_ic                use_restart_time   Note
  initial    none                  not used           no ice, namelist defines start date
  initial    internal or default   not used           set by namelist ice_data_type, ice_data_dist, ice_data_conc
  initial    filename              false              read ice state from filename, namelist defines start date
  initial    filename              true               read ice state from filename, restart file defines start date
  continue   not used              not used           pointer file defines restart file, restart file defines start date
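As a concrete illustration of the third row of the table above, a hedged ice_in fragment; the restart filename and date values are placeholders only.

runtype          = 'initial'
ice_ic           = './restart/iced.2005-01-01-00000.nc'   ! placeholder filename
use_restart_time = .false.
year_init        = 2005
month_init       = 1
day_init         = 1
sec_init         = 0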

An additional namelist option, restart_ext specifies whether halo cells are included in the restart files. This option is useful for tripole and regional grids, but can not be used with PIO.

An additional namelist option, restart_coszen specifies whether the cosine of the zenith angle is included in the restart files. This is mainly used in coupled models.

MPI is initialized in init_communicate for both coupled and stand-alone MPI runs. The ice component communicates with a flux coupler or other climate components via external routines that handle the variables listed in the Icepack documentation. For stand-alone runs, routines in ice_forcing.F90 read and interpolate data from files, and are intended merely to provide guidance for the user to write his or her own routines. Whether the code is to be run in stand-alone or coupled mode is determined at compile time, as described below.

3.1.3.3. Choosing an appropriate time step

The time step is chosen based on stability of the transport component (both horizontal and in thickness space) and on resolution of the physical forcing. CICE allows the dynamics, advection and ridging portion of the code to be run with a shorter timestep, \(\Delta t_{dyn}\) (dt_dyn), than the thermodynamics timestep \(\Delta t\) (dt). In this case, dt and the integer ndtd are specified, and dt_dyn = dt/ndtd.

A conservative estimate of the horizontal transport time step bound, or CFL condition, under remapping yields

\[\Delta t_{dyn} < {\min\left(\Delta x, \Delta y\right)\over 2\max\left(u, v\right)}.\]

Numerical estimates for this bound for several POP grids, assuming \(\max(u, v)=0.5\) m/s, are as follows:

Time Step Bound

  grid label   N pole singularity   dimensions           min \(\sqrt{\Delta x\cdot\Delta y}\)   max \(\Delta t_{dyn}\)
  gx3          Greenland            \(100\times 116\)    \(39\times 10^3\) m                    10.8 hr
  gx1          Greenland            \(320\times 384\)    \(18\times 10^3\) m                    5.0 hr
  p4           Canada               \(900\times 600\)    \(6.5\times 10^3\) m                   1.8 hr

As discussed in [39], the maximum time step in practice is usually determined by the time scale for large changes in the ice strength (which depends in part on wind strength). Using the strength parameterization of [52] limits the time step to \(\sim\)30 minutes for the old ridging scheme (krdg_partic = 0), and to \(\sim\)2 hours for the new scheme (krdg_partic = 1), assuming \(\Delta x\) = 10 km. Practical limits may be somewhat less, depending on the strength of the atmospheric winds.

Transport in thickness space imposes a similar restraint on the time step, given by the ice growth/melt rate and the smallest range of thickness among the categories, \(\Delta t<\min(\Delta H)/2\max(f)\), where \(\Delta H\) is the distance between category boundaries and \(f\) is the thermodynamic growth rate. For the 5-category ice thickness distribution used as the default in this distribution, this is not a stringent limitation: \(\Delta t < 19.4\) hr, assuming \(\max(f) = 40\) cm/day.

In the classic EVP or EAP approach (kdyn = 1 or 2, revised_evp = false), the dynamics component is subcycled ndte (\(N\)) times per dynamics time step so that the elastic waves essentially disappear before the next time step. The subcycling time step (\(\Delta t_e\)) is thus

\[dte = dt\_dyn/ndte.\]

A second parameter, \(E_\circ\) (elasticDamp), defines the elastic wave damping timescale \(T\), described in Section Dynamics, as elasticDamp * dt_dyn. The forcing terms are not updated during the subcycling. Given the small step (dte) at which the EVP dynamics model is subcycled, the elastic parameter \(E\) is also limited by stability constraints, as discussed in [21]. Linear stability analysis for the dynamics component shows that the numerical method is stable as long as the subcycling time step \(\Delta t_e\) sufficiently resolves the damping timescale \(T\). For the stability analysis we had to make several simplifications of the problem; hence the location of the boundary between stable and unstable regions is merely an estimate. The current default parameters for the EVP and EAP are \(ndte=240\) and \(E_\circ=0.36\). For high resolution applications, it is however recommended to increase the value of \(ndte\) [30], [5].
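As a concrete illustration, assuming a dynamics time step of one hour with the default parameters \(ndte=240\) and \(E_\circ=0.36\):

\[\Delta t_e = \Delta t_{dyn}/N = 3600\ {\rm s}/240 = 15\ {\rm s}, \qquad T = E_\circ\,\Delta t_{dyn} = 0.36\times 3600\ {\rm s} \approx 1300\ {\rm s}.\]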

Note that only \(T\) and \(\Delta t_e\) figure into the stability of the dynamics component; \(\Delta t\) does not. Although the time step may not be tightly limited by stability considerations, large time steps (e.g., \(\Delta t=1\) day, given daily forcing) do not produce accurate results in the dynamics component. The reasons for this error are discussed in [21]; see [25] for its practical effects. The thermodynamics component is stable for any time step, as long as the surface temperature \(T_{sfc}\) is computed internally. The numerical constraint on the thermodynamics time step is associated with the transport scheme rather than the thermodynamic solver.

For the revised EVP approach (kdyn = 1, revised_evp = true), the relaxation parameter arlx1i effectively sets the damping timescale in the problem, and brlx represents the effective subcycling [6] (see Section Revised EVP approach).

3.1.4. Model Input and Output

3.1.4.1. IO Overview

CICE provides the ability to read and write binary unformatted or netCDF data via a number of different methods. The IO implementation is specified both at build-time (via selection of specific source code) and run-time (via namelist). Three different IO packages are available in CICE under the directory cicecore/cicedyn/infrastructure/io. Those are io_binary, io_netcdf, and io_pio2, and those support IO thru binary, netCDF (https://www.unidata.ucar.edu/software/netcdf), and PIO (https://github.com/NCAR/ParallelIO) interfaces respectively. The io_pio2 directory supports both PIO1 and PIO2 and can write data thru the netCDF or parallel netCDF (pnetCDF) interface. The netCDF history files are CF-compliant, and header information for data contained in the netCDF files is displayed with the command ncdump -h filename.nc. To select the io source code, set ICE_IOTYPE in cice.settings to binary, netcdf, pio1, or pio2.

At run-time, more detailed IO settings are available. The restart_format and history_format namelist options specify the method and format further. Valid options are listed in CICE IO formats. These options specify the format of new files created by CICE. Existing files can be read in any format as long as it is consistent with the ICE_IOTYPE defined. Note that with ICE_IOTYPE = binary, the format name is actually ignored. The CICE netCDF output contains a global metadata attribute, io_flavor, that indicates the format chosen for the file. ncdump -k filename.nc also provides information about the specific netCDF file format. In general, the detailed format is not enforced for input files, so any netCDF format can be read in CICE regardless of CICE namelist settings.

CICE IO formats

  Namelist Option   Format                                  Written Thru   Valid With ICE_IOTYPE
  binary            Fortran binary                          fortran        binary
  cdf1              netCDF3-classic                         netCDF         netcdf, pio1, pio2
  cdf2              netCDF3-64bit-offset                    netCDF         netcdf, pio1, pio2
  cdf5              netCDF3-64bit-data                      netCDF         netcdf, pio1, pio2
  default           binary or cdf1, depends on ICE_IOTYPE   varies         binary, netcdf, pio1, pio2
  hdf5              netCDF4 hdf5                            netCDF         netcdf, pio1, pio2
  pnetcdf1          netCDF3-classic                         pnetCDF        pio1, pio2
  pnetcdf2          netCDF3-64bit-offset                    pnetCDF        pio1, pio2
  pnetcdf5          netCDF3-64bit-data                      pnetCDF        pio1, pio2

There are additional namelist options that affect PIO performance for both restart and history output. The [history_,restart_][iotasks,root,stride] namelist options control PIO processor/task usage and specify the total number of IO tasks, the root IO task, and the IO task stride, respectively. history_rearranger and restart_rearranger define the PIO rearranger strategy. Finally, [history_,restart_][deflate,chunksize] provide controls for hdf5 compression and chunking for the hdf5 options in both netCDF and PIO output. hdf5 is written serially thru the netCDF library and in parallel thru the PIO library in CICE. Additional details about the netCDF and PIO settings and implementations can be found in (https://www.unidata.ucar.edu/software/netcdf) and (https://github.com/NCAR/ParallelIO).
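A hedged sketch of how several of these might be set together in ice_in; the specific values, including the rearranger string and the chunk sizes, are illustrative assumptions that would need to be tuned to the machine and grid.

history_format     = 'hdf5'
history_iotasks    = 8          ! number of PIO tasks used for history output
history_root       = 0          ! root IO task
history_stride     = 4          ! stride between IO tasks
history_rearranger = 'box'      ! PIO rearranger strategy (assumed value)
history_deflate    = 1          ! hdf5 compression level
history_chunksize  = 128, 128   ! hdf5 chunk sizes (assumed shape)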

netCDF requires CICE compilation with a netCDF library built externally. PIO requires CICE compilation with a PIO and netCDF library built externally. Both netCDF and PIO can be built with many options which may require additional libraries such as MPI, hdf5, or pnetCDF.

3.1.4.2. History files

CICE provides history data output in binary unformatted or netCDF formats via separate implementations of binary, netCDF, and PIO interfaces as described above. In addition, history_format as well as other history namelist options control the specific file format as well as features related to IO performance, see IO Overview.

The data is written at the period(s) given by histfreq and histfreq_n relative to a reference date specified by histfreq_base. The files are written to binary or netCDF files prepended by the history_file and history_suffix namelist setting. The settings for history files are set in the setup_nml section of ice_in (see Tables of Namelist Options). The history filenames will have a form like [history_file][history_suffix][_freq].[timeID].[nc,da] depending on the namelist options chosen. With binary files, a separate header file is written with equivalent information. Standard fields are output according to settings in the icefields_nml section of ice_in (see Tables of Namelist Options). The user may add (or subtract) variables not already available in the namelist by following the instructions in section Adding History fields.

The history implementation has been divided into several modules based on the desired formatting and on the variables themselves. Parameters, variables and routines needed by multiple modules are in ice_history_shared.F90, while the primary routines for initializing and accumulating all of the history variables are in ice_history.F90. These routines call format-specific code in the io_binary, io_netcdf and io_pio2 directories. History variables specific to certain components or parameterizations are collected in their own history modules (ice_history_bgc.F90, ice_history_drag.F90, ice_history_mechred.F90, ice_history_pond.F90).

The history modules allow output at different frequencies. Five output options (1, h, d, m, y) are available simultaneously for histfreq during a run, and each stream must have a unique value for histfreq. In other words, d cannot be used by two different streams. Each stream has an associated frequency set by histfreq_n. The frequency is relative to a reference date specified by the corresponding entry in histfreq_base. Each stream can contain instantaneous or time-averaged data over the frequency interval. The hist_avg namelist turns on time averaging for each stream individually. The same model variable can be written to multiple history streams (i.e. daily d and monthly m) via its namelist flag, f_ \(\left<{var}\right>\), while x turns that history variable off. For example, f_aice = 'md' will write aice to the monthly and daily streams. Grid variable history output flags are logicals and are written to all stream files if turned on. If there are no namelist flags with a given histfreq value, or if an element of histfreq_n is 0, then no file will be written at that frequency. The history filenames are set in the subroutine construct_filename in ice_history_shared.F90. In cases where two streams produce the same identical filename, the model will abort. Use the namelist hist_suffix to make stream filenames unique. More information about how the frequency is computed is found in Time Manager. Also, some Earth System Models require the history file time axis to be centered in the averaging interval. The flag hist_time_axis allows the user to choose begin, middle, or end for the time stamp.

For example, in the namelist:

histfreq   = '1', 'h', 'd', 'm', 'y'
histfreq_n =  1 ,  6 ,  0 ,  1 ,  1
histfreq_base = 'zero','zero','zero','zero','zero'
hist_avg      = .true.,.true.,.true.,.true.,.true.
f_hi = '1'
f_hs = 'h'
f_Tsfc = 'd'
f_aice = 'm'
f_meltb = 'mh'
f_iage = 'x'

Here, hi will be written to a file on every timestep, hs will be written once every 6 hours, aice once a month, meltb once a month AND once every 6 hours, and Tsfc and iage will not be written. All streams are time averaged over their intervals; because the first stream has histfreq = '1' and histfreq_n = 1, its averaging interval is a single timestep, which is equivalent to instantaneous output every model timestep.

From an efficiency standpoint, it is best to set unused frequencies in histfreq to 'x'. Having output at all 5 frequencies takes nearly 5 times as long as for a single frequency. If you only want monthly output, the most efficient setting is histfreq = 'm','x','x','x','x'. The code counts the number of desired streams (nstreams) based on histfreq.

There is no restart capability built into the history implementation. If the model stops in the middle of a history accumulation period, that data is lost on restart, and the accumulation is zeroed out at startup. That means the dump frequency (see Restart files) and history frequency need to be somewhat coordinated. For example, if monthly history files are requested, the dump frequency should be set to an integer number of months.

The history variable names must be unique for netCDF, so in cases where a variable is written at more than one frequency, the variable name is appended with the frequency in files after the first one. In the example above, meltb is called meltb in the monthly file (for backward compatibility with the default configuration) and meltb_h in the 6-hourly file.

If write_ic is set to true in ice_in, a snapshot of the same set of history fields at the start of the run will be written to the history directory in iceh_ic.[timeID].nc(da). Several history variables are hard-coded for instantaneous output regardless of the hist_avg averaging flag, at the frequency given by their namelist flag.
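For example, a minimal hedged sketch of requesting this snapshot (assuming write_ic sits with the other history settings in setup_nml):

&setup_nml
  write_ic = .true.   ! write a snapshot of the history fields at the start of the run
/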

The normalized principal components of internal ice stress (sig1, sig2) are computed in principal_stress and written to the history file. This calculation is not necessary for the simulation; principal stresses are merely computed for diagnostic purposes and included here for the user’s convenience.

Several history variables are available in two forms, a value representing an average over the sea ice fraction of the grid cell, and another that is multiplied by \(a_i\), representing an average over the grid cell area. Our naming convention attaches the suffix “_ai” to the grid-cell-mean variable names.

Beginning with CICE v6, history variables requested by the Sea Ice Model Intercomparison Project (SIMIP) [43] have been added as possible history output variables (e.g. f_sithick, f_sidmassgrowthbottom, etc.). The lists of monthly and daily requested SIMIP variables provide the names of possible history fields in CICE. However, each of the additional variables can be output at any temporal frequency specified in the icefields_nml section of ice_in as detailed above. Additionally, a new history output variable, f_CMIP, has been added. When f_CMIP is set in the icefields_nml section of ice_in, all SIMIP variables are turned on for output at the frequency specified by f_CMIP.

It may also be helpful for debugging to increase the precision of the history file output from 4 bytes to 8 bytes. This is changed through the history_precision namelist flag.
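A short hedged sketch combining the two options just described (placing history_precision in setup_nml with the other history settings is an assumption here):

&icefields_nml
  f_CMIP = 'm'            ! request all SIMIP history variables at monthly frequency
/
&setup_nml
  history_precision = 8   ! write 8-byte instead of 4-byte history output
/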

3.1.4.3. Diagnostic files

Like histfreq, the parameter diagfreq can be used to regulate how often output is written to a log file. The log file unit to which diagnostic output is written is set in ice_fileunits.F90. If diag_type = ‘stdout’, then it is written to standard out (or to ice.log.[ID] if you redirect standard out as in cice.run); otherwise it is written to the file given by diag_file.

In addition to the standard diagnostic output (maximum area-averaged thickness, velocity, average albedo, total ice area, and total ice and snow volumes), the namelist options print_points and print_global cause additional diagnostic information to be computed and written. print_global outputs global sums that are useful for checking global conservation of mass and energy. print_points writes data for two specific grid points defined by the input namelist lonpnt and latpnt. By default, one point is near the North Pole and the other is in the Weddell Sea; these may be changed in ice_in.
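A hedged sketch of these diagnostic settings (values and namelist group placement are illustrative, not defaults):

&setup_nml
  diagfreq     = 24             ! write diagnostics every 24 timesteps
  diag_type    = 'file'         ! or 'stdout'
  diag_file    = 'ice_diag.d'   ! log file used when diag_type = 'file'
  print_points = .true.         ! extra diagnostics at the two points below
  print_global = .true.         ! global sums for conservation checks
  lonpnt       = 0., -45.       ! longitudes of the two diagnostic points
  latpnt       = 90., -65.      ! latitudes (near the North Pole and Weddell Sea here)
/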

The namelist debug_model prints detailed debug diagnostics for a single point as the model advances. The point is defined by the namelist variables debug_model_i, debug_model_j, debug_model_iblk, and debug_model_task, which are the local i, j, block, and MPI task index values of the point to be diagnosed. The point is defined in local index space and may lie in the array halo. If the local point is not defined in the namelist, the point associated with lonpnt(1) and latpnt(1) is used. debug_model is normally used when the model aborts and needs to be debugged in detail at a particular (usually failing) grid point.
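For example, a hedged sketch of selecting a single local point for detailed debugging (all index values are placeholders):

&setup_nml
  debug_model      = .true.
  debug_model_i    = 10    ! local i index of the point
  debug_model_j    = 20    ! local j index of the point
  debug_model_iblk = 1     ! local block index
  debug_model_task = 0     ! MPI task that owns the point
/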

Memory use diagnostics are controlled by the logical namelist memory_stats. This feature uses an intrinsic query in C defined in ice_memusage_gptl.c. Memory diagnostics will be written at the frequency defined by diagfreq.

Timers are declared and initialized in ice_timers.F90, and the code to be timed is wrapped with calls to ice_timer_start and ice_timer_stop. Finally, ice_timer_print writes the results to the log file. The optional “stats” argument (true/false) prints additional statistics. The “stats” argument can be set by the timer_stats namelist. Calling ice_timer_print_all prints all of the timings at once, rather than having to call each individually. Currently, the timers are set up as in CICE timers. Section Adding Timers contains instructions for adding timers.
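As a hedged sketch of this pattern (the module name ice_timers and the timer index timer_tmp are assumptions for illustration; the actual timer indices are declared in ice_timers.F90):

   use ice_timers, only: ice_timer_start, ice_timer_stop, &
                         ice_timer_print, ice_timer_print_all

   ! wrap the code section to be timed
   call ice_timer_start(timer_tmp)
   ! ... work to be timed ...
   call ice_timer_stop(timer_tmp)

   ! report timings to the log file; stats adds extra statistics
   call ice_timer_print(timer_tmp, stats=.true.)
   call ice_timer_print_all(stats=.true.)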

The timings provided by these timers are not mutually exclusive. For example, the Column timer includes the timings from several other timers, while timer Bound is called from many different places in the code, including the dynamics and advection routines. The Dynamics, Advection, and Column timers do not overlap and represent most of the overall model work.

The timers use MPI_WTIME for parallel runs and the F90 intrinsic system_clock for single-processor runs.

CICE timers

Index  Label      Description
1      Total      the entire run
2      Timeloop   total minus initialization and exit
3      Dynamics   dynamics
4      Advection  horizontal transport
5      Column     all vertical (column) processes
6      Thermo     vertical thermodynamics, part of Column timer
7      Shortwave  SW radiation and albedo, part of Thermo timer
8      Ridging    mechanical redistribution, part of Column timer
9      FloeSize   floe size, part of Column timer
10     Coupling   sending/receiving coupler messages
11     ReadWrite  reading/writing files
12     Diags      diagnostics (log file)
13     History    history output
14     Bound      boundary conditions and subdomain communications
15     BundBound  halo update bundle copy
16     BGC        biogeochemistry, part of Thermo timer
17     Forcing    forcing
18     1d-evp     1d evp, part of Dynamics timer
19     2d-evp     2d evp, part of Dynamics timer
20     UpdState   update state

3.1.4.4. Restart files

CICE reads and writes restart data in binary unformatted or netCDF formats via separate implementations of binary, netCDF, and PIO interfaces as described above. In addition, restart_format and other restart namelist options control the specific file format and features related to IO performance; see IO Overview.

The restart files created by CICE contain all of the variables needed for a full, exact restart. The filename begins with the character string defined by the restart_file namelist input, and the restart dump frequency is given by the namelist variables dumpfreq and dumpfreq_n relative to a reference date specified by dumpfreq_base. Multiple restart frequencies are supported in the code with a similar mechanism to history streams. The pointer to the filename from which the restart data is to be read for a continuation run is set in pointer_file. The code assumes that auxiliary binary tracer restart files will be identified using the same pointer and file name prefix, but with an additional character string in the file name that is associated with each tracer set. All variables are included in netCDF restart files.

Additional namelist flags provide further control of restart behavior. dump_last = true causes a set of restart files to be written at the end of a run when it is otherwise not scheduled to occur. The flag use_restart_time lets the user choose whether to use the model date provided in the restart files for initial runs; if use_restart_time = false, the initial model date stamp is determined from the namelist parameters year_init, month_init, day_init, and sec_init. lcdf64 = true sets 64-bit netCDF output, allowing larger file sizes.
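A hedged sketch of the restart controls described above (values and namelist group placement are illustrative, not defaults):

&setup_nml
  restart_file     = 'iced'               ! restart filename prefix
  dumpfreq         = 'm'                  ! write restart dumps monthly
  dumpfreq_n       = 1                    ! every 1 month
  dumpfreq_base    = 'zero'               ! reference date for the dump frequency
  pointer_file     = 'ice.restart_file'   ! pointer to the restart read on a continuation run
  dump_last        = .true.               ! also dump at the end of the run
  use_restart_time = .false.              ! take the initial date from year_init, etc.
  lcdf64           = .true.               ! 64-bit netCDF output for large files
/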

Routines for gathering, scattering and (unformatted) reading and writing of the “extended” global grid, including the physical domain and ghost (halo) cells around the outer edges, allow exact restarts on regional grids with open boundary conditions, and they will also simplify restarts on the various tripole grids. They are accessed by setting restart_ext = true in namelist. Extended grid restarts are not available when using PIO; in this case extra halo update calls fill ghost cells for tripole grids (do not use PIO for regional grids).

Restart files are available for the CICE code distributions for the gx3 and gx1 grids (see Forcing data for information about obtaining these files). They were created using the default model configuration and run for multiple years using the JRA55 forcing.