What is sea_ice_area_fraction, really ? #407
-
QuestionDear community, Sea-ice concentration is the area fraction of an ocean surface that is covered by sea ice. This is the well accepted "geophysical" definition, adopted by all providers of satellite-based observation data. In the CF world, sea-ice concentration is encoded with standard_name In the first standard name table (April 2002) it was:
In version 66 (May 2019) it became:
Today (version 89, January 2025) it is:
Now imagine a coastal grid cell with partial ocean cover: 70% of the grid is over the ocean, 30% over land. Imagine the ocean is fully covered by sea ice. What sea-ice concentration should be reported for that grid cell? If we follow the geophysical definition the correct answer is 100%: the entirety of the (coastal) ocean is covered. In the CF-world (v66), there is an ambiguity. Should the sea_ice_area_fraction be reported as 100% (Sea ice area fraction is area of the sea surface occupied by sea ice.) or rather 70% ("Area fraction" is the fraction of a grid cell's horizontal area that has some characteristic of interest. It is evaluated as the area of interest divided by the grid cell area) ? Going further, the latest definition of sea_ice_area_fraction mentions the option of using cell_methods to "restrict the evaluation to some portion of that grid cell". If I understand this correctly, it would mean that:
would report a value of 70% for the coastal grid cell introduced above, while
would report a value of 100% for the same coastal grid cell. Obviously, the difference above is only a problem for grid cells with a non-zero land fraction. For cells farther from land, the ocean and grid cell areas are the same numbers. Going back to the evolution of the standard_name definition, it strikes me how the sentences : Sea ice area fraction is area of the sea surface occupied by sea ice. It is also called "sea ice concentration". , that correspond to the geophysical definition of sea-ice concentration, have been there from the start. I would argue that the first definition (table v1) was leaning towards the geophysical definition, then version 66 was ambiguous, then today's version opens for the two options. I find it sub-optimal that the definition of sea_ice_area_fraction does not hold a clear guidance that if one wants to follow the geophysical definition, one must use cell_methods with "area: mean where sea". I also find it puzzling that reporting the sea-surface temperature for the same coastal grid cell would be un-problematic and would not require a cell_methods attribute.
would have the same values whether the cell_methods was present or not. This is puzzling because SIC, like SST, is only defined over the ocean (geophysical sense), why are they treated differently by CF? Here comes the time for some questions:
And bonus questions:
Sorry if the above was difficult to follow. I hope this discussion can help bring awareness on this ambiguity, and hopefully improve the situation for the future. |
Beta Was this translation helpful? Give feedback.
Replies: 16 comments 12 replies
-
@taylor13 @mohntorte, fyi |
Beta Was this translation helpful? Give feedback.
-
I'm not an authority by any means on this, but here's my understanding:
Yes,
Yes, I think you have. In particular if you want to report the fraction of the sea portion of a cell occupied by sea ice, you should use the
If "sea ice concentration" (as distinct from sea ice area fraction) is commonly of interest, we probably should include in the description something like: "To report sea ice concentration, which is the fraction of sea area in a grid cell covered by sea ice, use this standard name but include in the Note that when "where sea" appears in Also, careful data providers will include in I'm surprised you were able to track down historical versions of the standard names. For another purpose, I would also be interested in looking at earlier versions. Where did you find them? |
Beta Was this translation helpful? Give feedback.
-
Thanks for bringing this up -- what a lot of confusion! My thoughts: I liked the 2002 definition:
This seems pretty straightforward, and consistent with the idea behind standard names -- they represent a particular physical quantity. But then, in 2019, this was added: "Area fraction" is the fraction of a grid cell's horizontal area that has some characteristic of interest. It is evaluated as the area of interest divided by the grid cell area. " I understand that that's standard text, which is a great idea for standard names in general, but I wonder -- why introduce the "grid cell" concept here? fraction of a grid cell is not a physical quantity -- fraction of an area is - regardless of how that area is defined -- In this case, it's area_of_ice / total_area (unitless), and could be defined at a point, or at any area defined any way. If it IS the average concentration over a cell, wouldn't that be specified by the cell method? Is it really any different, from a what physical quantity is it? point of view than, say sea_ice_surface_temperature, as Tom points out? I would argue that this can be used both for model output, or results of, e.g. remote sensing analysis. In remote sensing, "fraction of the cell's area" makes sense, but in the model case, it's not really a "fraction of the cell" -- it's the value that the model computes, and depending on the algorithm, could be a point measure or, the average of the cell, or ... really not that different than any other physical quantity, like temperature. On the practicalities:
would report a value of 70% for the coastal grid cell introduced above... Please, please, please do not support this meaning! that is painfully difficult to for me to process on the other end, and feel completely unnecessary -- when would anyone want that definition? I want to know what the concentration of sea ice is in a cell -- and in my case, I WANT a point value, which I would get by interpolating (or applying the same value to the whole cell) -- I absolutely want 100% in that case. And trying to back-calculate from a reported 70% to 100% by looking at the fraction that is land would be really, really painful. My thought is to remove the "fraction of the cell" language altogether, if possible. Question: in the remote sensing case, if a cell (pixel?) did have 30% land, and 70% full coverage of sea ice, what would it report? |
Beta Was this translation helpful? Give feedback.
-
Regarding
For gridded data when cells are partly ocean and partly land, and you interpolated to each point within the cell (including the points on land) do you really want sea ice cover to be 100% everywhere? I would think you would ideally want it to be 100% for ocean points and 0% for land points. Especially for areas with very small fractions of sea. Do you want sea ice fraction to be 100% when most of the area is in fact land? I think using cell_methods to distinguish between values calculated "where sea" or calculated over the entire cell seems like it should be very clear.
This is one for the satellite folks. If they accurately sense an area fraction of sea ice independent of the underlying surface type, then I suppose they would find that the pixel was 70% sea-ice coverd (assuming 100% sea ice where sea). Their cell_methods would omit the phrase "where sea". |
Beta Was this translation helpful? Give feedback.
-
Sure: whether the point is on land or at sea is a different question than what the sea ice coverage is: sea ice fraction is undefined on land -- it really doesn't matter what value I get there. To be honest, this is pretty hypothetical -- all the models I've ever dealt with son't have fraction-land cells, so I don't think my code would do the right thing anyway. But I can't see the use case for define a sea-ice-fraction that isn't, well, the sea-ice fraction. Maybe the way I think of it is that sea-ice-fraction is: area_of_sea_ice / area_of_sea or area_of_sea_ice / (area_of_open_water + area_of_sea_ice) where area_of_land fits in to that I have no idea. Again, I guess the idea is that I think if it not in terms of area but in terms of infinitesimal area -- units of (area/area), rather than an measurement over the finite size of the cell. Consider continuity -- if you have one cell that's all sea, and next to it one with a fraction of land, and the sea ice conditions are the same in both, you really want to have a different value associated with those two cells?
It may be clear, but it's not easy to detect and calculate. Back to the example, if we take Tom's interpretation, then not having a cell_method, or ignoring it would lead to the "wrong" answer. Though maybe there's a reasonable use case for the sea_ice_area_fraction to mean "fraction of cell" rather than "fraction of sea surface", I think the default, easy to process definition should be the "fraction of sea surface" one. And "fraction of sea surface" is a physical quantity -- which we aim to represent with standard_names -- "fraction of cell area" isn't really a physical quantity -- as it depends on the the cell itself. Use case: In my oil spill model, the fraction of sea ice cover at a point influences the behavior of the oil. So we need to know it, and we have code that understands a wide variety or grid types, etc, and can interpolate to points within cells by various algorithms. But we have no way to look at the cell method, and then go and adjust the value in a cell based on a whole other concept of what fraction of that cell is water vs land. And that would be very ugly code to write. |
Beta Was this translation helpful? Give feedback.
-
if it's recent, it's all in gitHub discussions / issues -- maybe a bit hard to find, but it should be there. |
Beta Was this translation helpful? Give feedback.
-
Thanks for that. I'm having a hard time seeing all the available versions. Can I get to the directory that lists them? this link returns a 404. I'm sure I'm doing something stupid here. |
Beta Was this translation helpful? Give feedback.
-
Chris, regarding https://github.com/orgs/cf-convention/discussions/407#discussioncomment-12409485, I think we all agree that the current definition of If you want the By the way, your use case is helpful. As I understand it, your code will only give correct answers if cells are either wholly occupied by sea or not; grid cells with both land and ocean portions included would be problematic. It turns out for your data, including "where sea" for |
Beta Was this translation helpful? Give feedback.
-
Credits where they are due, this topic was already brought by @Bruce-T-T-Calvert in cf-vocabulary issue 66. |
Beta Was this translation helpful? Give feedback.
-
I agree that if you report "sea_ice_area_fraction, area: mean where sea" and if some of your grid cells are partly land and partly sea, then you should also provide the sea_area_fraction. This allows you to calculate, for example, the total area of sea ice over a cell or a region: seaIceArea=seaiceFraction(where sea) x seaFraction x gridCellArea. Also for this case, sea_ice_area_fraction is "undefined" or "missing" over cells entirely occupied by land because (seaIceArea)/(seaArea) is undefined for those cells. For "sea_ice_area_fraction, area: mean", you should always provide the sea_area_fraction because otherwise it would be impossible to calculate sea ice concentration, which in this case equals seaiceFraction/seaFraction. Note that for this case, sea_ice_area_fraction is defined everywhere and is equal to 0 in regions underlain by land. |
Beta Was this translation helpful? Give feedback.
-
+1 on this, because:
And at least some of us want to know (and report?) what the sea ice concentration is at a given point -- I'm having a really hard time understanding why it would be helpful to convert concentration to "fraction over a grid cell". I'm sure I'm missing (maybe many) use cases, but if one wanted, for instance, to know what the total area of sea ice was in a cell, then the "fraction over a grid cell" would be helpful, obviously, but is that useful without knowing what the rest of the cell has in it? how much open water? how much land how much land with ice/snow/??. In that case, you'd need to do a lot more computation anyway .... In short -- these various quantities can be derived from each other, and as a rule, we prefer in CF to define only one -- but in this case, it seems we are storing the derived quantity, which seems off to me. But we have what we have, so adding a "sea_ice_concentration" name might be the best way forward. In a sense, they ARE different things. A concentration is unitless, "differential" quantity: area/area. It can ( at least in theory) vary over a region of any size, like most other location-dependent quantities. Whereas when you define it as: "Area fraction": the fraction of a grid cell's horizontal area Then it is, by definition, a discrete quantity, dependent on the cell size, that does not vary over the cell. So a different thing. But I do agree with @TomLav: first, clarify the definition for what we already have. |
Beta Was this translation helpful? Give feedback.
-
Here is the current definition of sea_ice_area_fraction: "Area fraction" is the fraction of a grid cell's horizontal area that has some characteristic of interest. It is evaluated as the area of interest divided by the grid cell area, or if the cell_methods restricts the evaluation to some portion of that grid cell (e.g. "where sea_ice"), then it is the area of interest divided by the area of the identified portion. It may be expressed as a fraction, a percentage, or any other dimensionless representation of a fraction. Sea ice area fraction is area of the sea surface occupied by sea ice. It is also called "sea ice concentration". "Sea ice" means all ice floating in the sea which has formed from freezing sea water, rather than by other processes such as calving of land ice to form icebergs. I suggest a new text along the lines of: By default, a variable with this standard name holds the fraction of a grid cell covered by sea ice. It may be expressed as a fraction, a percentage, or any other dimensionless representation of a fraction. "Area fraction" is the fraction of a grid cell's horizontal area that has some characteristic of interest. It is evaluated as the area of interest divided by the grid cell area, or if the cell_methods restricts the evaluation to some portion of that grid cell (e.g. "where sea_ice"), then it is the area of interest divided by the area of the identified portion. Consequently, a variable with a standard name of sea_ice_area_fraction and an attribute cell_methods that includes "area:mean over sea", holds the sea-ice concentration : the area fraction of ocean surface that is covered by sea ice. The two quantities (the sea ice area fraction of a grid cell, and the sea-ice concentration) differ in grid cells that are partly sea, e.g. in coastal grid cells. "Sea ice" means all ice floating in the sea which has formed from freezing sea water, rather than by other processes such as calving of land ice to form icebergs. As should be clear from the above, the new text clarifies what sea_ice_area_fraction can be, it does not change its meaning in either way. |
Beta Was this translation helpful? Give feedback.
-
Tom's text is a definite improvement. We should hear from those familiar with how the standard name "descriptions" are constructed to guide us with the wording. In general, names are supposed to indicate a property or quantity independent of how it is reported. So, I think we should first describe it in general terms without reference to grid cells. Also, if a term appears in multiple standard names, then the description of that term is usually stated separately and consistently across all variables, so any change in the description of the term I've suggested some edits here:
I'm sure further tweaks of the above could improve it. |
Beta Was this translation helpful? Give feedback.
-
I wonder if we should say just before the last sentence in my description:
|
Beta Was this translation helpful? Give feedback.
-
Yes, please move it to a vocabularies issue. And I agree that all area_fraction descriptions should be reworded without (initial) reference to the.bounds of the area represented by the fraction. The area could be a cell, a region, a pixel, a zonal band, a hemisphere, the globe, etc. For example, the global land_area_fraction, as I recall, is about 30%, so there is no need to mention "cell". This is similar to "surface_temperature", for which the name is independent of the area it represents. It might apply to individual grid cells or the globe, for example. |
Beta Was this translation helpful? Give feedback.
-
The answer is being pursued in vocabularies issue 254, opened by @TomLav. Thanks. |
Beta Was this translation helpful? Give feedback.
The answer is being pursued in vocabularies issue 254, opened by @TomLav. Thanks.