Statistical API: sampleCount and noDataCount

mke · September 30, 2021, 6:14am

Hi,

How are sampleCount and noDataCount defined in case of non-rectangular geometries?

It seems that noDataCount never reaches 0 for non-rectangular regions but has a minimum value that is related to the region’s boundingbox.

For example the results in this example have a constant noDataCount for different dates:
https://sentinelhub-py.readthedocs.io/en/latest/examples/statistical_request.html#Multiple-requests-for-a-collection-of-geometries

If I’m using dynamic masking (with for example CLM/SCL flags) to set noDataMask, is there a way to determine the number of no-data-pixels within the irregular polygon?

william.ray · October 1, 2021, 9:53am

Hi,

Thanks for your question. This is quite a difficult concept to answer, but, to put it simply when you make a request, a bounding box of a geometry is requested and the pixels not in the geometry but within the bounding box are labelled no data. Therefore:

sampleCount = width * height (always the bounding box of the request, be it bbox or geometry that is being requested)
noDataCount = sampleCount - no. of pixels in the geometry

I hope this answers your question. Let me know if you need a more detailed explanation and I can run through an example of the concept for you.

mke · October 1, 2021, 12:13pm

Thank you, this helps.

For my use case, daily statistics from S2L2A data with custom noData mask derived from CLM and SLC, I’m interested also in the daily number of no-data pixel within the geometry.

Getting that value would be nice improvement to the API.

Is there currently any other way to get that information than to assume that in the time-series the date when noDataCount gets it’s minimum value is completely cloud free and derive the total number of pixels within the geometry accordingly?

no. of pixels outside the geometry = min(noDataCount)
thus
number of no-data-pixels in geometry = noDataCount - min(nodataCount)

ksnn · April 6, 2023, 11:54am

I had the same issue. I’m fetching Sentinel-1 and Sentienl-2 stats for parcels. My current workaround is to use sampleCount and noDataCount from Sentinel-1 stats to determine the number of no-data pixels within the geometry to get Sentinel-2 cloud coverage within the parcel.

Is there a more robust way to determine the number of pixels that the geometry corresponds to?

chung.horng · April 11, 2023, 10:21am

Hi @ksnn ,

The inability to determine where the dataMask comes from is a known issue and we will upgrade the dataMask to provide the information indicating if the nodata comes from the lack of observations v. from the clipping of geometry.

However, we don’t have an ETA for the upgrade. Sorry for the inconvenience.

ksnn · April 11, 2023, 10:54am

No worries! Thanks for the update