Why are we doing it?
Premises and Premise Counts
Currently we have FCC total count of locations (BSL) at the census block level.
MS Building Foot print
Overview
Microsoft (MS) used satellite data (from multiple campains/vintage) to get the footprint of buildings.
They are classifying pixels that are supposed to be part of a buildings (segmentation using a neural network) and then convert pixels to a shape.
It exists worldwide and for US states.
The precision of their model vary depending on the region: the Carribean is at 92,2% and the US at 98.5%. The rate of false positive is 1% for the US and 1.8% for the Carribean. (Oceania was not provided)
The license is ODbL.
Buildings to BSL?
Buildings are shapes, BSL are points. We converted the buildings to single point (arbitrary: the first vertex of the shape) to lower the amout of data. Hence, now we have “buildings” summarized to points (lat/long).
Our pipeline can be find here (private repo)
We do not have access to lat/long of BSL (this is only part of the fabric data). Our assumptions is if a count at block match they are describing the same reality (we can’t do the “on the ground verification”).
The number of buildings reported for 51 states is: 130 099 920
While is the number of BSL is: 114 074 438
Should this information be provided and how?
What could be the use cases?
Integrating those informations will have cost: information “overload”, documentation about it is needed
We can:
Provide the count per census block of MS building footprint and let the user decide on the quality of the data around their specific geography
Add the dot to the map
if we add dot and/or count per block, how trustworthy this is information will be: add a “confidence” about it (will always be statitical)
How this data helps BEAD applicant?