Accurate geodata is required when developing tomorrows
telecom network, where an increased demand are seen in
several fields such as video, 5G and IoT solutions. By
using geodata, it is possible to plan and design telecom
networks with optimal performance. Telecom networks
are often overloaded in crowded areas, and by localizing
social structures that contains a lot of people, it is possible
to distribute resources more efficiently.
Together with Vricon we have developed a product utilizing deep learning convolutional
neural networks to efficiently detect and mark airports in satelite images. This webpage
describes the task tackeled, how the challange was accomplished and what results were obtained.
Welcome to the future of computer vision.
Our product constitutes a way to efficiently train deep neural networks
to perform semantic segmentation on satelite images. The product is modular,
and can easily be adapted to detect more types of objects and landmarks.
Provided is also pre-trained networks, based on the award-winning ResNet50 and
DeepLabV3+ designs, which are out-of-the-box capeable of detecting airports from
satellite images.
Above is shown an image describing the concept of semantic segmentations. Illustrated is a satellite image, which has been fed to a deep convoluational neural network, and classified to several diferent types of objects and landmarks, reining from rivers and images to roads and bushes.
DeepLabV3+ is a deep convolutional neural network designed to perform Semantic Segmentation.
It makes use of Spatial Pyramid Pooling to preserve semantic information. We have implemented
DeepLabV3+ in our system, designing it to perform Semantic Segmentation on satellite images, with
focus on detecting and classifying airports. A resulting image when using the system on a satellite image is visible below.
As ilustrated in the above image, our implementation of DeepLabV3+ can successfully mark an airport on the given map. It does so with an F1-score of 98.3 % and 75.3 %, for non-airport and airport pixels respectively, resulting in an average F1-score of 86.8 %.
ResNet50 is deep neural network designed to perform image classification.
It utilizes Residual Skips to achieve more efficient training and higher accuracy.
We have modified the original design to perform semantic segmenation instead of image classification,
while still making use of the benificial residual Skips. Below is shown a resulting image from applying our implementation
to a satellite image.
As visible above, our implementation of ResNet50 is clearly able to mark an airport in a given satellite image. It does so with an F1-score of 97.6 % and 58.9 %, for non-airport and airport pixels respectively, resulting in an average F1-score of 78.3 %.
The conclusion that can be drawn from this project is that airports are a challenging type to classify, possibly because of their variety in appearance as well as similarity to other social structures. The experiment does however show that it is possible to train these models to find larger airports, but with an insufficient reliability. The best average F1-score for DeepLabV3+ achieved a result higher than the required of 0.80, while the ResNet50 implementation did not. To get a more accurate classification it is obvious that a much large set of training data is required, not at least when training a neural network that has millions of parameters.