Drone Imagery Anonymization using AI

Problem statement

The GDPR introduced new regulations on how to process and handle personal data. As such, meeting the GDPR requirements is key in selling software applications to customers. The anonymization of textual data is largely accepted as something that is possible, but this is not something that is true for images. For example sometimes a face detection algorithm fails and a face is not blurred. Further, vehicles are often recognizable not just by their number plates but also by logos or other characteristics. A specific research challenge with regard to drone imagery is the fact that the camera is mobile and that real-world footage often features rapid zooming in and out. In this thesis we want to mitigate these problems.

Drone Anonymization

Goal

The goal of this thesis is to create a machine learning algorithm that can pseudonymize/anonymize faces and vehicles in images captured by a drone taking into account above challenges. Technologies that can be used are Python, Tensorflow, Keras and in general the Python data science and machine learning track. More specifically, this topic encompasses two potential research challenges.

Detection and tracking of moving objects such as people and vehicles is challenging in the case of drone footage as camera movement can be sudden and fast and the possibility to zoom further complicates matters. The fact that no frames or edges of objects can be missed in the case of anonymization makes this a particularly challenging problem. One possibility for research could be to use motion sensors and/or drone input signals to help anticipate movement and improve tracking.

An often-used technique today in pseudonymization is to replace identifiable elements rather than replace them, e.g. replace all names in documents by other names. That way an observer is unable to identify names that were accidentally missed by the algorithm. Recently, this technique has been transferred to the vision space, amongst others under the name of deep privacy whereby GAN-based techniques are used to replace real faces by artificially generated ones. Optimizing this technique for differing circumstances (cameras, angles) and drone-specific aspects such as motion and zooming already present reasonable challenges. Replacing number plates and other recognizable writing or logos on cars and vans present even more research subtopics.

ML6

This thesis is in collaboration with the company ML6, a fast-growing machine learning services company based in Ghent, with additional offices in Amsterdam, Berlin and London. They deliver specialized engineering, research, and consulting services to clients active in a broad range of sectors. Technical areas of expertise include machine vision, natural language processing, time series analysis and reinforcement learning.

This thesis has been conducted by Karel Vandenbussche (Karel.Vandenbussche@UGent.be).