Given the continued popularity of applications for the Android mobile platform, the “App Stores” offer numerous apps that, according to their descriptions, implement similar or identical functionality. In many cases, however, the description is missing, incomplete, or simply wrong. This is particularly problematic when applications process sensitive data, access critical sensors, or handle user data.
In this project, we will use autoencoders, an approach from the field of “deep learning”, to analyze similarities in the description texts and the requested permissions of Android applications. The goal is to better understand whether and to what extent applications are similar based on their description texts, and whether Android apps with comparable functionality request the same permissions.
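To make the idea concrete, the following is a minimal sketch of how an autoencoder could compress permission vectors into short codes that can then be compared. It is not the project's actual model: the four toy apps, the choice of six permissions, the network size, the learning rate, and the helper names `train_autoencoder` and `cosine` are all assumptions made for illustration. Description texts could be handled the same way once they are turned into numeric vectors, for example word counts.

```python
import numpy as np

# Hypothetical toy data: one row per app, one column per permission
# (1 = the app requests the permission). Not taken from a real data set.
PERMISSIONS = ["CAMERA", "INTERNET", "READ_CONTACTS",
               "ACCESS_FINE_LOCATION", "RECORD_AUDIO", "SEND_SMS"]
X = np.array([
    [1, 1, 0, 0, 1, 0],  # camera app A
    [1, 1, 0, 0, 0, 0],  # camera app B
    [0, 1, 1, 0, 0, 1],  # messenger A
    [0, 1, 1, 1, 0, 1],  # messenger B
], dtype=float)


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def train_autoencoder(X, code_size=2, epochs=3000, lr=1.0, seed=0):
    """Train a one-hidden-layer autoencoder with plain gradient descent.

    Returns an encoder function and the reconstruction loss per epoch.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, code_size))
    b1 = np.zeros(code_size)
    W2 = rng.normal(0.0, 0.1, (code_size, d))
    b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)   # encoder: input -> low-dimensional code
        Y = sigmoid(H @ W2 + b2)   # decoder: code -> reconstructed input
        losses.append(float(np.mean((Y - X) ** 2)))
        # Backpropagate the mean squared reconstruction error.
        dY = 2.0 * (Y - X) / (n * d) * Y * (1.0 - Y)
        dH = (dY @ W2.T) * H * (1.0 - H)
        W2 -= lr * (H.T @ dY)
        b2 -= lr * dY.sum(axis=0)
        W1 -= lr * (X.T @ dH)
        b1 -= lr * dH.sum(axis=0)

    def encode(A):
        return sigmoid(A @ W1 + b1)

    return encode, losses


def cosine(u, v):
    """Cosine similarity between two code vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


encode, losses = train_autoencoder(X)
codes = encode(X)  # one 2-dimensional code per app

print("reconstruction loss: %.4f -> %.4f" % (losses[0], losses[-1]))
print("similarity camera A vs camera B:   %.3f" % cosine(codes[0], codes[1]))
print("similarity camera A vs messenger A: %.3f" % cosine(codes[0], codes[2]))
```

The learned codes, rather than the raw permission vectors, would serve as the basis for comparing apps. On such a tiny example the similarity values carry little meaning; the sketch only shows the mechanics of encoding and comparing.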