1 Introduction
-
We propose a procedure for the identification of urban groups. The identification approach is applicable to every context providing a graph that expresses both the interactions/communications among users and the users’ mobility traces. Due to its high modularity, the methodology can be employed to discover whatever subgraph expresses the concept of group, as well as to map, when feasible, the group’s activities in urban places. In this latter respect, it finds its favorite locations.
-
By applying the above procedure, we analyze how urban groups meet and behave within the urban space. We show that these groups meet all the main criteria of what makes for a sociological group, namely: mutuality (i.e. groups are highly dense subgraphs where each one interacts with any other); reachability (i.e. within a group no one is disconnected); interactivity (i.e. urban group members interact with one another frequently, and in large groups they devote much greater efforts to interacting with one another and to maintaining relationships established within the group than they do in small groups).
-
We provide a characterization of the urban groups by analyzing their size and membership, and we find similarities with modern instant messaging services (e.g. WhatsApp and WeChat). In addition , we also focus on the preferences of urban groups by investigating the places where they meet and the frequency with which they gather. Specifically, we show that, in strict analogy to human mobility, urban groups are characterized by few visited locations; also they need to combine on-phone interactions with gatherings in such locations, since the visitation patterns of these locations is regular. Finally, we investigate how their preferences impact the city of Milan. This tells us which areas encourage group get-togethers best.
-
We also highlight how mobility and interaction information define social roles within urban groups [10]. Specifically, we focus on the identification of leader/follower relations through the visit patterns of the places hosting urban group gatherings. We find a subset of members (the leaders) who take part frequently in the get-togethers, while other members (the followers) play a much more marginal role w.r.t. the urban group activities. The same observation also holds for the frequency of the interactions within a group. In this case, within the largest groups, we identify the presence of a backbone of strong links involving a small subset of group members.
-
Generally, we show that cellular network data—CDRs—are a feasible and rich source of data to discover and analyze the behavior of social groups, since they capture both social interactions and a medium-grain mobility needed to identify likely group get-togethers.
2 Dataset
2.1 Data description
2.2 User’s localization
LocationAPI
that provides the cell center along with the estimated error. Currently, we are not using this last data and we assume that the cell center corresponds to the exact position provided by the system. For each cell, \(\mathit{cell}_{i}\), \(r_{i}\) is half the mean of the Euclidean distances between the center of \(\mathit{cell}_{i}\) and the centers of the six closest cells .33 Methodology
3.1 Interaction graph building
Nodes | Links | Density |
k̂
|
ŵ
| % nodes in GCC |
ĉ
|
---|---|---|---|---|---|---|
289,448 | 429,273 | 1.02⋅10−5 | 3 | 29 | 78% | 0.12 |
3.2 Cohesive group identification
3.3 Co-location filtering
4 Urban group behaviors
4.1 Size and membership
4.2 Locations and visit patterns
4.3 Interactivity of the urban groups
5 Preferences of urban groups
5.1 Favorite interactions within urban groups
5.2 Favorite locations
Minimum days | Mean | Median | Std. | 75-pct | 90-pct | 95-pct |
---|---|---|---|---|---|---|
50% | 1.09 | 1.00 | 0.54 | 1.00 | 2.00 | 2.00 |
60% | 0.89 | 1.00 | 0.51 | 1.00 | 1.00 | 2.00 |
70% | 0.78 | 1.00 | 0.51 | 1.00 | 1.00 | 1.00 |
80% | 0.68 | 1.00 | 0.51 | 1.00 | 1.00 | 1.00 |
90% | 0.54 | 1.00 | 0.52 | 1.00 | 1.00 | 1.00 |
100% | 0.40 | 0.00 | 0.51 | 1.00 | 1.00 | 1.00 |