Blame view

investigation.tex 9.51 KB
Jinghao Shi authored
1
2
3
\section{Investigation}
\label{sec:investigation}
Jinghao Shi authored
4
To investigate reciprocal sharing opportunity in real life scenarios, we
Geoffrey Challen authored
5
6
7
8
9
10
obtained a \wifi{} scan result dataset from
\PhoneLab{}\footnote{\url{http://www.phone-lab.org}} (\S\ref{subsec:phonelab}).
We first discuss some heuristics to identify the home AP for each device
(\S\ref{subsec:homeap}). Then we show the RSSI comparison between a user's
home and neighbor APs (\S\ref{subsec:better}). Finally, we explore the
reciprocal sharing relationships in the dataset (\S\ref{subsec:reciprocal}).
Jinghao Shi authored
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28

\subsection{PhoneLab \wifi{} Dataset}
\label{subsec:phonelab}

\begin{table}[t]
  \begin{tabularx}{\columnwidth}{Xr}
    \toprule
    Begin & 11/7/2014 \\ 
    End & 4/3/2015 \\ 
    Duration (Days) & 147 \\ \midrule
    Participants & 254 \\
    Device Type & Nexus~5 \\ \midrule
    Scans & \num{21192417} \\
    Observed APs & \num{1197522} \\
    Used APs & \num{15668} \\ \midrule
    \wifi{} Sessions & \num{466032} \\
    \bottomrule
  \end{tabularx}
Jinghao Shi authored
29
30
  \caption{\textbf{\PhoneLab{} \wifi{} Dataset Summary.} Used APs refers to the
  subset of total APs that were used by the devices participating in the study.}
Jinghao Shi authored
31
  \label{tab:summary}
Jinghao Shi authored
32
  \vspace*{-3mm}
Jinghao Shi authored
33
34
\end{table}
Geoffrey Challen authored
35
36
37
38
39
40
41
42
43
44
45
46
47
\PhoneLab{}\cite{phonelab-sensemine13} is a public smartphone platform
testbed operated at the University at Buffalo. Several hundreds of
participants carry instrumented Nexus 5 smartphones as their primary device.
In particular, the smartphone platform was modified to log each \wifi{} scan
result and \wifi{} connection events naturally generated by the Android
system. Note that from data collection point of view, platform
instrumentation is not necessary, and the same information can also be logged
by applications with appropriate permissions. A \wifi{} scan result
represents the device's network visibility, and consists of multiple
entries---each corresponds to one \wifi{} AP the device observed. The content
of one entry includes: (1) beacon timestamp, (2) AP SSID and BSSID, (3) AP
channel and (4) RSSI. The timestamp when the scan was performed is also
logged. Table~\ref{tab:summary} summarizes the \PhoneLab{} \wifi{} dataset.
Jinghao Shi authored
48
49
50
51

\subsection{Home AP Detection}
\label{subsec:homeap}
Jinghao Shi authored
52
53
54
55
56
57
58
59
We focus on home \wifi{} networks which are more likely to reveal stable and
immediate reciprocal sharing opportunities. For this purpose, we first developed
several heuristics to identify the home AP for each device in the dataset. The
intuition is that the devices are most likely connected to their home AP at
night. More specifically, to identify the home AP for a device, we look at
\wifi{} sessions that happened during 12~AM and 4~AM and count the number of
days that the device connects to each AP during this time period. We then
identify the AP which has the largest day count as the device's home AP,
Jinghao Shi authored
60
61
provided that the day count is larger than a threshold (30 days) to further
filter out false positives.
Jinghao Shi authored
62
Jinghao Shi authored
63
After applying the above heuristics, the home AP information of 107 devices is
Jinghao Shi authored
64
identified, including 101 unique BSSIDs. There are 6 BSSIDs that are identified
Jinghao Shi authored
65
66
as home APs for two devices. After further investigation and clarification with
\PhoneLab{} administrators, we found this is because some participants are family
Jinghao Shi authored
67
68
69
members, and certain participants had device replacements during the data
collection period. In both cases, multiple devices may be associated with the
same home AP.
Jinghao Shi authored
70
Jinghao Shi authored
71
\subsection{\wifi{} Session Signal Strength}
Jinghao Shi authored
72
73
\label{subsec:better}
Jinghao Shi authored
74
75
After identifying the home AP for each device, we ask two questions: (1) When
the device is connected to its home AP, how often does it receive a better signal
Jinghao Shi authored
76
from neighbors' APs which it does not have access to? and (2) When the home AP
Jinghao Shi authored
77
78
fails to provide the best signal, are there dominant neighbor APs
that provide better signal most of the time?
Jinghao Shi authored
79
Jinghao Shi authored
80
81
82
\sloppy{%
  To answer the first question, we inspect scan results that are reported during
  \wifi{} sessions with home APs. For each such scan result, we identify the
Jinghao Shi authored
83
84
85
86
87
88
  currently associated home AP, $AP_{home}$, and the AP with best RSSI, denoted
  as $AP_{best}$. We are particularly interested in \textit{sub-optimal} cases,
  where: (1) $AP_{home} \neq AP_{best}$ and (2) the device never connects to
  $AP_{best}$ in the dataset. Such cases indicate that the device could
  potentially improve its \wifi{} performance by connecting to a neighbor AP
  which has a strong signal yet it does not have access to that AP. Note that
Jinghao Shi authored
89
90
  here we consider RSSI as a hint in determining the \textit{optimal} AP and it
  is well understood that RSSI does not directly translate to \wifi{}
Jinghao Shi authored
91
  performance, which we will discuss in Section~\ref{subsec:monitoring}.  Also note
Jinghao Shi authored
92
93
94
  that the cases when the device is not connected to APs with the strongest
  signal due to bad roaming strategies are not interesting in the context of
  this paper, and are excluded by the second condition.
Jinghao Shi authored
95
}
Jinghao Shi authored
96
97
98
99
100
101

\begin{figure}[t]
  \centering
  \includegraphics[width=\columnwidth]{./figures/HomeAPSessionRSSI.pdf}
  \caption{\textbf{CDF of Sub-Optimal Connection Time.}}
  \label{fig:suboptimal}
Jinghao Shi authored
102
  \vspace*{-4mm}
Jinghao Shi authored
103
104
\end{figure}
Jinghao Shi authored
105
We classify all scan results reported during home \wifi{} sessions into
Jinghao Shi authored
106
107
two categories: sub-optimal and the rest. For each device, we calculate the
percentage of time when the scan results indicate sub-optimal association.
Jinghao Shi authored
108
Figure~\ref{fig:suboptimal} shows the CDF of this percentage for 
Jinghao Shi authored
109
the 107 devices. We make several observations. First, for 60\% of the devices,
Jinghao Shi authored
110
their home APs usually provides best signal (sub-optimal percentage less than
Jinghao Shi authored
111
20\%). This result is not particularly surprising considering that home APs are
Jinghao Shi authored
112
usually carefully positioned to provide good coverage. Second, we notice
Jinghao Shi authored
113
114
that for certain number (15\%) of devices, their home APs failed to provide best
signal for more than 50\% of the time, suggesting that these users may benefit
Jinghao Shi authored
115
from sharing the \wifi{} access of neighbor APs.
Jinghao Shi authored
116
Jinghao Shi authored
117
Next, we want to answer the question when the device is in a sub-optimal association
Jinghao Shi authored
118
119
120
121
with its home AP, are there \textit{dominant} neighbor APs that
usually provide the best signal among other neighbor APs? If such dominant
neighbor APs exist, then by just sharing access of several particular neighbor
APs, the device's sub-optimal association time can be largely reduced. To this
Jinghao Shi authored
122
end, we look at all the scan results in the sub-optimal category, and count the
Jinghao Shi authored
123
number of times that each neighbor AP appears as $AP_{best}$. For each device,
Jinghao Shi authored
124
125
we calculate the fraction of the top $n (1 \le n \le 3)$ dominant neighbor APs.
Figure~\ref{fig:dominantap} shows the CDF of dominant AP fraction. By sharing 1,
Jinghao Shi authored
126
2 or 3 neighbor APs, the suboptimal connection time for 50\% of the devices can be
Jinghao Shi authored
127
reduced by 55\%, 82\% and 90\% respectively.
Jinghao Shi authored
128
129
130

\begin{figure}[t]
  \centering
Jinghao Shi authored
131
  \includegraphics[width=\columnwidth]{./figures/DominantNeighborAPFigure.pdf}
Jinghao Shi authored
132
  \caption{\textbf{CDF of Dominant AP Fraction.}}
Jinghao Shi authored
133
  \label{fig:dominantap}
Jinghao Shi authored
134
  \vspace*{-4mm}
Jinghao Shi authored
135
\end{figure}
Jinghao Shi authored
136
Jinghao Shi authored
137
\subsection{Reciprocal Sharing Opportunities}
Jinghao Shi authored
138
139
140
\label{subsec:reciprocal}

Finally, we investigate the cases where two devices can obtain better signals
Jinghao Shi authored
141
142
143
144
145
from each other's home AP, i.e., reciprocal sharing opportunities. For this
purpose, we build a reciprocal sharing graph $G_r=(V, E)$, where $V$ is the set
of APs, and $\langle AP_i \rightarrow AP_j \rangle \in E$ if $AP_i$'s clients
receive better signal from $AP_j$, that is, $AP_j$ appears as $AP_{best}$ in the
scan results of $AP_i$'s clients. Note that according to the definition, $AP_i$
Jinghao Shi authored
146
is one of the identified home APs, while $AP_j$ could be any other arbitrary AP.
Jinghao Shi authored
147
148
149
150
151
152
153
154
155
Loops in $G_r$ represent reciprocal sharing opportunities.

To capture the reciprocal sharing relationships among \PhoneLab{} participants,
we further construct a subgraph of $G_r$, $G_r'=(V', E')$, where $\langle AP_i
\rightarrow AP_j \rangle \in E'$ only if both $AP_i$ and $AP_j$ are identified
home APs of \PhoneLab{} participants. Figure~\ref{fig:reciprocal} visualize
$G_r'$, where nodes without outgoing edges are omitted for clarity. Sharing
opportunities are sparse but exist. In particular, we observe one pair of home
APs, node 0 and 7, which exhibit reciprocal sharing relationships.
Jinghao Shi authored
156
Jinghao Shi authored
157
158
\begin{figure}[t]
  \centering
Jinghao Shi authored
159
  \begin{subfigure}{\columnwidth}
Jinghao Shi authored
160
161
    \centering
    \includegraphics[width=0.5\columnwidth]{./figures/ReciprocalSharingFigure.pdf}
Jinghao Shi authored
162
163
164
165
166
167
    \caption{\textbf{Reciprocal Sharing Graph Among PhoneLab Participants.}}
    \label{fig:reciprocal}
  \end{subfigure}
  \begin{subfigure}{\columnwidth}
    \includegraphics[width=\columnwidth]{./figures/SpatialSparseFigure.pdf}
    \caption{\textbf{CDF of Node Outdegree in Reciprocal Sharing Graph.} Only
Geoffrey Challen authored
168
    outdegrees of identified \PhoneLab{} home APs are counted. }
Jinghao Shi authored
169
170
    \label{fig:sparse}
  \end{subfigure}
Jinghao Shi authored
171
172
  \caption{\textbf{Reciprocal Sharing Opportunities in \PhoneLab{} Dataset.}}
  \vspace*{-3mm}
Jinghao Shi authored
173
\end{figure}
Jinghao Shi authored
174
175
176

We must point out that \PhoneLab{} participants reside sparsely among the vast
Buffalo area, and the above analysis is further restricted to those participants
Jinghao Shi authored
177
whose home AP can be detected using heuristics described
Jinghao Shi authored
178
179
180
181
182
183
184
185
186
187
188
189
Section~\ref{subsec:homeap}. The consequence of such sparsity is that most of
the neighbor APs which can provide better signal are not one of the identified
home APs, thus are not shown in Figure~\ref{fig:reciprocal}. To quantify the
spatial sparsity, Figure~\ref{fig:sparse} shows the CDF of node outdegree
in $G_r$ and $G_r'$. While the median node outdegree in $G_r$ is 2, 95\% of nodes
in $G_r'$ has no outgoing edges.  The fact that reciprocal sharing
opportunities exist at all in such a sparse dataset is quite surprising, and
motivates the need for a system to detect and enable such reciprocal \wifi{}
sharing.