Appl. Sci. 20 20 , 10 , x; doi: FOR PEER REVIEW www .mdpi.com/journal/ applsci [620398]
Appl. Sci.
20
20
,
10
,
x; doi: FOR PEER REVIEW
www
.mdpi.com/journal/
applsci
Article
1
Cooperation Based
Proactive Caching in Multi
–
tier
2
Cellular Networks
3
Fawad Ahmad
1
,
Ayaz
Ahmad
1
, Irshad
Hussain
2,
*
,
Peerapong
Uthansakul
3
,
*
and Suleman Khan
2
4
1
Department
of Electrical and computer Engineering,
COMSATS University
Islamabad,
Wah Campus, Wah
5
C
antt 47040,
Pakistan
;
6
2
Faculty of Electrical & Computer Engineering, University of Engineering and Technology Peshawar, 25000
7
Pakistan
(I.H.)
;
8
3
Sch
ool
of
Telecommunication Engineering, Suranaree University o
f Technology, Nakhon Ratchasima
9
30000,
Thailand
(P.U.);
10
11
*
Correspondence:
[anonimizat]
;
[anonimizat]
;
12
Received: date; Accepted: date; Published: date
13
Abstract:
The scarce caching capacity of the cache enabled local Base stations (BSs) mitigate the cache
14
hit ratio (CHR) and user satisfaction ratio (USR). However, Cache enable multier cellular networks
15
h
ave been presented as a promising candidate for fifth generation networks for much higher CHR
16
and USR through densification of networks by deploying smaller networks. In addition to this, the
17
cooperation among the BSs of various tiers for cached data trans
fer
intensify its significance many
18
folds .Therefore, in this paper we consider CHR and USR maximization of a multier cellular network.
19
We formulate a CHR and USR p
roblem for multitier cellular
networks.
W
e put major constraints on
20
caching space of BSs of
each tier. The unsupervised learning algorithms such as K
–
mean clustering
21
and col
laborative filtering are used
for clustering the similar BSs in each tier and estimating the
22
content popularity respectively.
A
novel scheme such as cluster average popularity
based
23
c
ollaborative filtering
(
CAP
–
CF
)
algorithm is used to cache the data and maximize the CHR in each
24
tier. Similarly, two novel method
s such as intra tier and Cross
tier cooperation (ITCT) and modified
25
ITCT algorithms are employed in order to optimize
the US
R. Simulations results
witnesses,
that the
26
proposed schemes yields significant performance in terms of average cache hit ratio and user
27
satisfaction ratio compared to conventional approaches
.
28
K
eyw
ords:
caching
;
cooperative network;
multi
–
tier cellular
network
;
content popularity
;
machine
29
learning
;
30
1.
Introduction
& Background
31
Recently the smart phone usage is enormously increased which result into exponential growth
32
of mobile data traf
fic. This Mobile data traffic
pos
s
es
s
an unbelievable challenge in the radio resource
33
demand [1], [2]. CISCO predicts that mobile traffic will increase 8 times from 2015
–
2020 [3] and would
34
reach to 30.6
Exabyte p
er month by the year 2020[4]. It is observed that a big chunk of increased data
35
traffic is due to duplicate downloads of famous videos files from the remote server [5]. In the recent
36
era, mobile media users are extensively sharing
information and their thoughts via
cross
net. The
37
mobile social media services such as Facebook and Twitte
rs have also increased
the data traffic
38
exponentially [6].
This unbelievable mobile data traffic has brought burden on the communication
39
networks as these are limited in the transmission rate and hardware capabilities. Also the rapid
40
growth of the contents
in the Web for example
cosse
tting
articles, games or other type of
41
entertaining contents such as videos, audios etc. have now started problems regarding quality and
42
downloading for the users [7
],
[8]. Thus, Proactive caching
performs a pivotal role in th
e upcoming
43
5G technology. Caching the popular files on nearby base station and then serving the users from the
44
Appl. Sci.
20
20
,
10
, x FOR PEER REVIEW
2
of
18
local cache are now becoming advantageous due to reducing latency and the data traffic load
45
especially on the back haul.
46
On the other hand, Machi
ne learning tool brings a new research area for the modeling and
47
prediction of the user behavior for proactive caching decisions [9]. Recently the stupendous success
48
of deep learning methods in pattern recognition, images and natural language have attracte
d the user
49
to use them for network optimization [10], [11]. Different machine learning tool can extract
50
information in the data traffic and cache the contents at the local base station [10]. For example,
51
extreme learning machine predicts the popularity of
the contents based on the
different meta features
52
of the contents [12]. Unsupervised learning such as K
–
mean clustering and Collaborative filtering
53
methods are employed to cluster the users with similar
cross
est for the contents and then cache the
54
popular
contents accordingly [11], [13]. Similarly ad
aptive caching scheme optimizes
the network
55
performance thereby decreasing the delay
and back haul load [12], [13].
In [14], the author suggested
56
the multi
–
armed bandit algorithm to estimate content popularity a
nd the authors in [15] proposed an
57
extreme
–
learning machine algorithm for estimation of the future content popularity.
58
Similarly, reinforcement learning methods are employed in cache enabled networks [16], [17]
59
and [18]. For example, in [16] a multi
–
step r
eturn actor
–
critic architecture which is reinforcement
60
learning agent is used for optimizing the cache hit ratio.
The information content can be extracted
61
from the data using a (LSTM) deep learning [19]. The contents are cached if the
cross
est of the data
is
62
known. This decreases the service latency. These contents are most likely requested by the users.
63
Deep
CachNet is a Proactive caching framework in cellular network using deep learning
64
algorithm [20]. In this caching scheme, a huge
amount of data is coll
ected from the end mobile devices
65
of users connected to SBSs. In [21], the author solve the problem of context
–
aware data caching in the
66
heterogeneous small cell networks using deep learning algorithm. The author in [22], deals with the
67
problem of minimiza
tion average energy cost in cache enabled networks especially in limited cache
68
memory nodes using reinforced learning algorithm. Similarly multi
–
agent multi
–
armed bandit can be
69
applied to the caching problems regarding the device to device communication. S
uch design applied
70
the Q
–
learning in order to coordinate the caching decisions [23].
71
The author in paper [24], strives to jointly minimize the average transmission delay in cache
72
enabled networks by jointly considering scheduling and the caching problem us
ing reinforced
73
leaning and deep learning. In [25], the author optimizes multimedia service in 5G caching networks
74
based on semantic information of user. Content popularity is estimated by the singular value
75
decomposition (RSVD) which is based on collaborat
ive filtering (CF).
In [26], the author optimize the
76
cache hit ratio, and a decision matrix is formed from a huge available data. In paper [27], proposed
77
caching scheme for the ultra
–
dense networks. Backhaul load minimization problem is solved but k
–
78
Neares
t Neighbors (k
–
NN) classification.
79
In this article, we propose a
framework for
content
caching
and using satisfaction rate using
80
cross
tier and intra tier cooperation among the base stations.
O
ur
prominent
contributions
are as
81
under
:
82
•
Introducing a data
caching system; where data is cached
in every tier of th
e network, thereby,
83
Optimizing
the
CHR
and
USR
utilizing classical method of Collaborative filtering and
K
–
mean
84
clustering.
85
•
The requests
for different contents are considered
and
a popularity matri
x is formed in each
86
tier, which
is highly sparse in nature. Proactive caching is done by creating clusters of base stations
87
using the unsupervised K
–
mean clustering in each tier and fill half of the cache memory with the
88
most popular contents
of the cluste
r
and the rest of vacant cache is occupied by the contents that will
89
be requested in future predicted by collaborative filtering
C.F
.
90
•P
resent
ing
two novel methods for the user satisfaction rate. The first method deals with the
91
clusters made by K
–
mean clustering, based on contents
requests, in
each tier and content cashing is
92
done by C.F while user satisfaction is achieved thorough
cross
tier and i
ntra cooperation among the
93
different tiers.
94
•In the second method, initially contents are cached in all tier based on CF algori
thm. Then
95
clusters are created
using K
–
mean algorithm while taking all the base station into account,
96
Appl. Sci.
20
20
,
10
, x FOR PEER REVIEW
3
of
18
irrespective of which tier
they belong to .The cluster formed may have base station
s
of different tiers
97
.The second method is faster one as ones requests don’t require intra tier
and X
–
tier flow
for the
98
fulfillment of their content satisfaction.
99
• Extensive simulations are conducted
with different contents and their requests depends on the
100
mode of the users to elaborate the effectiveness of the proposed algorithms. It is shown that by using
101
learning algorithms, caching the data on base station in the different tier, considerably impr
ove the
102
cache ratio and the user satisfaction ration.
103
2.
System Model
104
Su
ppose a modern multitier cache enabled
cellul
ar
network consisting of
t tiers of M base
105
stations, denoted
as M
t
= {1,…, M
t
}, serve N
t
user terminals (UTs) from the set N
t
= {1,…,N
t
}
deployed
106
in a geographical area that consists of residential area,
factories,
colleges and schools
etc
. As the
107
network
is
multitier
so it comprises of Pico BSs, micro BSs, and a single Macro BS to serve greater
108
number of users. Each
BS
is equipped with it
s limited physical storages S
mt
and the storage
109
capacity of
the base stations is denoted by
,
S={S
11
,S
21
,…..Sm
1
,S
12
,S
22
,……S
13
,S
23
….,S
m3
}.
110
All the BSs
are connected through limited capacity optical fibre links.
When the users request
111
the contents and if, t
he contents are cached in
the associated base station the
n it is served otherwise
112
it is
ro
uted to the other base station in the same tier or
another tier
or the contents are fetched from
113
the content servers via
limited capacity back haul. The users request
a variety of contents such as
114
music, movie
s,
breaking news, face book, twitter
s, technical books, notes etc.
from a
content
library
115
F={1,…,…..,F}, where each content f has a size of L(f)Byte and bit rate requirement of B(f) M
byte/
s.
116
The
C
ontent serve
r
caches
F Bytes of data in its storage. In such a scenario, each base station
117
proactively caches selected contents from the library F during off
–
peak hours. Let P
t
(
t
)
∈
R
M×F
is the
118
content popularity matrix which describe the history of
requested contents
at time t, where each
119
coefficient p
m
t
,
f
(
t
)
represents the frequency of requesting the content f received by the
BS
mt
of tier t.
In
120
fact, the matrix P
t
(
t
) is the local content popularity distribution observed at base BS
mt
, whereas the
121
global popularity
distribution of the all the contents is described by the Zipf distribution PF(f),
∀
f
∈
F
122
i.e. lesser contents have greater popularity and greater contents are less popular. Moreover, we
123
consider that all SBSs
of all tiers
have stationary content popularity
matrix over T time slots, thus we
124
represent P(t) by P. Using tools from unsupervised machine learning and collaborative filtering
125
(CF)
,
we propose a proactive caching Procedure using P to determine which content should be cached.
126
To show this, let us defi
ne the cache decision matrix of BSs as X
t
(t)
∈
{0,1}
Mt×F
, where the entry x
mt,f
127
takes 0 if the f
th
content is not cached at m
th
BS
at tier t
in
time
t
, and 1 otherwise. We suppose that the
128
cache placement is done during off
–
peak hours, therefore X
t
(
t
) is
represented by a cache decision
129
matrix X
t
which remains static during peak hours .The second decision variable is Y which is user
130
satisfaction
ratio
parame
ter whose value is one
if request is served other wise 0.
131
2.1. Caching Model
132
In our model, every tie
r has
its
own users and they have their demands according to their
133
preferences. In particular, we consider such caching system where users download different types of
134
contents e.g
.
a university student will have to down load books, lecture notes and video
lecture
135
relevant to his fields. People
interest
ed
in current affairs will watch breaking news.
we consider
136
d
ifferent types of files and th
e BS that receive request for similar data will be virtually clustered,
137
which
will discussed i
n detail later on. In ou
r model.
The caching capacity of Pico BS is lesser
138
compared to micro BS and Macro BS. On the other hand, Macro base station will cache more files as
139
compared to micros and Picos. Similarly number of users for pico is lesser as compared to micro and
140
Macro .
In the multitier system, there will be more pico then micro and single macro. As the capacity
141
is limited, so only popular contents can be store that the associated users requested on its local base
142
station. Thus the base station will offload the traffic th
ereby reducing traffic on the back haul and
143
improve the quality of services.
144
Let us consider a base station BS
m
receives a set of requests R (m) for certain data during
145
time
inter
val T seconds .These requests are Zipf
–
like distribution in nature. Let X be the cache decision
146
Appl. Sci.
20
20
,
10
, x FOR PEER REVIEW
4
of
18
matrix that depends on the popularity matric P of the content and C (m) is the set of cached contents
147
according to then
the cache hit ratio is given in Equation
1 as under.
148
CHR=
∑
(
)
∩
(
)
∑
(
)
(
1
)
149
150
In t
–
tier wireless
communication system, each tier will require its cache to be filled by
the
151
requested contents.
The overall or the average cache hit ratio of the multi
–
tier system can be modelled,
152
as
153
α=
∑
∑
(
)
∩
(
)
∈
∈
∑
∑
(
)
∈
∈
(
2
)
154
Each tier has its own users and cache decision
matrix X
t
which depends on
the popularity matrix
155
P
t
(t)
∈
R
M×F
156
X
mt
=
