A multi-objective approach for PH-graphs with applications to stochastic shortest paths
Mathematical Methods of Operations Research
https://doi.org/10.1007/s00186-020-00729-3
ORIGINAL ARTICLE
A multi-objective approach for PH-graphs with applications
to stochastic shortest paths
Peter Buchholz1
· Iryna Dohndorf1
Received: 25 September 2019 / Revised: 23 August 2020 / Accepted: 3 October 2020
© The Author(s) 2020
Abstract
Stochastic shortest path problems (SSPPs) have many applications in practice and
are subject of ongoing research for many years. This paper considers a variant of
SSPPs where times or costs to pass an edge in a graph are, possibly correlated, random
variables. There are two general goals one can aim for, the minimization of the expected
costs to reach the destination or the maximization of the probability to reach the
destination within a given budget. Often one is interested in policies that build a
compromise between different goals which results in multi-objective problems. In
this paper, an algorithm to compute the convex hull of Pareto optimal policies that
consider expected costs and probabilities of falling below given budgets is developed.
The approach uses the recently published class of PH-graphs that allow one to map
SSPPs, even with generally distributed and correlated costs associated to edges, on
Markov decision processes (MDPs) and apply the available techniques for MDPs to
compute optimal policies.
Keywords Stochastic shortest path problems · Markov decision processes · Phase
type distributions · PH graphs · Multicriteria optimization
1 Introduction
Shortest path problems are classical decision problems in computer science and operations research with many practical applications. The goal is to find an in some sense
optimal path between a source and a destination node in a weighted directed graph.
Weights of edges describe time, costs, gain, or reliability when passing the edge. We
use in the sequel the term costs of an edge to describe the quantitative parameter. The
B Peter Buchholz
Iryna Dohndorf
1
Informatik IV, TU Dortmund, 44221 Dortmund, Germany
123
P. Buchholz, I. Dohndorf
entire version of the problem assumes deterministic costs. However, often costs are
not exactly known such that stochastic descriptions of the costs are more appropriate
which results in the Stochastic Shortest Path Problem (SSPP). We consider in this
paper cases where a decision maker can decide to choose an outgoing edge when
reaching a node. This implies that decisions are made adaptively and not a priori as
in Nie and Wu (2009). In many practical applications, random variables modeling the
edge costs are not independent. For example adjacent roads have similar traffic conditions, interest rates in subsequent periods are correlated and failure rates of related
components are dependent. This implies that the introduction of correlation in SSPPs
is important for realistic models but makes the problem of specifying parameters and
computing optimal paths even harder.
In a stochastic setting with adaptive decisions, a unique optimal path usually does
not exist, instead the chosen path depends on the realization of costs on already passed
edges. This includes the situation that even with only positive weights, the optimal
path may contain cycles. Instead of a path, a policy is defined that specifies for each
node the choice of the outgoing edge which may depend on additional information
that is available when reaching the node. The term optimal is not uniquely defined
for stochastic decision problems. One possible interpretation is the minimization (or
maximization) of the expected costs to reach the destination from the source. In other
settings, it is more appropriate to maximize (or minimize) the probability to reach the
destination within a given budget. In traffic networks, the first goal corresponds to
the minimization of the expected travel time and the second goal corresponds to the
maximization of the probability to meet a deadline. Often one is interested in a compromise solution, i.e., a policy that reaches the destination in a short expected time with
a small probability of missing the deadline. This results in a multi-objective optimization problem, with incomparable solutions. Therefore it is important to characterize
the set of Pareto optimal solutions or at least a convex hull of this set.
This paper considers SSPPs with correlated edge costs and introduces methods to
compute optimal policies for instances of the two mentioned problems. Furthermore,
an approach is introduced to approximate the convex hull of Pareto optimal policies,
if several goal functions are combined. The problem will be solved in the context of
PH-graphs (PHGs) (Buchholz and Felko 2015), a recently published class of stochastic graphs that allows one to include generally distributed edge costs and correlation
between adjacent edges. Edge costs are modeled by phase-type distributions (PHDs)
(Buchholz et al. 2014; Neuts 1981). PHGs can be mapped on Markov decision processes (MDPs) such that algorithms from MDPs can be adopted to compute optimal
policies and a convex hull of Pareto optimal policies.
Related work SSPPs are considered in many different applications ranging from
transportation (Barbarosoǧlu and Arda 2004; Nikolova and Karger 2008) to computer
networks (Nain and Towsley 2016), data migration (Li et al. 2016), social networks
(Rezvanian and Reza Meybodi 2016) or finance (Budd 2016; Koenig and Meissner
2015). An enormous number of papers has been published in the area such that we
can only highlight a few, most relevant results for our work. Algorithms to compute
policies that maximize the probability to reach the destination within a given budget
can be found in Fan et al. (2005b) and Samaranayake et al. (2012). Correlation among
edge costs is considered in several papers (Fan et al. 2005a; Nain and Towsley 2016;
123
A multi-objective approach for PH-graphs with applications…
Samaranayake et al. 2012). The major problem in this context is the compact definition
of dependencies and its consideration in optimization algorithms. Often dependencies
are defined for the costs of adjacent edges.
The combination of different goal functions results in multi-objective SSPPs (Ji
et al. 2011; Zhaowang et al. 2004; Randour et al. 2015; Zhang et al. 2010). Often
multi-dimensional costs are considered in this case. Alternatively, the combination of
expectation and variance is minimized (Zhaowang et al. 2004; Zhang et al. 2010) or
the probability of exceeding the budget is defined as a constraint (Ji et al. 2011). Often
heuristic algorithms are applied to approximate the set of Pareto optimal solutions (Ji
et al. 2011; Zhaowang et al. 2004; Zhang et al. 2010). Only in some cases correlations
are considered, more often independent distributions are assumed (Nikolova et al.
2006).
We analyze the problems in the context of PHGs which have been defined in Buchholz and Felko (2015) and Dohndorf (2017). Distributions in this model class are
described by (...truncated)