This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON CYBERNETICS 1 Multiobjective Vehicle Routing Problems With Simultaneous Delivery and Pickup and Time Windows: Formulation, Instances, and Algorithms Jiahai Wang, Member, IEEE, Ying Zhou, Yong Wang, Member, IEEE, Jun Zhang, Senior Member, IEEE, C. L. Philip Chen, Fellow, IEEE, and Zibin Zheng, Member, IEEE Abstract—This paper investigates a practical variant of the vehicle routing problem (VRP), called VRP with simultaneous delivery and pickup and time windows (VRPSDPTW), in the logistics industry. VRPSDPTW is an important logistics problem in closed-loop supply chain network optimization. VRPSDPTW exhibits multiobjective properties in realworld applications. In this paper, a general multiobjective VRPSDPTW (MO-VRPSDPTW) with five objectives is first defined, and then a set of MO-VRPSDPTW instances based on data from the real-world are introduced. These instances represent more realistic multiobjective nature and more challenging MO-VRPSDPTW cases. Finally, two algorithms, multiobjective local search (MOLS) and multiobjective memetic algorithm (MOMA), are designed, implemented and compared for solving MO-VRPSDPTW. The simulation results on the proposed real-world instances and traditional instances show that MOLS outperforms MOMA in most of instances. However, the superiority of MOLS over MOMA in real-world instances is not so obvious as in traditional instances. Manuscript received February 16, 2015; revised February 16, 2015; accepted February 27, 2015. This work was supported in part by the National High-Technology Research and Development Program (863 Program) of China under Grant 2013AA01A212, in part by the National Natural Science Foundation of China (NSFC) for Distinguished Young Scholars under Grant 61125205, in part by the NSFC under Grant 61332002, Grant 61300044, and Grant 61273314, and in part by the Program for New Century Excellent Talents in University under Grant NCET-13-0596. This paper was recommended by Associate Editor J. Wang. J. Wang is with the Department of Computer Science, Sun Yat-sen University, Guangzhou 510006, China (e-mail: [email protected]). Y. Zhou is with the Department of Computer Network Technology, Shenzhen Institute of Information Technology, Shenzhen 518172, China. Y. Wang is with the School of Information Science and Engineering, Central South University, Changsha 410083, China. J. Zhang is with the Sun Yat-sen University, Guangzhou 510006, China, also with the Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China, also with the Engineering Research Center of Supercomputing Engineering Software, Ministry of Education, China, and also with the Key Laboratory of Software Technology, Education Department of Guangdong Province, Guangzhou 510006, China. C. L. P. Chen is with the Faculty of Science and Technology, University of Macau, Macau 99999, China. Z. Zheng is with the Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen 518057, China. Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TCYB.2015.2409837 Index Terms—Bi-directional logistics, multiobjective optimization, simultaneous delivery and pickup, vehicle routing problem with time windows (VRPTW). I. I NTRODUCTION ECENTLY, green manufacturing and logistics have emerged as the new agenda item in supply chain management [1]–[3]. They have received increasing attention from governments and business organization. One of the actions taken by manufacturing companies toward green manufacturing is to collect end-of-life products from customers for either reuse or proper disposal, which is known as reverse logistics [3]–[5]. Economics, environmental laws, and the environmental consciousness of consumers are the driving factors for adopting reverse logistics concepts in industry. Depending on the nature of returned products, one option is to design combined distribution-collection (delivery-pickup) systems. For example, in the distribution system of grocery store chains, each grocery store may have demand for both delivery (fresh food or soft drinks) and pickup (outdated items or empty bottles) and is served with a single stop by the supplier. In the foundry industry, collection of used sand and delivery of purified reusable sand at the same customer location are carried out with only a single stop. In the printer manufacturing industry, full ink toners and cartridges are delivered and empty ones are collected. In the photocopier manufacturing industry, manufacturers are required to take back or properly dispose of end-of-life products. In these cases, the utilization of vehicles increases significantly when merging products brought to the customers (forward logistics) with returning products brought back to the depot (reverse logistics). Thus, the vehicle routing and the flows of freights become more effective and balanced in the bi-directional logistics [2]. In this paper, a more realistic and general variant of the vehicle routing problem (VRP), called VRP with simultaneous delivery and pickup with time windows (VRPSDPTW), is considered. VRPSPDTW considers simultaneous pickup and delivery at each customer such that a customer is visited only once within the specified time window and without violating R c 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/ 2168-2267 redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information. This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 2 the vehicle capacity constraints [4]–[6]. VRPSPDTW is a challenging combinatorial optimization problem, containing complex constraints not present in classic VRP [1], [7]. VRPSPDTW is NP-hard because it contains VRP as a special case [7]. Thus, practical large-scale instances cannot be solved by exact methodologies within acceptable computational time [5], and most researchers have focused on metaheuristic approaches [8]–[10]. At present, studies on VRPSDPTW remain scarce because the pickup and time window constraints make VRPSDPTW more difficult. Angelelli and Mansini [11] made a first attempt to solve VRPSPDTW via an exact algorithm which can solve problems with up to 20 customers. Lai and Cao [6] proposed an improved differential evolution for VRPSDPTW and carried out some experiments on their own small size instances (with only 8 and 40 customers). Boubahri et al. [12] proposed a multiagent colonies algorithm for VRPSDPTW, but their method has not been tested on any instance. Wang and Chen [5] proposed a co-evolution genetic algorithm for VRPSPDTW, and developed 65 benchmark instances [13] revised from the well-known Solomon benchmark for VRP with time windows (VRPTW) [14]. Recently, Wang et al. [15] and Kassem and Chen [4] adopted simulated annealing to deal with VRPSPDTW. Deng et al. [16] proposed an improved simulated annealing for VRPSDPTW with soft time windows. In [17], a special VPRSDPTW with four types of demands in home health care logistics was considered and solved by genetic algorithm and tabu search. Among the existing work mentioned, some researches study on single-objective VRPSDPTW. For example, in [4] and [6], the total travel distance is considered as a sole objective. Considering VRPSDPTW with multiple objectives (often two objectives including the number of vehicles and total distance), most previous studies transform it to a single-objective optimization problem, and thus adopt single-objective approaches to solve it and return a single solution. One of the most popular techniques is to use scalar techniques. In [5], [15], and [16], several algorithms are proposed to optimize a weighted linear aggregation function of objectives. However, this kind of techniques needs to set the weights according to the importance of the objectives, which is a difficult task [18]. Due to the constraints and problem structure of VRPSDPTW, the optimization of one objective may lead to the deterioration of other objectives, thus VRPSDPTW is essentially a multiobjective optimization problem (MOP) [18]–[20]. Since the decision maker’s preference is not known a priori, multiobjective formulation is necessary for VRPSDPTW to provide a set of solutions that represent the tradeoffs among the objectives, rather than a single solution [18]–[20]. The feature of multiobjective formulation is to consider all objectives with the same importance and obtain a set of Pareto optimal solutions. To the authors’ best knowledge, no previous work utilizes the multiobjective optimization method for VRPSDPTW, which motivates this paper. This paper first defines a general multiobjective VRPSDPTW (MO-VRPSDPTW) with five objectives commonly used in the VRP literature. These objectives include: number of vehicles, total travel distance, makespan IEEE TRANSACTIONS ON CYBERNETICS [i.e., travel time of the longest route (from/to depot)], total waiting time, and total delay time [20]. Then this paper proposes a set of MO-VRPSDPTW instances based on data from a distribution company in Tenerife, Spain. These MO-VRPSDPTW instances are quite different from the revised Solomon instances in [5]. Finally, two algorithms, called multiobjective local search (MOLS) and multiobjective memetic algorithm (MOMA), are designed, implemented, and compared. The usefulness of the proposed MO-VRPSDPTW formulation and algorithms are demonstrated by solving two sets of benchmark instances. The proposed algorithms can be seen as benchmark algorithms for the real-world MO-VRPSDPTW instances, which can be used for comparison by future research. The contribution of this paper is fourfold: 1) introducing a five-objective variant of VRPSDPTW; 2) introducing a set of realistic benchmark instances; 3) designing and testing multiobjective optimization algorithms to solve the five-objective VRPSDPTW; and 4) performing extensive experiments to evaluate the two proposed algorithms. The remaining sections are organized as follows. Section II presents problem formulation and benchmark instances of MO-VRPSDPTW. Section III proposes two algorithms for MO-VRPSDPTW. Section IV presents experimental results. Finally, Section V presents the conclusion. II. P ROBLEM F ORMULATION AND B ENCHMARK I NSTANCES A. MO-VRPSDPTW Given a number of customers who require both forward supply service and reverse recycling service within a given time window, MO-VRPSDPTW considered in this paper aims to send out a fleet of capacitated vehicles at a distribution center to meet the requests with minimum costs or objectives. Let us introduce the following nomenclature [21]. v = {0, . . . , N} Vertex 0 is called depot and the others are called customers. C Vehicle capacity. Delivery demand of customer i. gi Pickup amount of recycling of pi customer i. Service time of customer i. si [bi , ei ] Time window of customer i, where bi and ei denote the earliest and latest service time of customer i, respectively. Travel distance between cusdij tomers i and j. Travel time between customers tij i and j. rj =< c(1, j), . . . , c(Nj , j)>Sequence of Nj customers served in the jth route, where c(i, j) denotes the ith customer to be visited in the jth route. For computational convenience, let c(0, j) = c(Nj + 1, j) = 0 represents the depot. This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. WANG et al.: MO-VRPSDPTW: FORMULATION, INSTANCES, AND ALGORITHMS Fig. 1. Solution and its representation. (a) Solution. (b) Representation. Note that the depot also has a time window [0, e0 ]. Delivery demand and pickup amount of the depot are g0 = 0 and p0 = 0, respectively. The aim of VRPSDTW is to design a set of M routes (i.e., R = {r1 , . . . , rM }) with the lowest cost such that each vehicle departs from and returns to the same depot and each customer is served by exactly one vehicle. As shown in Fig. 1(a), there are three routes (i.e., M = 3): R = {r1 , r2 , r3 }. r1 = <c(1, 1), c(2, 1)> specifies the sequence of two customers (customers 2 and 7) served in route 1, that is, r1 = <2, 7>. In addition, the remaining two routes have three and two customers served, respectively. The total travel distance of the jth route is defined as Distj = Nj dc(i,j)c(i+1,j) . (1) 3 paper, soft time windows are considered as in [16] and [20]. VRPSDPTW with soft time windows can be viewed as a generalization of VRPSDPTW with hard time windows. It has many practical applications. In many cases, relaxing the time windows may be more appropriate [24], [25] because it may result in lower cost solutions requiring fewer vehicles, shorter travel distance, and less travel time. In VRPSDPTW with soft time windows, a vehicle can arrive late within the maximum allowed time. Arriving outwith the maximum allowed time is not allowed. Let md denote the maximum allowed time that a vehicle can arrive after the end of the time window. The delay time of vehicle j at the ith vertex is 0, if ac(i,j) ≤ ec(i,j) (7) dtc(i,j) = ac(i,j) − ec(i,j) , otherwise and the total delay time of this route is Nj DTj = ac(i,j) = lc(i−1,j) + tc(i−1,j)c(i,j) (2) where lc(0,j) = 0, which indicates that vehicle j departs from the depot at time 0. If a vehicle arrives at a customer before the earliest service time, it will cause waiting time. The waiting time of vehicle j at the ith customer can be described as 0, if ac(i,j) ≥ bc(i,j) (3) wc(i,j) = bc(i,j) − ac(i,j) , otherwise f1 = |R| = M. (4) Hence, the total travel time of route rj is Nj Tj = tc(i,j)c(i+1,j) + wc(i+1,j) + sc(i+1,j) 2) Total travel distance ( f2 ) (5) and the total waiting time of this route is Nj wc(i,j) . f2 = M Distj . (10) j=1 3) Makespan ( f3 ), i.e., travel time of the longest route (from/to depot) f3 = max{Tj | j = 1 . . . M}. j (11) 4) Total waiting time due to early arrivals ( f4 ) f4 = M Wj . (12) j=1 f5 = M DTj . (13) j=1 i=0 Wj = (9) 5) Total delay time due to late arrivals ( f5 ) and the leaving time from the ith customer of vehicle j is lc(i,j) = ac(i,j) + wc(i,j) + sc(i,j) . (8) Based on the introduction above, the objectives for MO-VRPSDPTW can be defined as follows. 1) Number of vehicles ( f1 ) i=0 Let ac(i−1,j) be the arrival time of vehicle j at the (i − 1)th vertex and lc(i−1,j) be the leaving time. Thus, the arrival time of vehicle j at the ith vertex is dtc(i,j) . i=1 (6) i=1 In VRPSDPTW with hard time windows [5], arriving after the latest service time is not allowed. In practice, the hard time windows are often relaxed because travel time is usually stochastic and cannot be precisely predictable [22], [23]. Additionally, a small time window violation is usually not a critical breach of service requirements. Hence, in this Minimization of f1 aims to reduce the fixed costs of buying (or hiring) and maintaining vehicles. Variable costs considered in routing and distribution/collection problems are often estimated by using a function of the total distance traveled (denoted as f2 ). Hence, f1 and f2 can be considered as transportation costs (i.e., economic objectives). f3 can be considered as a social objective. This objective has a twofold contribution toward social sustainability. On the one hand, it is used to minimize the maximum working hours among all drivers, thus enabling a balanced workload to promote equity among the drivers. On the other hand, minimizing the maximum working hours releases drivers to activities such as recycling awareness campaigns or training, which is helpful for improving the career development and promoting versatility among This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 4 IEEE TRANSACTIONS ON CYBERNETICS human resources [3]. Minimization of f4 improves work efficiency and avoids wasting working hours. Finally, f5 can be considered as a service cost related to the satisfaction of customers [22]. Needless to say, customers want to be served just in time and cannot tolerate severe delays. In addition, the constraints for MO-VRPSDPTW can be defined as follows. 1) Vehicle capacity constraint: The total delivery demand of each route rj should not exceed the vehicle capacity, that is Nj gc(i,j) ≤ C ∀j = 1, . . . , M. (14) i=1 When a vehicle arrives at customer c(i, j), its load en route is denoted by Cc(i,j) , then the following constraint should be satisfied: Cc(i,j) − gc(i,j) + pc(i,j) ≤ C ∀i = 1, . . . , Nj ∀j = 1, . . . , M. (15) 2) Travel time constraint: Delay time should not exceed the maximum allowed delay time dtc(i,j) ≤ md ∀i = 1, . . . , Nj , ∀j = 1, . . . , M. (16) 3) Return time constraint: Vehicles should return to the depot before the closing time, that is ac(Nj +1,j) ≤ ec(Nj +1,j) ∀j = 1, . . . , M. (17) Thus, MO-VRPSDPTW with the five objectives considered in this paper can be summarized as min F, where F = { f1 , f2 , f3 , f4 , f5 } (18) is subject to (14)–(17), and minimization of the vector function F is supposed here in the sense of Pareto optimization (see Section III). B. Real-World Instances References [4] and [5] generated VRPSDPTW instances revised from Solomon benchmark instances originally designed for VRPTW [14]. But there are two disadvantages. Firstly, the Solomon dataset relies on Euclidean distance for both travel distance and travel time, and one unit of travel time corresponds to one unit of travel distance. Thus, the distance and time matrices are the same and symmetric. This is hardly a realistic scenario because the travel time is often not directly proportional to the travel distance. Triangle inequality holds for the travel distance and the travel time in the Solomon dataset. Secondly, Solomon dataset is not suitable to conduct a proper multiobjective study because weak dependency relationships are presented among objectives in the dataset. These disadvantages motivate us to propose new instances. Our new instances are based on the real-world multiobjective VRPTW (MO-VRPTW) instances recently proposed by Castro-Gutierrez et al. [20]. The MO-VRPTW instances are based on data from a distribution company in Tenerife, Spain, [26]. These MO-VRPTW instances are quite different from the Solomon instances [14]. On the one hand, the distance and time matrices are distinct and nonsymmetric in these real-world instance, hence, they represent a realistic trade-off between travel distance and travel time. The effects of the asymmetry on realistic VRP are studied in [27]. Moreover, the triangle inequality violations for both travel distance and travel time are prevalent in the real-world MO-VRPTW instances. The effects of the triangle inequality violations on VRP have also been studied in [28]. The triangle inequality violation for travel time has an effect on the algorithm design in this paper because it directly affects the feasibility of time constraints of a modified route. On the other hand, strong dependency relationships are presented among objectives in the real-world MO-VRPTW instances. Thus the real-world MO-VRPTW instances exhibit more realistic multiobjective nature and are suitable to assess the performance of multiobjective optimization algorithms [20], [26]. To generate our real-world MO-VRPSDPTW instances, the MO-VRPTW instances are modified by adding pickup quantity for the customers. The pickup quantity pi for customer i is set to 10, 20, or 30, each with probability 1/3 in the MO-VRPSDPTW instances. The proposed instances can be downloaded from our website [29]. Details will be described in Section IV. Although the modification seems to be minor, a new real-world benchmark dataset for MO-VRPSDPTW is proposed in this paper for the first time. The multiobjective nature of MO-VRPSDPTW can be better investigated and multiobjective optimization algorithms could be tested in a more reasonable manner using this realistic dataset. III. P ROPOSED A LGORITHMS FOR MO-VRPSDPTW A. Overview of Multiobjective Optimization For better understanding our algorithms and empirical studies, this section briefly reviews the basic concepts and algorithms of MOPs. In general, a MOP can be defined as follows: min F(x) = { f1 (x), . . . , fm (x)} (19) subject to x ∈ , where is the solution space. F : → Rm consists of m objective functions that often conflict with each other. Let x, y ∈ , y is said to Pareto dominate x if and only if fi (y) ≤ fi (x) for every i ∈ 1, . . . , m, and fj (y) < fj (x) for at least one j ∈ 1, . . . , m. A solution x∗ is Pareto optimal if there is no solution x ∈ such that x Pareto dominates x∗ . In this case, x∗ is also called a nondominated solution. F(x∗ ) is called a Pareto optimal objective vector. The set of all Pareto optimal solutions is called Pareto set, and the set of all Pareto optimal objective vectors is called Pareto front. The goal of a multiobjective optimization algorithm for a MOP is to seek a set of solutions which perform well in terms of convergence and diversity. That is, the solutions obtained by a multiobjective optimization algorithm should be close to and well-distributed along the Pareto front. Over the years, a number of metaheuristics have been extended to solve MOPs. Multiobjective evolutionary algorithms (MOEAs) strive to obtain an accurate and well-distributed approximation of the true Pareto front. Popular MOEAs include nondominated sorting genetic algorithm II (NSGA-II) [30] and MOEA based on decomposition (MOEA/D) [31]. More studies about the development This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. WANG et al.: MO-VRPSDPTW: FORMULATION, INSTANCES, AND ALGORITHMS and application of MOEAs can be found in [32]–[35]. Besides MOEAs, local search-based algorithms, such as Pareto local search [36] and multidirectional local search [37], are promising alternative approaches to solve MOPs. The merit of local search-based algorithms is that problem-specific knowledge can be directly used to guide the search toward the Pareto front. Thus, they are specially suitable for multiobjective combinatorial optimization problems. More details about local search-based algorithms can be found in [38]. Moreover, problem-specific heuristics for local search and the evolutionary algorithm framework for global search are often combined to maintain a good balance between local search (exploitation) and global search (exploration) in multiobjective optimization. This kind of algorithm is often called memetic algorithm [38]–[40]. In this paper, MOLS is firstly proposed for MO-VRPSDPTW, and then MOMA is proposed. Note that, in response to the particularities of a MOP, different multiobjective optimization algorithms may differ in the encoding scheme (responsible for the characterization of the search space), objective function, and operators that depend on the kind of encoding scheme adopted. As a consequence, the proposed algorithms in this paper are different from the previous studies [31], [32], [37] since our algorithms are composed of dedicated modules, for example, local search operators, to solve MO-VRPSDPTW. B. Solution Representation for MO-VRPSDPTW In the proposed algorithms, the solution representation for MO-VRPSDPTW is based on variable-length solution representation [21]. This representation has been shown in Fig. 1(b). A solution consists of several routes, and each route has a sequence of customers to be served. Note that each route should begin and end with a vertex 0 which denotes the depot. C. MOLS for MO-VRPSDPTW The general framework of MOLS is presented in Algorithm 1. This algorithm framework is successfully used to solve real-world MO-VRPTW and obtains better results than NSGA-II in [21]. It is extended to solve MO-VRPSDPTW in this paper, and expected to also obtain good results. MO-VRPSDPTW generalizes MO-VRPTW and it is conceptually a harder problem. The critical feature of MO-VRPSDPTW is that both pickup and delivery activities should be carried out simultaneously by the same vehicle. Hence, a mechanism that checks the fluctuating load on the vehicle at each customer should be imposed to prevent vehicle overload. The main idea of MOLS is to use different local search procedures, called objectivewise local searches, to optimize different objectives of a given solution in parallel [21], [37], [41]. More specifically, for each objective obj, an objectivewise local search LSobj is defined. LSobj (x) is performed to improve solution x with respect to objective obj. In MOLS, an archive A is initialized by generating several nondominated solutions. Then the main loop consists of: 1) selecting a solution from archive A; 2) using objectivewise local searches to improve each objective of the selected solution; and 3) updating archive 5 Algorithm 1 MOLS 1: archive A = ∅ 2: generate several nondominated solutions to initialize A /*Initialization*/ 3: while running time ≤ maximum computation time do 4: x = randomly select a solution from archive A 5: for obj = 1 to 5 do 6: perform objectivewise local search LSobj (x) /*Objectivewise local searches*/ 7: update archive A using neighbor solutions generated during the objectivewise local search. /*Archive updating scheme*/ 8: end for 9: end while 10: return A A to keep all nondominated solutions found during the search [37]. The main components in MOLS are described in detail as follows. 1) Initialization: A solution is randomly generated as follows. Firstly, we randomly select a customer and create a route. Then, another customer is randomly selected to be inserted into this route. If a customer cannot be inserted into any existing route, a new route is created. This procedure is repeated until all customers are inserted properly. Five (i.e., the number of objectives of MO-VRPSDPTW) solutions are generated by the procedure described above. Then the ith solution (i.e., xi ) is optimized by the objectivewise local search of the ith objective [i.e., LSi (xi )]. As a result, different solutions are improved along different objectives or directions. Finally, the nondominated solutions are stored into archive A. Since different objectives (directions) are selected to improve different solutions, the resultant nondominated solutions in the initial archive are expected to scatter over the objective space to a certain extent. 2) Archive Updating Scheme: Archive A is updated using the resultant solutions during the objectivewise local searches. However, the archive may contain more and more nondominated solutions during the search process since the number of nondominated solutions increases dramatically with objective number m. It is important to control the size of the archive. In MOLS, the concept of -dominance [42], [43] is adopted. The size of the -dominance archive is controlled by a parameter . Each solution in the archive is assigned an identification array B = {B1 , . . . , Bm }, where Bi = log fi /log(1 + ) [42]. The identification array divides the whole objective space into hyper-boxes. Each hyper-box can be occupied by only one solution, thereby it provides two properties: 1) well-distributed solutions can be maintained and 2) the final archive size is bounded [43]. The original -dominance archive often loses extreme solutions [44]. In MOLS, extreme solutions are stored during the search. In 3) Objectivewise Local Searches LSobj (x): MO-VRPSDPTW, different objectives have different physical meanings and characteristics, hence, objectivewise local searches are designed for these objectives as follows. A specific local search, Algorithm 2, is designed to deal with f1 . First, the route which has the fewest customers is selected. Then, we enumerate all customers in the selected route to try to insert them into other possible routes. The customer should be inserted into the first position where the This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 6 Algorithm 2 Local Search LS1 (x) for f1 1: flag = false 2: while !flag do 3: select the route with the fewest customers from solution x 4: enumerate all customers in the selected route to try to insert them into other routes 5: if a customer cannot be inserted into other routes properly then 6: flag = true 7: update A with x 8: break and return x 9: end if 10: if all customers in the selected route are inserted into other routes successfully then 11: one vehicle is reduced, i.e., f1 (x) = f1 (x) − 1 12: continue 13: end if 14: end while insertion does not delay the start time of other customers. If all customers in the selected route are inserted into other routes successfully, one vehicle can be reduced and the local search procedure proceeds to reduce one more vehicle. The local search procedure stops when a customer cannot be inserted into other routes properly. The resultant solution is used to update archive. Local search procedures for f2 , . . . , f5 are described in Algorithm 3. Based on the solution representation, three neighborhood operators, N1 , N2 , and N3 , are introduced for f2 , . . . , f5 , which are presented below. N1 removes a random customer from a selected route, and then reinserts it into the best position among all possible positions of all possible routes. This operator involves a basic function, selectRoute to select a route, and a definition of the best position to insert a customer. Both function and definition are also used in N2 and N3 . Since different objective-wise local searches are used to optimize their corresponding objectives, the function, selectRoute, and the definition of the best position are both based on different objectives. Function selectRoute selects a route with the longest travel time for f3 , and randomly selects a route for f2 , f4 , and f5 . The definitions of the best position to be inserted are described as follows for different objectives. f2 : The position which makes the resultant solution after insertion have the lowest total distance. f3 : The position which makes the resultant solution after insertion have the lowest travel time. f4 : The position which makes the resultant solution after insertion have the lowest total waiting time. f5 : The position which makes the resultant solution after insertion have the lowest total delay time. Given the function and definition above, this neighborhood operator can be described in detail as follows: first, a route is selected using the selectRoute function. Then, a customer, randomly chosen from the selected route, is removed from this route. Finally, the removed customer is reinserted into the best position. The criterion of selecting position to be inserted is according to the corresponding objective. Since the best position is selected for insertion, the best improvement strategy is adopted in this neighborhood operator. N2 firstly removes a random number of customers from a selected route. For example, for a selected route with seven IEEE TRANSACTIONS ON CYBERNETICS Algorithm 3 Local Search LSobj (x) for the objth Objective (obj = 2, ..., 5)/*Objectivewise Local Search*/ 1: depth = 1 2: while depth < MaxDepth do 3: randomly select a neighborhood operator from (N1 , N2 , N3 ) 4: x = generate a neighbor solution of x using the selected neighborhood operator 5: update A with x 6: if fobj (x ) < fobj (x) then 7: x = x 8: end if 9: depth = depth + 1 10: end while 11: return x customers, a random number, rn(1 ≤ rn ≤ 7) is generated, and then rn customers from this route are randomly selected to be removed. These removed customers are finally reinserted into the best position. The definition of the best position are the same as N1 . N3 exchanges a sequence of customers (i.e., subtour) between two routes, preserving the orientation of the sequences. In this operator, a sequence of customers (subtour) includes all customers after a selected customer. It can be handled as a single customer. For example, route 1, r1 = <1, 2, 3, 4, 5>, is selected using the selectRoute function, and a customer, for example customer 2, is randomly chosen from this route. The sequence of customers after customer 2 includes customers 3–5. They can be treated as a single customer S1. Then, we try to insert S1 into the best position. Suppose that the best position is found in the place after customer 6 in route 2, r2 = <6, 7, 8, 9, 10>. The sequence of customers after customer 6 including customers 7–9 is also handled as a single customer S2. Then, S2 is reinserted into the place after customer 2 in route 1 to complete the exchange process. Finally, two new routes, r1 = <1, 2, 7, 8, 9, 10> and r2 = <6, 3, 4, 5>, are obtained. More details can be found in [21]. For a solution x, one of the neighborhood operators (N1 , N2 , and N3 ) is randomly selected to generate a neighbor solution x of x. Then, x will be replaced by x if fobj (x ) is better than fobj (x). Once a neighbor solution x is generated, archive A is updated to avoid missing any nondominated solution during the search process. N1 only makes small changes to the current solution and carries out the search within a restricted part of the solution space, facilitating the algorithm’s convergence. In contrast, N2 and N3 make larger changes to the current solution and guide the search to different areas of the solution space. Thus, MOLS acts as variable neighborhood search [45]. The simple but powerful mixed neighborhood structures and the stochastic elements of MOLS allow effective diversification for escaping from local minima. 4) Feasibility Checking: In this paper, only feasible solutions are considered in all neighborhood operators. Efficient feasibility checking is necessary for time and capacity constraints to reduce computational complexity. For time constraints, a slack Sc(i,j) , which defines the maximum time shift allowed in route j after the ith customer, This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. WANG et al.: MO-VRPSDPTW: FORMULATION, INSTANCES, AND ALGORITHMS can be precomputed as follows [46], [47]: ⎧ ec(i,j) − ac(i,j) , for i = Nj + 1 ⎪ ⎪ ⎨ Sc(i,j) = (20) min(ec(i,j) + md − ac(i,j) ⎪ ⎪ ⎩ wc(i,j) + Sc(i+1,j) ), for i = Nj , . . . , 1. When a customer is inserted between the (i − 1)th and ith customers in the jth route, the insertion is legal if the difference between the arrival time at the ith customer location after and before the insertion is less or equal to Sc(i,j) . Hence, the computational complexity of checking a feasible position is reduced to O(1). Furthermore, only the routes which are changed by neighborhood operators need to be reevaluated. The feasibility checking process for capacity constraint in MO-VRPSDPTW has additional complexity because of the load fluctuation of vehicles. In order to speed up the feasibility checking process, special metrics [48] are used to capture the load fluctuation of vehicles along their routes. Two copies of depot are numbered as 0 and N + 1. Each customer c(k, j) is associated with the following quantities. − − and δc(k,j) indicate the amount of load picked 1) πc(k,j) up and load to deliver, respectively, on board of vehicle j, when vehicle j reaches customer c(k, j) with k = 1, . . . , Nj + 1. + + 2) πc(k,j) and δc(k,j) indicate the amount of load picked up and load to deliver, respectively, on board of vehicle j, when vehicle j leaves customer c(k, j) with k = 0, . . . , Nj . − − − = maxi=1,...,k {πc(i,j) + δc(i,j) } with k = 1, . . . , 3) Mc(k,j) Nj + 1 indicates the maximum total load of vehicle j since its departure from the depot to customer c(k, j). + + + = maxi=k,...,Nj {πc(i,j) + δc(i,j) } with k = 0, . . . , Nj 4) Mc(k,j) c(k, j) indicates the maximum total load of vehicle j from its departure from customer c(k, j) until to the depot. These quantities are precomputed. N1 and N2 both insert a customer into a route. The following two conditions must hold to satisfy the capacity constraint when customer i is inserted between c(k, j) and c(k, j + 1): − + gi ≤ C Mc(k+1,j) + Mc(k,j) + pi ≤ C. (21) N3 exchanges a sequence of customers. An exchange operator can be decomposed into two insert operators. Suppose that the exchange operator involves c(k1 , j1 ) and c(k1 + 1, j1 ) in route j1 , and c(k2 , j2 ) and c(k2 + 1, j2 ) in route j2 , the feasibility checking for capacity constraint is made by checking the following conditions: − − − − δc(k + δc(k ≤C Mc(k 1 +1,j1 ) 1 +1,j1 ) 2 +1,j2 ) + + + Mc(k − πc(k + πc(k ≤C 2 ,j2 ) 2 ,j2 ) 1 ,j1 ) − − − Mc(k − δc(k + δc(k ≤C 2 +1,j2 ) 2 +1,j2 ) 1 +1,j1 ) + + + Mc(k − πc(k + πc(k ≤ C. 1 ,j1 ) 1 ,j1 ) 2 ,j2 ) (22) Hence, the computational complexity of feasibility checking for capacity constraint is also reduced to O(1). During the process of the neighborhood operators, if a removed customer cannot be reinserted into any existing route, 7 then a new route is created for this customer. Since the triangle inequality may not hold for the travel time, once a customer is removed from a route, feasibility checking for time constraints should also be carried out on this route. More specifically, for customers i, j, and k, tik ≥ tij + tjk may happen; a feasibility checking for time constraints [see (16) and (17)] should be implemented on this route once customer j is removed. If the route is infeasible, then it is split into two or more feasible routes. 5) Complexity Analysis: The complexity of local search for f1 consists of reducing vehicles and updating the archive. The worst case for reducing vehicles is to reinsert all customers, which takes O(N 2 ). Thus, the total complexity of local search procedure is max(O(N 2 ), O(m · |A|)) for f1 . The main complexity of the local search procedures for f2 , . . . , f5 consists of generating a neighbor solution and updating the archive. The complexity of generating a neighbor solution is O(N 2 ), and the complexity of updating the archive is O(m · |A|), where |A| is the size of the archive. Hence, the total complexity of local search procedures is max(O(depth · N 2 ), O(depth · m · |A|)) for f2 , . . . , f5 . D. MOMA for MO-VRPSDPTW The MOEA/D framework [31] is chosen for solving the MO-VRPSDPTW defined by (18). The reason is twofold [32], [40]. 1) The research literature shows that MOEA/D seems to be more suitable for tackling multiobjective combinatorial optimization problems because it can directly use problem-specific (local search) techniques to intensify the exploration of promising regions in solution space. MOEA/D provides a very natural framework for using single-objective search techniques. 2) Pareto dominance-based algorithms, for example, NSGA-II, do not always work well on MOPs with many objectives [21]. Decomposition (or scalarizing function)-based fitness evaluation is a promising alternative to Pareto dominance-based fitness evaluation especially for the many-objective problems and multiobjective combinatorial optimization problems [32], [40]. Instead of solving a MOP directly, MOEA/D explicitly decomposes it into Q scalar optimization subproblems. MOEA/D solves these subproblems simultaneously by evolving a population of solutions. At each generation, the population is composed of the best solution found so far for each subproblem. The neighborhood relations among these subproblems are defined based on the distances among their aggregation weight vectors. Each subproblem is optimized by using information from its neighboring subproblems. In this paper, MOEA/D has been adapted, and thus a MOMA is proposed to solve MO-VRPSDPTW. The procedure of MOMA is given in Algorithm 4. Population initialization is the same as the initialization process of MOLS but without local search. At the same time, the same -dominance archive as in MOLS is adopted. The distinct components of the proposed MOMA include the decomposition, crossover operator and local search, which are described in detail as follows. This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 8 IEEE TRANSACTIONS ON CYBERNETICS Algorithm 4 MOMA 1: Archive A = ∅ 2: generate Q uniformly distributed weight vectors 1 , · · · , Q , where i = (λi1 , · · · , λi5 ) /*Decomposition*/ 3: for i = 1 to Q do 4: compute the Euclidean distance between each pair of weight vectors and get the T closest weight vectors to each weight vector. Set the neighborhood B(i) = i1 , . . . , iT . 5: initiate xi 6: end for 7: while stopping criterion is not met do 8: for i = 1 to Q do 9: choose p, q randomly from B(i) 10: o = crossover(xp , xq ) /*Crossover Operator*/ 11: if ∃obj ∈ {1, . . . , 5}, λiobj == 1 then 12: x = LSobj (o) and update archive A /*Objectivewise local searches: Algorithms 2 and 3*/ 13: else 14: x = LSi (o) and update archive A /*Decomposition-based local search: Algorithm 5*/ 15: end if 16: for each j ∈ B(i) do 17: if gws (x |j ) ≤ gws (xj |j ) then 18: xj = x 19: end if 20: end for 21: end for 22: end while 23: return A Algorithm 5 Local Search LSi (x)/*Decomposition-Based Local Search*/ 1: depth = 1 2: while depth < MaxDepth do 3: randomly se lect a neighborhood operator from (N1 , N2 , N3 ) 4: x = generate a neighbor solution of x using the selected neighborhood operator 5: update A with x 6: if gws (x |i ) < gws (x|i ) then 7: x = x 8: end if 9: depth = depth + 1 10: end while 11: return x 1) Decomposition: MOMA decomposes the MO-VRPSDPTW defined by (18) into Q single-objective subproblems using a weighted sum approach, thus the ith subproblem is defined as min g (x | ) = ws i i 5 λik fk i x Fig. 2. (23) k=1 where i = (λi1 , λi2 , . . . , λi5 ) is the weight vector of sub 5 i ws i i problem i(i = 1, . . . , Q), and k=1 λk = 1. g (x | ) is used to emphasize that i is a coefficient vector in (23) while xi is a solution to be optimized. According to the definitions in Section II-A, five objectives are of different scales, and direct use of them will make the algorithm bias toward objectives with larger scales. Therefore, normalization is required. The maximal and minimal values of all feasible solutions found in each objective are used to normalize each objective [21], [40]. 2) Crossover Operator: MOMA generates new solutions using a crossover operator described as follows. First, a random number of routes are selected from the first parent, and copied into the offspring. Then all the routes from the second parent, which are not in conflict with customers already copied from the first parent, are copied into the offspring. Thus, the crossover operator makes offspring inherit routes from parents. Once inherited routes are chosen, they can be regarded as seed routes. All un-routed customers should be inserted into existing routes. If one customer cannot be inserted properly, a new route should be created for this customer. The criterion of the insertion is according to weighted sum of objectives described in (23). An example has been given in Fig. 2 to illustrate the crossover operator for two parents. 3) Local Search: The offspring produced by crossover is further improved by local search described in Algorithm 5. Example of crossover operator. The acceptance rule of local search for multiobjective optimization is very important because it decides which solutions generated in the path of local search should be accepted as the improved ones. The acceptance rule in LSi (x) is to choose the solution with the lower value of the aggregation objective function [see (23)] in the path of local search as the improved solution. This local search can be viewed as decomposition-based local search. The subproblem with λiobj = 1 corresponds to an extreme solution, which focuses on optimizing objective obj. In this case, objectivewise local search is called to optimize the corresponding objective. E. Characteristics of Proposed Algorithms MOLS uses different objectivewise local search procedures to optimize different objectives, which means more problemspecific knowledge can be used to guide the search. On the other hand, the -dominance archive is adopted to maintain the nondominated solutions with convergence and diversity properties. Although MOEA/D provides a very natural framework for using single-objective search techniques, it is crucial to know when and how a single-objective search technique can be used [40]. The proposed MOMA combines the advanced features from both objectivewise and decomposition-based local searches with MOEA/D, and proposes a hybrid MOEA/D for MO-VRPSDPTW. Both MOLS and MOMA are quite simple to implement, and they can be applied to real-world problems effectively. This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. WANG et al.: MO-VRPSDPTW: FORMULATION, INSTANCES, AND ALGORITHMS IV. E XPERIMENTAL R ESULTS In order to assess the performance of the proposed algorithms, experiments are implemented in C on a PC (Pentium 2.70 GHz with 2 GB RAM). The parameter MaxDepth of MOLS is set to 10. For parameter , we find that = 0.05 can obtain satisfactory solutions on all instances. More information about the effect of can be found in [44]. In MOMA, T is set to 5. The settings of Q and 1 , . . . , Q are controlled by a parameter H [31]. More specifically, 1 , . . . , Q are all weight vectors in which each individual weight takes a value from {0, 1/H, . . . , H/H}. Therefore, the number of such vecm−1 5−1 4 = CH+5−1 = CH+4 . For all instances, tors is Q = CH+m−1 H = 8, and therefore Q = 495. More information about the effect of these parameters can be found in [31]. The parameters of local searches and -dominance archive used in MOMA are set as in MOLS. MOMA evolves a population of 495 individuals for 200 generations. Average running time (seconds) of MOMA over 30 runs for each instance is shown in the last column (T) of Tables I and II. Unlike the population-based evolutionary algorithm, there is no explicit concept of generation in MOLS. Thus, the running time of MOMA is set as the stopping criterion for MOLS in our experiments for fair comparison. 9 TABLE I AVERAGE VALUES OF IGD, HV, AND C-M ETRIC OF MOLS (D ENOTED AS L) AND MOMA (D ENOTED AS M) ON O UR R EAL -W ORLD I NSTANCES A. Benchmark Instances The main aim of this paper is to propose new algorithms for real-world MO-VRPSDPTW instances. In order to test the stability of two algorithms across datasets, the algorithms are also tested on the previous Solomon dataset-based instances. Hence, the proposed algorithms are tested on two sets of benchmark instances described as follows. 1) Our Real-World MO-VRPSDPTW Instances [29]: Forty-five real-world instances [20] are created using the combinations of three sizes of customers, three types of vehicle capacities and five time window profiles. The opening time of the depot is 8 h. The time windows are designed to imitate what the delivery company faces every day and can be described by five kinds of profiles, denoted as profiles 0–4. In profile 0, all customers are available all day in 480 min. In profile 1, three types of customers are considered: early customers, midday customers, and late customers. In order to cover the whole day with these customers, time windows are created with a length of 160 min/type of customer (i.e., 480 min/three types of customers). Therefore, early customers, midday customers, and late customers will be served in the time window [0, 160], [160, 320], and [320, 480] min, respectively. In profile 2, the length of each time window is set to 130 min: the opening hours will be [0, 130] for early customers, [175, 305] for midday customers, and [350, 480] for late customers. In profile 3, the length of each time window is set to 100 min. Therefore the time windows will be [0, 100] for early customers, [190, 290] for midday customers, and [380, 480] for late customers. In profile 4, customers are associated with one of the time windows from profiles aforementioned. The capacity of each vehicle can be set to C = D + δ/100(D − D), where δ ∈ [0, 100] is used to modulate the slack margin of the capacity [20], D = maxi {gi } and D= N i=1 gi . If δ takes value close to 0, the capacity of the vehicle C will be very limited. If δ takes value close to 100, the vehicle will have a capacity C close to the total demand. MO-VRPSDPTW instances are created by using the following combinations: 1) number of customers: {50, 150, 250}; 2) three types of δ: {60, 20, 5}; and 3) time windows: profiles {0, 1, 2, 3, 4}. Hence, a total of 45 MO-VRPSDPTW instances (3 ∗ 3 ∗ 5) are generated. As shown in the first column of Table I, the format of the instance name is “num1 -num2 -num3 ,” where num1 indicates the number of customers, num2 indicates the index of δ type, and num3 indicates the index of time window profile. The demand for each customer is set to 10, 20, or 30, each with probability 1/3, the pickup quantity for each customer is also set to 10, 20, or 30, each with probability 1/3, and the service time for each customer This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 10 IEEE TRANSACTIONS ON CYBERNETICS TABLE II AVERAGE VALUES OF IGD, HV, AND C-M ETRIC OF MOLS (D ENOTED AS L) AND MOMA (D ENOTED AS M) ON R EVISED S OLOMON I NSTANCES no unary performance metric is able to give a comprehensive measure on the performance of a multiobjective optimization algorithm. In this paper, the following three metrics are used. 1) Inverted generational distance (IGD) [31] is used to measure both the convergence and diversity of the nondominated solutions obtained. 2) Hypervolume (HV) [50] is used to indicate the area in the objective space that is dominated by at least one of the nondominated solutions. The larger the HV, the closer the corresponding nondominated solutions toward the Pareto front. Therefore, it reflects the closeness of the nondominated solutions to the Pareto front. HV also measures both the convergence and diversity. 3) Coverage metric (C-metric) [51] is a commonly used metric for comparing two sets of nondominated solutions (denoted as X and Y). C(X, Y) is defined as the percentage of the solutions in Y that are Pareto dominated by at least one solution in X. C(X, Y) = 1 means all nondominated solutions in Y are Pareto dominated by solutions in X, and C(Y, X) = 1 means all nondominated solutions in X are Pareto dominated by solutions in Y. The sum of C(X, Y) and C(Y, X) is not always equal to 1, since some solutions in X and Y may not Pareto dominate each other. Higher (lower) value of HV (IGD) can be considered as a better set of solutions approximating the true Pareto front from the convergence and diversity viewpoints. To provide additional information on convergence, C-metric is also used. To compute the performance metrics, the objective vectors of all nondominated solutions obtained by the considered algorithms are approximately seen as the Pareto front because the true Pareto front of the problems is unknown. Furthermore, all objective values are normalized because of the difference among the ranges of objectives. C. Experimental Results is set to 10, 20, or 30 min, also each with probability 1/3. A maximum delay of 30 min is allowed for each customer, that is, md = 30 min [20]. 2) Revised Solomon Instances by Wang and Chen [5]: They are revised from Solomon instances, and can be downloaded in [13]. This dataset includes 56 instances with 100 customers. In this paper, the maximum delay allowed for each customer i is set to 30% of the length of its time windows, that is, mdi = 30% · (ei − bi ) as in [24], since the instances within each problem class have different time window lengths. B. Performance Metrics The performance of an algorithm for MOPs is evaluated in terms of both convergence and diversity. As discussed in [49], A significant feature of the real-world MO-VRPSDPTW dataset is that the triangle inequality may not hold for the travel time. Thus, previous single-objective or multiobjective optimization approaches for the Solomon dataset [4], [5], [52], which assume that the triangle inequality always holds for the travel time, cannot be directly used to solve real-world MO-VRPSDPTW instances. Furthermore, most previous algorithms only consider VRPSDPTW/VRPTW with two objectives [5], [52]; thus, they cannot effectively solve the many-objective VRPSDPTW considered in this paper. Two algorithms proposed in this paper are compared with each other since there is no previous benchmark algorithm. Our algorithms can be seen as benchmark algorithms for real-world MO-VRPSDPTW instances, and can be compared by future research. Average values of the metrics over 30 independent runs of both algorithms on the test sets have been calculated, and are shown in Tables I and II. Due to space limitations, standard deviations of the metrics are not presented in the tables. To further show the difference between two algorithms, the Wilcoxon signed-rank test [53]–[55] at 5% significance level This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. WANG et al.: MO-VRPSDPTW: FORMULATION, INSTANCES, AND ALGORITHMS 11 Fig. 3. Plots of nondominated solutions obtained by MOLS (denoted as ◦) and MOMA (denoted as ×) on the selected instance 250–0–0, respectively. (a) f 1 − f2 plane. (b) f1 − f3 plane. (c) f2 − f3 plane. is conducted. The significantly better values between MOLS and MOMA are highlighted in boldface in tables. The result of the test is summarized as w/t/l, which means that the performance of MOLS is better than, similar to, and worse than that of MOMA in w, t, and l instances, respectively. From Tables I and II, several observations can be found as follows. 1) For 45 real-world instances, MOLS significantly outperforms MOMA in most instances. Specifically, MOLS significantly outperforms MOMA in 28 instances and is outperformed by MOMA in ten instances in terms of IGD. In terms of HV, MOLS significantly outperforms MOMA in 26 instances and is outperformed by MOMA in 12 instances. This means that MOLS can obtain better results than MOMA in terms of both convergence and diversity. MOLS also significantly outperforms MOMA in 38 instances in terms of C-metric. It means that MOLS shows better convergence performance. 2) For 56 revised Solomon instances, it is more obvious that MOLS significantly outperforms MOMA in terms of all metrics. Specifically, MOLS significantly outperforms MOMA in 46 instances and is outperformed by MOMA in only one instance in terms of IGD. In terms of HV, MOLS significantly outperforms MOMA in 47 instances and is outperformed by MOMA in six instances. MOLS significantly outperforms MOMA in 55 instances in terms of C-metric. 3) As shown above, for revised Solomon instances, MOLS significantly outperforms MOMA in almost all instances, while MOLS is significantly outperformed by MOMA in some real-world instances. The superiority of MOLS over MOMA in real-world instances is not so obvious as in revised Solomon instances. Thus, future researchers should test their algorithms on the proposed real-world MO-VRPSDPTW instances to prove usefulness in real life environment for their algorithms. To visually demonstrate the convergence and diversity properties of two algorithms, the projection of nondominated solutions obtained by MOLS (denoted as ◦) and MOMA (denoted as ×) in a selected instance 250–0–0, at f1 − f2 , f1 − f3 , and f2 − f3 planes are shown in Fig. 3. It is clearly shown that most objective vectors generated by MOLS are better than those generated by MOMA. Moreover, the distribution of solutions obtained by MOLS spreads much wider along the obtained Pareto front. Hence, the fact that MOLS is better than MOMA in this instance in terms of both convergence and diversity is confirmed. Finally, the relative benefits and limitations of the proposed algorithms are briefly discussed. In both MOLS and MOMA, single-objective local search can be easily adopted for MO-VRPSDPTW. In MOMA, diversity is naturally preserved by the diversity among subproblems, but some different subproblems may have the same optimal solution for a combinatorial optimization problem [31]. Therefore, even a large number of subproblems may not lead to a reasonably good approximation to the Pareto front of the combinatorial optimization problem. In MOLS, no subproblem (weight) needs to be defined in advance. Instead, objectivewise local search is adopted to optimize each objective, which promotes the convergence and diversity of search. This may be the reason that MOLS is better than MOMA in terms of both convergence and diversity in most of instances. V. C ONCLUSION This paper has introduced a multiobjective variant of VRPSDPTW and a set of realistic benchmark instances. Then two algorithms have been designed for the MO-VRPSDPTW. Extensive experiments have shown the effectiveness of the proposed algorithms. The proposed algorithms can be seen as benchmark algorithms for real-world MO-VRPSDPTW instances, which can be used for comparison by future research. In the future, this paper can be extended in multiple directions. Firstly, the proposed MO-VRPSDPTW model can be extended to other green VRPs, for example, pollution VRP for reducing CO2 or greenhouse gas emissions, by including broader objectives that reflect environmental cost [2], [3]. Secondly, the proposed algorithms can also be extended to solve other multiobjective VRPs in reverse logistics, including VRP with backhauls and VRP with mixed pickup and delivery problems. Finally, from the perspective of fundamental research, advanced multiobjective optimization algorithms for many-objective combinatorial optimization problems should be further studied and developed since existing multiobjective optimization algorithms mainly focus on continuous benchmark functions. This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. 12 IEEE TRANSACTIONS ON CYBERNETICS R EFERENCES [1] C. Lin, K. Choy, G. Ho, S. Chung, and H. Lam, “Survey of green vehicle routing problem: Past and future trends,” Expert Syst. Appl., vol. 41, no. 4, pp. 1118–1138, 2014. [2] K. Devika, A. Jafarian, and V. Nourbakhsh, “Designing a sustainable closed-loop supply chain network based on triple bottom line approach: A comparison of metaheuristics hybridization techniques,” Eur. J. Oper. Res., vol. 235, no. 3, pp. 594–615, 2014. [3] T. R. P. Ramos, M. I. Gomes, and A. P. Barbosa-Povoa, “Planning a sustainable reverse logistics system: Balancing costs with environmental and social concerns,” Omega, vol. 48, pp. 60–74, Oct. 2014. [4] S. Kassem and M. Chen, “Solving reverse logistics vehicle routing problems with time windows,” Int. J. Adv. Manuf. Technol., vol. 68, nos. 1–4, pp. 57–68, 2013. [5] H.-F. Wang and Y.-Y. Chen, “A genetic algorithm for the simultaneous delivery and pickup problems with time window,” Comput. Ind. Eng., vol. 62, no. 1, pp. 84–95, Feb. 2012. [6] M. Lai and E. Cao, “An improved differential evolution algorithm for vehicle routing problem with simultaneous pickups and deliveries and time windows,” Eng. Appl. Artif. Intell., vol. 23, no. 2, pp. 188–195, 2010. [7] B. Eksioglu, A. V. Vural, and A. Reisman, “The vehicle routing problem: A taxonomic review,” Comput. Ind. Eng., vol. 57, no. 4, pp. 1472–1483, 2009. [8] S.-C. Horng, “Combining artificial bee colony with ordinal optimization for stochastic economic lot scheduling problem,” IEEE Trans. Syst., Man, Cybern., Syst., vol. 45, no. 3, pp. 373–384, Mar. 2015. [9] Y. Jin and J.-K. Hao, “Effective learning-based hybrid search for bandwidth coloring,” IEEE Trans. Syst., Man, Cybern., Syst., to be published. [10] D. Li, M. Li, X. Meng, and Y. Tian, “A hyperheuristic approach for intercell scheduling with single processing machines and batch processing machines,” IEEE Trans. Syst., Man, Cybern., Syst., vol. 45, no. 2, pp. 315–325, Feb. 2015. [11] E. Angelelli and R. Mansini, “The vehicle routing problem with time windows and simultaneous pick-up and delivery,” in Quantitative Approaches to Distribution Logistics and Supply Chain Management, A. Klose, M. Speranza, and L. V. Wassenhove, Eds. Berlin, Germany: Springer, 2002, pp. 249–267. [12] L. Boubahri, S.-A. Addouche, and A. E. Mhamedi, “Multi-ant colonies algorithms for the VRPSPDTW,” in Proc. Int. Conf. Commun. Comput. Control Appl., Hammamet, Tunisia, 2011, pp. 1–6. [13] H.-F. Wang and Y.-Y. Chen. (2014). VRPSDPTW Instances Revised From Solomon Dataset. [Online]. Available: http://oz.nthu.edu.tw/~d933810/test.htm [14] M. M. Solomon, “Algorithms for the vehicle routing and scheduling problems with time window constraints,” Oper. Res., vol. 35, no. 2, pp. 254–265, 1987. [15] C. Wang, F. Zhao, D. Mu, and J. W. Sutherland, “Simulated annealing for a vehicle routing problem with simultaneous pickup-delivery and time windows,” in Advances in Production Management Systems. Sustainable Production and Service Supply Chains. Berlin, Germany: Springer, 2013, pp. 170–177. [16] A. Deng, C. Mao, and Y. Zhou, “Optimizing research of an improved simulated annealing algorithm to soft time windows vehicle routing problem with pick-up and delivery,” Syst. Eng. Theory Pract., vol. 29, no. 5, pp. 186–192, 2009. [17] R. Liu, X. Xie, V. Augusto, and C. Rodriguez, “Heuristic algorithms for a vehicle routing problem with simultaneous delivery and pickup and time windows in home health care,” Eur. J. Oper. Res., vol. 230, no. 3, pp. 475–486, Nov. 2013. [18] N. Jozefowiez, F. Semet, and E.-G. Talbi, “Multi-objective vehicle routing problems,” Eur. J. Oper. Res., vol. 189, no. 2, pp. 293–309, 2008. [19] N. Labadie and C. Prodhon, “A survey on multi-criteria analysis in logistics: Focus on vehicle routing problems,” in Applications of MultiCriteria and Game Theory Approaches, L. Benyoucef, J.-C. Hennet, and M. K. Tiwari, Eds. London, U.K.: Springer, 2014, pp. 3–29. [20] J. Castro-Gutierrez, D. Landa-Silva, and J. M. P´erez, “Nature of realworld multi-objective vehicle routing with evolutionary algorithms,” in Proc. IEEE Int. Conf. Syst. Man Cybern. (SMC), Anchorage, AK, USA, 2011, pp. 257–264. [21] Y. Zhou and J. Wang, “A local search-based multiobjective optimization algorithm for multiobjective vehicle routing problem with time windows,” IEEE Syst. J., to be published. [22] D. Tas, N. Dellaert, T. van Woensel, and T. de Kok, “Vehicle routing problem with stochastic travel times including soft time windows and service costs,” Comput. Oper. Res., vol. 40, no. 1, pp. 214–224, 2013. [23] H. Hashimotoa, T. Ibaraki, S. Imahori, and M. Yagiura, “The vehicle routing problem with flexible time windows and traveling times,” Discrete Appl. Math., vol. 154, no. 16, pp. 2271–2290, 2006. [24] R. Eglese, Z. Fu, and L. Y. O. Li, “A unified tabu search algorithm for vehicle routing problems with soft time windows,” J. Oper. Res. Soc., vol. 59, pp. 663–673, Feb. 2008. [25] W. C. Chiang and R. A. Russell, “A metaheuristic for the vehiclerouting problem with soft time windows,” J. Oper. Res. Soc., vol. 55, pp. 1298–1310, Jul. 2004. [26] J. Castro-Gutierrez, “Multi-objective tools for the vehicle routing problem with time windows,” Ph.D. dissertation, School Comput. Sci., Univ. Nottingham, Nottingham, U.K., 2012. [Online]. Available: http://etheses.nottingham.ac.uk/3713/ [27] A. Rodriguez and R. Ruiz, “A study on the effect of asymmetry on real capacitated vehicle routing problems,” Comput. Oper. Res., vol. 39, no. 7, pp. 2142–2151, 2012. [28] C. L. Fleming, S. E. Griffis, and J. E. Bell, “The effects of triangle inequality on the vehicle routing problem,” Eur. J. Oper. Res., vol. 224, no. 1, pp. 1–7, 2013. [29] J. Wang. (2014). MO-VRPTW and MO-VRPSDPTW Datasets. [Online]. Available: http://sist.sysu.edu.cn/~wangjiah/ [30] K. Deb, A. Pratap, S. Agarwal, and T. Meyarivan, “A fast and elitist multiobjective genetic algorithm: NSGA-II,” IEEE Trans. Evol. Comput., vol. 6, no. 2, pp. 182–197, Apr. 2002. [31] Q. Zhang and H. Li, “MOEA/D: A multiobjective evolutionary algorithm based on decomposition,” IEEE Trans. Evol. Comput., vol. 11, no. 6, pp. 712–731, Dec. 2007. [32] A. Zhou et al., “Multiobjective evolutionary algorithm: A survey of the state of the art,” Swarm Evol. Comput., vol. 1, no. 1, pp. 32–49, 2011. [33] K. Nag, T. Pal, and N. Pal, “ASMiGA: An archive-based steadystate micro genetic algorithm,” IEEE Trans. Cybern., vol. 45, no. 1, pp. 40–52, Jan. 2015. [34] M. Li, S. Yang, K. Li, and X. Liu, “Evolutionary algorithms with segment-based search for multiobjective optimization problems,” IEEE Trans. Cybern., vol. 44, no. 8, pp. 1295–1313, Aug. 2014. [35] Z.-H. Zhan et al., “Multiple populations for multiple objectives: A coevolutionary technique for solving multiobjective optimization problems,” IEEE Trans. Cybern., vol. 43, no. 2, pp. 445–463, Apr. 2013. [36] L. Paquete, T. Schiavinotto, and T. St¨utzle, “On local optima in multiobjective combinatorial optimization problems,” Ann. Oper. Res., vol. 156, no. 1, pp. 83–97, 2007. [37] F. Tricoire, “Multi-directional local search,” Comput. Oper. Res., vol. 39, no. 12, pp. 3089–3101, 2012. [38] E.-G. Talbi, M. Basseur, A. Nebro, and E. Alba, “Multi-objective optimization using metaheuristics: Non-standard algorithms,” Int. Trans. Oper. Res., vol. 19, nos. 1–2, pp. 283–305, 2012. [39] L. Ke, Q. Zhang, and R. Battiti, “Hybridization of decomposition and local search for multiobjective optimization,” IEEE Trans. Cybern., vol. 44, no. 10, pp. 1808–1820, Oct. 2014. [40] J. Wang and Y. Cai, “Multiobjective evolutionary algorithm for frequency assignment problem in satellite communications,” Soft Comput., to be published. [41] J. Wang, C. Zhong, Y. Zhou, and Y. Zhou, “Multiobjective optimization algorithm with objective-wise learning for continuous multiobjective problems,” J. Amb. Intell. Human. Comput., to be published. [42] M. Laumanns, L. Thiele, K. Deb, and E. Zitzler, “Combining convergence and diversity in evolutionary multiobjective optimization,” Evol. Comput., vol. 10, no. 3, pp. 263–282, 2002. [43] K. Deb, M. Mohan, and S. Mishra, “Evaluating the -domination based multi-objective evolutionary algorithm for a quick computation of Pareto-optimal solutions,” Evol. Comput., vol. 13, no. 4, pp. 501–525, 2005. [44] S. Bandyopadhyay, U. Maulik, and R. Chakraborty, “Incorporating -dominance in AMOSA: Application to multiobjective 0/1 knapsack problem and clustering gene expression data,” Appl. Soft Comput., vol. 13, no. 5, pp. 2405–2411, 2013. [45] P. Hansen and N. Mladenovic, “Variable neighborhood search: Principles and applications,” Eur. J. Oper. Res., vol. 130, no. 3, pp. 449–467, 2001. [46] G. A. P. Kindervater and M. W. P. Savelsbergh, “Vehicle routing: Handling edge exchanges,” in Local Search in Combinatorial Optimization, E. H. L. Aarts and J. K. Lenstra, Eds. Chichester, U.K.: Wiley, 1997, pp. 337–360. This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. WANG et al.: MO-VRPSDPTW: FORMULATION, INSTANCES, AND ALGORITHMS [47] N. Labadie and C. Prins, “Vehicle routing nowadays: Compact review and emerging problems,” in Production Systems and Supply Chain Management in Emerging Countries: Best Practices, G. Mej´ıa and N. Velasco, Eds. Berlin, Germany: Springer, 2012, pp. 141–166. [48] N. Bianchessi and G. Righini, “Heuristic algorithms for the vehicle routing problem with simultaneous pick-up and delivery,” Comput. Oper. Res., vol. 34, no. 2, pp. 578–594, 2007. [49] E. Zitzler, L. Thiele, M. Laumanns, C. M. Fonseca, and V. G. da Fonseca, “Performance assessment of multiobjective optimizers: An analysis and review,” IEEE Trans. Evol. Comput., vol. 7, no. 2, pp. 117–132, Apr. 2003. [50] M. Laumanns, E. Zitzler, and L. Thiele, “SPEA2: Improving the strength Pareto evolutionary algorithm,” in Evolutionary Methods for Design, Optimization and Control With Applications to Industrial Problems, K. Giannakoglou, D. T. Tsahalis, J. Periaux, K. D. Papailiou, and T. Fogarty, Eds. Barcelona, Spain: CIMNE, 2002, pp. 95–100. [51] E. Zitzler and L. Thiele, “Multi-objective evolutionary algorithms: A comparative study and the strength Pareto approach,” IEEE Trans. Evol. Comput., vol. 3, no. 4, pp. 257–271, Nov. 1999. [52] T.-C. Chiang and W.-H. Hsu, “A knowledge-based evolutionary algorithm for the multiobjective vehicle routing problem with time windows,” Comput. Oper. Res., vol. 45, no. 5, pp. 25–37, 2014. [53] F. Wilcoxon, “Individual comparisons by ranking methods,” Biometrics, vol. 1, no. 6, pp. 80–83, 1945. [54] J. Derrac, S. García, D. Molina, and F. Herrera, “A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms,” Swarm Evol. Comput., vol. 1, no. 1, pp. 3–18, 2011. [55] J. Alcalá-Fdez et al., “KEEL: A software tool to assess evolutionary algorithms to data mining problems,” Soft Comput., vol. 13, no. 3, pp. 307–318, 2008. Jiahai Wang (M’07) received the Ph.D. degree from Toyama University, Toyama, Japan, in 2005. In 2005, he joined Sun Yat-sen University, Guangzhou, China, where he is currently an Associate Professor with the Department of Computer Science. His current research interests include computational intelligence and its applications. Ying Zhou received the Ph.D. degree from Sun Yat-sen University, Guangzhou, China, in 2014. In 2014, she joined the Shenzhen Institute of Information Technology, Shenzhen, China. Her current research interests include local search algorithms and their applications, multiobjective optimization, and other evolutionary computation techniques. Yong Wang (M’08) received the B.S. degree in automation from the Wuhan Institute of Technology, Wuhan, China, in 2003, and the M.S. degree in pattern recognition and intelligent systems and Ph.D. degree in control science and engineering, both from Central South University (CSU), Changsha, China, in 2006 and 2011, respectively. He is currently an Associate Professor with the School of Information Science and Engineering, CSU. His current research interests include evolutionary computation, single-objective optimization, constrained optimization, multiobjective optimization, and their real-world applications. He was a Reviewer for over 40 international journals. Dr. Wang was a recipient of the Hong Kong Scholar from the Mainland—Hong Kong Joint Post-Doctoral Fellows Program, China, in 2013, the Excellent Doctoral Dissertation by Hunan Province, China, in 2013, the New Century Excellent Talents in University by the Ministry of Education, China, in 2013, and the 2015 IEEE Computational Intelligence Society Outstanding Ph.D. Dissertation Award. He was a PC Member of over 20 international conferences. 13 Jun Zhang (M’02–SM’08) received the Ph.D. degree in electrical engineering from the City University of Hong Kong, Hong Kong, in 2002. From 2003 to 2004, he was a Brain Korean 21 Post-Doctoral Fellow at the Department of Electrical Engineering and Computer Science, Korea Advanced Institute of Science and Technology, Daejeon, Korea. Since 2004, he has been at Sun Yat-sen University, Guangzhou, China, where he is currently a Cheung Kong Professor. His current research interests include computational intelligence, cloud computing, big data, high performance computing, data mining, wireless sensor networks, operations research, and power electronic circuits. He has authored seven research books and book chapters and over 100 technical papers in the above areas. Mr. Zhang was a recipient of the China National Funds for Distinguished Young Scientists from the National Natural Science Foundation of China, in 2011 and the First-Grade Award in Natural Science Research from the Ministry of Education, China, in 2009. He is currently an Associate Editor of the IEEE T RANSACTIONS ON E VOLUTIONARY C OMPUTATION, the IEEE T RANSACTIONS ON I NDUSTRIAL E LECTRONICS, and the IEEE T RANSACTIONS ON C YBERNETICS. He is the Founding and the Current Chair of the IEEE Guangzhou Subsection and ACM Guangzhou Chapter. C. L. Philip Chen (S’88–M’88–SM’94–F’07) received the M.S. degree from the University of Michigan, Ann Arbor, MI, USA, and the Ph.D. degree from Purdue University, West Lafayette, IN, USA, in 1985 and 1988, respectively, both in electrical engineering. He was a tenured professor, a Department Head, and an Associate Dean with two different universities in the U.S. for 23 years. He is currently the Dean of the Faculty of Science and Technology and a Chair Professor with the Department of Computer and Information Science, University of Macau, Macau, China. His current research interests include systems, cybernetics, and computational intelligence. Prof. Chen was the President of the IEEE Systems, Man, and Cybernetics Society from 2012 to 2013. He has been an Editor-in-Chief of the IEEE T RANSACTIONS ON S YSTEMS , M AN , AND C YBERNETICS : S YSTEMS since 2014 and an associate editor of several IEEE transactions. He is the Chair of Technical Committee 9.1 Economic and Business Systems of International Federation of Automatic Control. He is a Program Evaluator for the Accreditation Board of Engineering and Technology Education in computer engineering, electrical engineering, and software engineering programs, USA. He is a fellow of the American Association for the Advancement of Science. Zibin Zheng (S’05–M’11) received the Ph.D. degree from the Department of Computer Science and Engineering, Chinese University of Hong Kong (CUHK), Hong Kong, in 2010. He is an Associate Research Fellow with the Shenzhen Research Institute, Chinese University of Hong Kong. His current research interests include cloud computing, service computing, and software engineering. Mr. Zheng was a recipient of the Outstanding Thesis Award of CUHK in 2012, the ACM Special Interest Group on Software Engineering Distinguished Paper Award in International Conference on Software Engineering 2010, the IBM Ph.D. Fellowship Award in 2010. He served as a Program Committee Member of more than 20 conferences.
© Copyright 2025