Application of GATS, a hybrid mega-heuristic model, and DOE, to solve flexible flow shop scheduling problems: a case study

Phong Nguyen Nhu* and Thu Trang Nguyen Le

*Correspondence:
Phong Nguyen Nhu,
nnphong@hcmut.edu.vn

Received: 17 July 2023; Accepted: 26 July 2023; Published: 14 August 2023.

Flexible flow shop scheduling (FFSS) is an NP-hard combinatorial optimization problem. Solving this problem using mathematical modeling approaches is very difficult. Mega-heuristic algorithms, such as the genetic algorithm (GA) and tabu search (TS), are powerful tools for finding near-optimal solutions to problems of this type. This paper develops a GATS model by combining GA and TS for solving FFSS problems. In the model, GA is used as the platform for global search, and TS is used to support GA in local search. This paper also uses the design of experiments (DOE) to optimize the parameters of the GATS model. The performance of the models, GATS and GATS with DOE, is compared with traditional heuristics being used. The result indicates that the models are good approaches for FFSS problems.

Keywords: genetic algorithm, tabu search, design of experiment, flexible flow shop scheduling, mathematical modeling, parallel machine scheduling, setup time, batch production, hybrid mega heuristic model

Introduction

Scheduling problems define the sequence in which a group of orders is efficiently processed through several machines. Flow shop scheduling (FSS) problems consider m machines and n orders, and all orders are processed in the same sequence. Parallel machine scheduling (PMS) problems involve sending n jobs to m parallel machines. An order can be processed on any machine. Machines can be homogeneous or heterogeneous. Machines are homogenous when they have the same processing time for each job. Machines are heterogeneous when the times to process the same job on different machines are different.

Flow shop scheduling problems are problems that combine FFS problems and PMS problems. So, FFSS problems are more complicated than FFS problems and PMS problems. FFSS problems include dispatching n jobs across w stations; each station has several parallel machines. Each job is executed sequentially through the stations in the same sequence, and only one machine at each station is selected for execution.

The problem to be solved is an FFSS problem, with the assumption that the orders are ready at the start of the scheduling process. The objective of the problem is to minimize the total tardiness of orders. The constraints are on the sequence of orders, on the sequence of operations in each order, on the allocation of orders on parallel machines in each station, on machine changeover time, and on batches produced in each machine.

The model of the problem is built based on the above assumptions, objectives, and constraints. The GATS algorithm is a combination of GA and TS, with the foundation of GA. Based on the model of the problem, the GATS algorithm will find a good solution for the problem. Design of experiment (DOE) is used to optimize the parameters of the GATS model. The solutions of GATS and GATS with DOE are compared with the solutions of the traditional heuristics being used to evaluate their effectiveness.

Literature review

Flexible flow shop scheduling

Flow shop scheduling is renowned and classified as an NP-hard optimization problem (Berlinska and Przybylski, 2021), and several algorithms have been developed to overcome this problem (1, Lee and Loong, 2019). Palmer et al. and Gupta et al. proposed heuristic methods to solve n-job problems on m-machines (2). Chen et al. applied GA to these problems. However, the problem solving time of GAs is quite long.

The PMS problem has two distinct decisions: allocation and sequencing. Allocation is a decision concerning the assignment of jobs to machines, while sequencing is to order the jobs assigned to each machine (3). When the number of machines is large, mega-heuristic algorithms such as GA and TS are usually used instead of optimal methods to solve the problem. To reduce the time to solve the problem, one solution is to coordinate between GA and TS (where GA is used for global search and TS is used for local search).

Genetic algorithm

Genetic algorithm, first introduced by Holland in 1975, is a powerful search engine to solve optimization problems using evolution and natural selection of selected individuals called chromosomes. Feasible solutions to the problem are modeled by chromosomes. The population of chromosomes represents the solution space of the problem. Chromosomes are represented by strings. A method of coding is used to define the string format for the chromosomes.

Each chromosome has a corresponding fitness value. The fitness function is a measure of the extent to which the objective of the problem is achieved. The fitness function is derived from the objective function of the problem.

An initial population is made up of several chromosomes. The next population is created by applying genetic operators to the chromosomes of the current generation. Genetic operators include selection, crossover, mutation, and replacement. The algorithm creates a new population from the existing population. The next population is probably better than the current population. This process is repeated until a specified stopping condition is met.

Tabu search

Tabu search, introduced by Glover and Laguna in 1997, is also an effective search engine to solve optimization problems. Feasible solutions to the problem are modeled by strings. The population of strings represents the solution space of the problem. Strings are represented by codes. A method of coding is used to define the string format for the codes.

Each string has a corresponding evaluation value. The evaluation function is a measure of the extent to which the objective of the problem is achieved. The evaluation function is derived from the objective function of the problem. The F-best value is the best evaluation value found during the search.

Tabu search explores the solution space beyond local optimality. The properties of the search path are memorized, and choices are made to guide the search out of the local optimum and into different regions. Some moves to previously explored areas are prohibited to avoid local optimization. Recent moves are taboo and stored in the tabu list. The tabu list identifies all moves that do not apply to the current solution. The size of the tabu list is an algorithm parameter that needs to be determined.

An initial solution is identified. The neighborhood operator will determine the neighborhood region for the current solution. The next solution, which may not be as good as the current solution, is determined by applying the selection operator to select the best solution in the current neighborhood region. The selected solution must not violate the tabu list. The process is repeated until a specified stopping condition is met.

The flexible flow shop scheduling problem

The problem to be solved is an FFSS problem with nine orders, Oi, i = 1 ÷ 9, and scheduling on four stations, Sj, j = 1 ÷ 4. Each order consists of many products, manufactured in batches at each machine in the stations. Each station has three homogeneous machines. The due time Di, the processing time Pij, and the batch time Bij for order i, i = 1 ÷ 9, on station j, j = 1 ÷ 4, are estimated in Table 1.

TABLE 1
www.bohrpub.com

Table 1. Input data for model.

The setup times of the machines in station j are estimated in Table 2.

TABLE 2
www.bohrpub.com

Table 2. Setup times of machine stations.

The model is set up with variables. TSijk being the time to start order Oi, i = 1 ÷ 9 on station Sj, j = 1 ÷ 4, machine k, k = 1 ÷ 3; Xijk being the binary variable, whether order Oi, i = 1 ÷ 9 will process on station Sj, j = 1 ÷ 4, machine k, k = 1 ÷ 3 or not; TEijk being the time to end order Oi, i = 1 ÷ 9 on station Sj, j = 1 ÷ 4, machine k, k = 1 ÷ 3; Ti being the tardiness time of job i. The model of the problem is as follows:

T min = MinT , T = Σ ( T i , i = 1 ÷ 9 )

St.

– TEijk = TSijk + (Sj + Pij) * Xijk, i = 1 ÷ 9, j = 1 ÷ 4, k = 1 ÷ 3

– Ti = max (TEi4k – Di, 0), i = 1 ÷ 9, k = 1 ÷ 3

– ΣkXijk = 1, ΣiXijk = 1, i = 1 ÷ 9, j = 1 ÷ 4, k = 1 ÷ 3

– TSijk ≥ Max (TSi(j–1)k + Bi(j–1), TE(i–1)jk), i = 1 ÷ 9, j = 1 ÷ 4, k = 1 ÷ 3

– TSij1 = max(0, min(TE(i–1),j1)), i = 1 ÷ 9, j = 1 ÷ 4

– TSijk = max(min(TE(i–1)jk), TSij(k–1) + Bij), i = 1 ÷ 9, j = 1 ÷ 4, k = 2 ÷ 3

– TSijk, TEijk ≥ 0; Xijk = {0,1}, i = 1 ÷ 9, j = 1 ÷ 4, k = 1÷3

The company is currently using the LPT dispatching method. The objective value of T is 46.93 (h).

The GATS model for the Flexible flow shop scheduling problem

The above FFSS problem is a non-linear problem with a solution space of 4*9!, or 1,451,520. The GATS model is used to solve the problem. In the model, GA is used to perform a global search of the solution space, and TS is used to perform a local search to refine the solution found by GA. The GATS procedure is as follows:

Step 1: Initialize the GATS model.

Step 2: Generate the initial population, P(0). Set k to 0.

Step 3: Generate elite population PE(k).

Step 4: Generate the genetic population PG(k).

Step 5: Generate the neighborhood population PN(k).

Step 6: Generate the next population P(k+1). Set k = k + 1.

Step 7: Check the termination rule. If not, return to Step 3. If yes, finish the loop.

Step 8: Run the algorithm a number of times to choose the best scheduling result.

Step 1: Initialize the GATS model.

This step sets up factors in GATS models, including the method of coding, the GA parameters, the TS parameters, and the termination rule.

The method of coding: Each chromosome C is a string of four sub-chromosomes, Sj, j = 1 ÷ 4, corresponding to four stations. The orders are numbered from 1 to 9. Each sub-chromosome is a string of nine genes, Gi, i = 1 ÷ 9, corresponding to nine orders. The sequence of genes in a sub-chromosome represents the sequence of orders scheduled in the corresponding station:

C = [ S , 1 S , 2 S , 3 S ] 4 ; S = j
( G , j 1 G , j 2 G , j 3 G , j 4 G , j 5 G , j 6 G , j 7 G , j 8 G ) j 9 , j = 1 ÷ 4

The GA parameters include fitness function, population size, and the parameters of GA operators, including selection, crossover, mutation, and replacement operators. The fitness function F is defined as Fi = Tmax – Ti, where Fi and Ti are the fitness and objective values of chromosome i and Tmax is the maximum objective value in the population. The crossover method is POX (precedence operation crossover), the mutation method is REM (reciprocal exchange mutation), and the replacement method is acceptance threshold. The model parameters P, Pc, Pm, and K are chosen as follows:

P = 9 , P = c 0.8 ; P = m 0.2 , K = 2

The TS parameters include the neighborhood operator and the tabu list. The neighborhood operator uses the SWAP method to find the neighborhood chromosomes of a chromosome. The tabu list contains the chromosomes found in the previous iterations. At the end of each iteration, chromosomes moving into the next population must not be on the tabu list.

The termination rule: The best objective value of the population Tmin does not improve or decrease after 10 consecutive iterations.

Step 2: Generate the initial population P(0) and set k = 0.

The initial population consists of nine chromosomes. There are four chromosomes generated from four heuristic rules: EDD, ERD, SPT, and LPT. The remaining chromosomes R1, …, R5 are randomly generated. The chromosomes in the initial population, arranged in descending order of their objective values with their corresponding objective and fitness values, are shown in Table 3.

P = ( 0 ) { EDD , ERD , R2 , SPT , R1 , R4 , R3 , R5 , LPT }
TABLE 3
www.bohrpub.com

Table 3. The initial population P(0).

This step also initializes the tabu list, TL, containing the chromosomes in P(0).

TL = { EDD , ERD , R2 , SPT , R1 , R4 , R3 , R5 , LPT }

Step 3: Generate an elite population PE(k).

This step uses the selection operator to generate the elite population PE(k) from P(k). Each chromosome in the current population has a corresponding fitness value Fi and is selected for inclusion in the elite population PE(k), with the selection probability Pi determined as follows:

P i = F i / Σ i = 1 ÷ 9 ( F i )

The selection probabilities Pi are calculated as shown in Table 3. Random numbers are generated nine times based on Pi; the chromosomes in P(0) selected into the population PE(0) are as follows:

P E ( 0 ) = { R2 , ERD , SPT , R3 , R3 , SPT , ERD , R4 , EDD }

Step 4: Generate the genetic population PG(k).

This step uses the crossover and mutation operators to generate the genetic population PG(k) from the elite population PE(k). The genetic population PG(k) includes the new chromosome generated from the crossover and mutation operators. The chromosomes of PE(0) are selected to be included in the crossover list Pc with a crossover probability of 0.8. After generating nine random numbers, the set Pc is determined as follows:

P c = { EDD , ERD , SPT , R3 } .

Each pair of chromosomes in Pc is selected to cross over by the POX method, resulting in two new chromosomes in population PC. R2 and ERD are crossed over to each other and generate two children, C1 and C2. SPT and R4 are crossed over to each other and generate two children, C3 and C4. The chromosomes in PC and their objective values are shown in Table 4.

P C = { C1 , C2 , C3 , C4 }

The chromosomes of PE(0) are also selected to be included in the mutation list Pm, with a mutation probability of 0.2. After generating nine random numbers, the set Pm is determined as follows:

P m = { R2 , SPT } .

Each chromosome in Pm is selected to mutate by the REM method, resulting in one new chromosome in population PM. R3 is mutated and generates M1. SPT is mutated and generates M2. The chromosomes in PM and their objective values are shown in Table 5:

P M = { M1 , M2 }
TABLE 5
www.bohrpub.com

Table 5. The mutation population PM.

After crossover and mutation, six new chromosomes are created in the population PG(0). Their chromosomes and objective values are shown in Table 5:

P G ( 0 ) = { C1 , C2 , C3 , C4 , M1 , M2 }

Step 5: Generate the neighborhood population PN(k).

The populations P(k) and PG(k) form the union population PU(k) = P(k) ∪ PG(k). This step uses the neighborhood operator to generate the neighborhood population PN(k) from the union population PU(k). For the first iteration, PU(0) includes 15 chromosomes, of which 9 are in P(0) and 6 are in PG(0):

P U ( 0 ) = P ( 0 ) P G ( 0 ) = { EDD , ERD , R2 , SPT , R1 , R4 ,
R3 , R5 , LPT , C1 , C2 , C3 , C4 , M1 , M2 }

The neighborhood operator swaps adjacent sub-chromosomes in each chromosome to generate neighborhood chromosomes. Each chromosome in PU(0) will have three neighborhood chromosomes. The best chromosome will be selected to move into PN(0) to go forward. Population PN(0) includes the best neighborhood chromosomes, as shown in Table 6:

P N ( 0 ) = { N1 , N2 , N3 , N4 , N5 , N6 , N7 , N8 , N9 , N10 ,
N11 , N12 , N13 , N14 , N15 }
TABLE 6
www.bohrpub.com

Table 6. The neighborhood population PN(0).

In PN(0), there are two chromosomes, N1 and N2, in the tabu list; the rest are not.

Step 6: Generate the next population P(k+1).

The step uses the replacement operator to generate the next population P(k+1) from the populations P(k) and PN(k). The chromosomes from PN(k) will be added to the current population P(k) to make the next population P(k+1), if they are not in the current tabu list and their objective values exceed an acceptable threshold, defined by the objective value of the threshold chromosome. The threshold chromosome is a chromosome in P(k) with position n defined by population size P and threshold parameter k as follows:

n = P / k

With a population size of nine and a threshold parameter of two chosen, the threshold chromosome is the fourth chromosome of P(k) in the top-down ranking. In order to keep the next population size constant, chromosomes in P(k) with the lowest values are removed from the next population. This step also updates the tabu list by adding the chromosomes moved into the next population P(k+1).

In this iteration, the threshold chromosome is SPT, with a threshold value of 16.34. Chromosomes N3, N4, and N12 move into, and R3, R5, and LPT move out of the population. The chromosomes in the next population, P(1), are shown in Table 7:

P = ( 1 ) { EDD , N12 , ERD , N3 , N4 , R2 , SPT , R1 , R4 }
TABLE 7
www.bohrpub.com

Table 7. The next population P(1).

The tabu list is updated after iteration 1 as follows:

TL = { EDD , ERD , R2 , SPT , R1 , R4 , R3 , R5 ,
LPT , N3 , N4 , N12 }

Step 7: Check the termination rule.

After iteration 1, EDD is again the best chromosome with the best objective value of 9.41, appearing twice. The termination rule is not satisfied, so iteration 2 is executed. The results after 14 iterations are shown in Table 8.

TABLE 8
www.bohrpub.com

Table 8. The results after 21 iterations.

From the 12th iteration to the 21st iteration, the best objective value remains the same, the termination rule is satisfied, and the algorithm ends. The scheduling result for this run is as follows:

C = ( 413296857 , 413297856 , 142379856 , 472319856 ) ,
T = 2.37 ( h )

Step 8: Run the algorithm a number of times to choose the best scheduling result.

The algorithm is run 10 times with the results as shown in Table 9.

TABLE 9
www.bohrpub.com

Table 9. The results after 10 runs.

The best scheduling result is found on the first run. The scheduling sequence C and the objective value T are as follows:

C = ( 423981756 , 498132756 , 982713546 , 439782156 ) ,
T = 2.37 ( h ) .

The Gantt chart for the GATS model is shown in Figure 1.

FIGURE 1
www.bohrpub.com

Figure 1. The Gantt chart by GATS model.

The objective value according to the GATS model (2.37h) is less than the objective value according to the LPT model currently used (46.93h).

The GATS with DOE to solve the FFSS problem

The parameters of the above GATS model were selected empirically, so the results are not very good. Experimental design is used to determine the model parameters. Firstly, a binary experiment is conducted with the model parameters P, Pc, and Pm chosen as the input factors and the total tardiness time T chosen as the output factor. The levels of the input factors are chosen as in Table 10.

TABLE 10
www.bohrpub.com

Table 10. The binary levels of the input factors.

Each combination of input factors is tested five times. The total number of experiments is 40. The experimental results are collected. From the data, Analysis of variance is performed as in Figure 2.

FIGURE 2
www.bohrpub.com

Figure 2. The ANOVA of the binary experiment.

From the above ANOVA table, with α of 0.05, the affected factors are factors P and Pc. The values of parameters to give the minimum objective value are shown in Figure 3.

P = 12 , P = c 0.8 , P = m 0.2
FIGURE 3
www.bohrpub.com

Figure 3. The values of parameters to give the minimum objective value.

A further experiment is conducted with the two affected factors chosen as the input factors and the total tardiness time T chosen as the output factor. The mutation probability Pm is kept at 0.2. The levels of the input factors are chosen as in Table 11.

TABLE 11
www.bohrpub.com

Table 11. The multilevels of the input factors.

There are 25 combinations of input factors. Each combination is experimented with four times. The total number of experiments is 100. The experimental results are collected. From the data, analysis of variance is performed as in Figure 4.

FIGURE 4
www.bohrpub.com

Figure 4. The ANOVA of the multilevels experiment.

From the ANOVA shown in Figure 4, with α of 0.05, the most affected factor is population size P. The values of parameters to give the minimum objective value are shown in Figure 5.

P = 15 , P = c 0.9
FIGURE 5
www.bohrpub.com

Figure 5. The values of parameters must give the minimum objective value.

The GATS model with DOE is run 10 times with the parameters defined by the experiment as follows:

P = 15 , P = c 0.9 , P = m 0.2

The results of the GATS model with DOE are shown in Table 12.

TABLE 12
www.bohrpub.com

Table 12. The results of the GATS with DOE after 10 runs.

The best scheduling result is found on the 8th run. The scheduling sequence C and the objective value T are as follows:

C = ( 394218756 , 394127586 , 342791856 , 432198756 ) ,
T = 0 ( h ) .

The Gantt chart for the GATS model with DOE is shown in Figure 6.

FIGURE 6
www.bohrpub.com

Figure 6. The Gantt chart for the GATS model with DOE.

The objective value according to the GATS model with DOE (0h) is less than the objective value according to the GATS model without DOE (2.37h).

Conclusion

The GATS model has been used to solve the FFSS problem with nine orders on four stations and three homogeneous parallel machines. In the model, GA is used as the platform to perform a global search of the solution space, and TS is used to perform a local search to refine the solution found by GA. The results show that the GATS model gives a better objective value of tardiness time than the heuristic LPT method being used.

Design of experiments has been used to optimize the parameters of the GATS model. The results show that the GATS model with DOE gives a better objective value of tardiness time than the GATS model without DOE. The GATS models without and with DOE have been used to solve the FFSS problem with small sizes. Future research will use the models to solve real-world problems.

Author contributions

PN is the thesis advisor for TL. PN has developed the research models for the thesis. TL has collected the data and written a program to run the algorithms based on the models. PN has composed the research article based on the thesis. Both authors contributed to the article and approved the submitted version.

Acknowledgments

We extend our heartfelt appreciation to everyone who has contributed to the completion of this research article, especially our families, the HCMC University of Technology, and the scientific community for their invaluable support. We endorse the fact that your contributions have been instrumental in the successful completion of this work.

References

1. Umam M, Mustafid M, Suryono S. A hybrid genetic algorithm and tabu search for minimizing makespan in flow shop scheduling problem. J King Saud Univ Comput Inform Sci. (2022) 34:7459–67.

Google Scholar

2. Etiler O, Toklu B, Atak M, Wilson J. A genetic algorithm for flow shop scheduling problems. J Operat Res Soc. (2004) 55:830–5.

Google Scholar

3. Kim S, Choi H, Lee D. Tabu search heuristics for parallel machine scheduling with sequence-dependent setup and ready times. Proceedings of the International Conference on Computational Science and its Applications, vol Part III, Glasgow, UK, May 8-11, 2006. Glasgow: (2006). p. 728–37.

Google Scholar

4. Helal M, Rabadi G, Al-Salem A. A tabu search algorithm to minimize the makespan for the unrelated parallel machines scheduling problem with setup times. Int J Operat Res. (2006) 3:182–92.

Google Scholar

5. Bilge Ü, Kirac F, Kurtulan M, Pekgun P. A tabu search algorithm for parallel machine total tardiness problem. Comput Operat Res. (2002) 31:397–414.

Google Scholar

6. Kolahan F, Tavakoli A, Tajdin B, Hosayni M. Analysis of neighborhood generation and move selection strategies on the performance of Tabu Search. Proceedings of the 6th WSEAS International Conference on Applied Computer Science, Tenerife, Canary Islands, Spain, December 16-18, 2006. Tenerife: (2006).

Google Scholar

7. Gupta J, Palanimuthu N, Chen C-L. Designing a tabu search algorithm for the two-stage flow shop problem with secondary criterion. Product Plann Control Manag Operat. (1999) 10:251–65.

Google Scholar

8. Burduk A, Musiał K, Kochańska J, Górnicka D, Stetsenko A. Tabu Search and genetic algorithm for production process scheduling problem. Logforum. (2019) 15:181–9.

Google Scholar