1,721,022 research outputs found

    Sharing STEP data on the internet

    No full text

    Cooperative and Competitive Biases for Multi-Agent Reinforcement Learning

    No full text
    Training a multi-agent reinforcement learning (MARL) algorithm is more challenging than training a single-agent reinforcement learning algorithm, because the result of a multi-agent task strongly depends on the complex interactions among agents and their interactions with a stochastic and dynamic environment. We propose an algorithm that boosts MARL training using the biased action information of other agents based on a friend-or-foe concept. For a cooperative and competitive environment, there are generally two groups of agents: cooperative-agents and competitive-agents. In the proposed algorithm, each agent updates its value function using its own action and the biased action information of other agents in the two groups. The biased joint action of cooperative agents is computed as the sum of their actual joint action and the imaginary cooperative joint action, by assuming all the cooperative agents jointly maximize the target agent’s value function. The biased joint action of competitive agents can be computed similarly. Each agent then updates its own value function using the biased action information, resulting in a biased value function and corresponding biased policy. Subsequently, the biased policy of each agent is inevitably subjected to recommend an action to cooperate and compete with other agents, thereby introducing more active interactions among agents and enhancing the MARL policy learning. We empirically demonstrate that our algorithm outperforms existing algorithms in various mixed cooperative-competitive environments. Furthermore, the introduced biases gradually decrease as the training proceeds and the correction based on the imaginary assumption vanishes

    General EM Algorithm for Fitting Non-Monotone Hazard Functions from Truncated and Censored Observations

    No full text
    Recently, many researchers focused on modeling non-monotonic hazard functions such as bath-tube and hump shapes. However, most of their estimation methods are focused on complete observations. Since reliability data are typically censored and truncated, a general EM algorithm is proposed, which can fit any of those complex hazard functions. The proposed EM algorithm is analyzed by fitting well-known 4-parameter hazard functions, where its performance is compared by their specific direct methods through extensive Monte Carlo simulations. (c) 2022 Elsevier B.V. All rights reserved.11Nsciescopu

    Learning per-machine linear dispatching rule for heterogeneous multi-machines control

    No full text
    This paper proposes a per-machine linear dispatching rule learning approach to improve the scheduling of re-entrant flow shop such as semiconductor fab. Finding an optimal schedule of a complex manufacturing system is intractable; hence, a dispatching rule as a heuristic approach is widely used in actual practice. Also, to develop a good dispatching rule, an automated methodology for developing heuristics, also known as a hyper-heuristic, has been studied extensively. However, most of the literature has focused on finding a single-sophisticated dispatching rule, in which every machine uses the same rule. Such an approach often shows suboptimal performance when the optimal dispatching rule is different on each machine. To solve this problem, we introduce a simple and effective per-machine dispatching rule learning approach, in which each machine has one linear dispatching rule that is optimised by the Gradient-based Evolutionary Strategy (GES). This method is sample-efficient and can be applied to non-differentiable objective functions such as average Cycle Time. The proposed approach was mainly compared to two popular methods based on Genetic Programming (GP) and the Genetic Algorithm (GA) on a four-station and eight-machine re-entrant flow shop. Numerical results show that the proposed approach outperforms widely used methods.
    corecore