Machine Learning Archives - Page 2 of 3

Tackling Non-Stationary Problems in Reinforcement Learning using Fourier Q Learning

Typical algorithms for solving reinforcement learning (RL) problems, are built on an assumption of a stationary environment (modeled as a stationary MDP), meaning the agent is learning how to act in an environment in which the action chosen in each state is not time dependent. However, one can think of many everyday life problems that occur in non-stationary environments, which change over time. Such problems were discussed in former articles...

Categories: Machine Learning

Tags: Machine Learning

RL with trajectory feedback

The RL framework requires to define reward function, which generates scalar reward per action, which can be a hard task. Therefore, many RL environments are described with a ‘Sparse reward’, in which most of the agent actions would receive no reward, except for an action that would lead the agent to the final goal. A lot of RL algorithms can have difficulties getting to a good result in those kinds...

Categories: Machine Learning

Safety and reliability are highly important in real-world Reinforcement Learning (RL) systems – in particular in risk-intolerant applications such as autonomous driving and medical devices. A statistical test has been recently suggested to detect whenever the performance of the RL agent deteriorates. However, this test has several limitations: It measures the agent’s rewards, but ignores other information that is usually available in RL problems. For example, when an autonomous car...

Categories: Machine Learning

Tags: Machine Learning | not taken

Smart class to code allocations for ECOC

Binary classification is a popular Machine Learning (ML) task where we wish to classify an input instance into one class out of 2 possible classes. For instance, given an image of an animal, predict whether it is a dog or a cat. In multi-class classification we wish to classify an instance into one class from a set of many possible classes (> 2). For example, given an image of an...

Categories: Machine Learning

The goal of this project is to test a new continuous reinforcement learning algorithm in several different simulated environments. The algorithm, SIMPLE, uses a neural net to simulate the reward function for the environment’s state, similarly to the DDPG algorithm. The project’s contribution comes from its unique approach to finding an optimal action in a continuous action space. Using the neural net as a model for the reward function, it...

Categories: Machine Learning

Tags: Machine Learning

Categories: Machine Learning

Machine Learning from Disaster

The sinking of the RMS Titanic is one of the most infamous shipwrecks in history. On April 15, 1912, during her maiden voyage, the Titanic sank after colliding with an iceberg, killing 1502 out of 2224 passengers and crew. This sensational tragedy shocked the international community and led to better safety regulations for ships. One of the reasons that the shipwreck led to such loss of life was that there...

Categories: Machine Learning

L33- Deep Learning with Markovian Data

Machine learning has gained a lot of interest in the last decade, especially due to impressive advances in deep learning. A typical assumption in machine learning is that the data is i.i.d. from some unknown data distribution. However, in many real-world domains this assumption does not hold, and instead we have some temporal structure in the data. In such cases, it is known that standard optimization algorithms (e.g., SGD) suffer...

Categories: Machine Learning

Tags: Machine Learning | not taken

Convolutional neural networks (CNNs) compute their output using weighted-sums of adjacent input elements. This method enables CNNs to achieve state-of-the-art results in a wide range of applications such as computer vision and speech recognition. However, it also comes with the cost of high computational intensity. Shomron et al purposed exploiting the spatial correlation inherent in CNNs and predict activation values, thus reducing the needed computations in the network. They introduced...

Categories: Machine Learning