rlportfolio.policy.ei3 module
- class EI3
Bases:
Module- __init__(initial_features: int = 3, k_short: int = 3, k_medium: int = 21, conv_mid_features: int = 3, conv_final_features: int = 20, time_window: int = 50, device: str = 'cpu') EI3
EI3 (ensemble of identical independent inception) policy network initializer.
- Parameters:
initial_features – Number of input features.
k_short – Size of short convolutional kernel.
k_medium – Size of medium convolutional kernel.
conv_mid_features – Size of intermediate convolutional channels.
conv_final_features – Size of final convolutional channels.
time_window – Size of time window used as agent’s state.
device – Device in which the neural network will be run.
Note
Reference article: https://doi.org/10.1145/3357384.3357961.
- forward(observation: Tensor, last_action: Tensor) Tensor
Policy network’s forward propagation. Defines a most favorable action of this policy given the inputs.
- Parameters:
observation – environment observation.
last_action – Last action performed by agent.
- Returns:
Action to be taken.