class mindspore_rl.policy.RandomPolicy(action_space_dim)[源代码]

在[0, action_space_dim)之间产生随机动作。

参数:
  • action_space_dim (int) - 动作空间的维度。

样例:

>>> action_space_dim = 2
>>> policy = RandomPolicy(action_space_dim)
>>> output = policy()
>>> print(output.shape)
(1,)
construct()[源代码]

返回[0, action_space_dim)之间的随机数。

返回:

[0, action_space_dim)之间的随机数。