examples/rllib/self_play_train.py error message
unknown
powershell
3 years ago
259 kB
8
Indexable
(base) nell@Jeremiah norm-games % python3 examples/rllib/self_play_train.py
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:18: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
DESCRIPTOR = _descriptor.FileDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:36: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_descriptor.FieldDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:29: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_TENSORSHAPEPROTO_DIM = _descriptor.Descriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:19: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
DESCRIPTOR = _descriptor.FileDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:33: DeprecationWarning: Call to deprecated create function EnumValueDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_descriptor.EnumValueDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:27: DeprecationWarning: Call to deprecated create function EnumDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_DATATYPE = _descriptor.EnumDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:287: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_descriptor.FieldDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:280: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_SERIALIZEDDTYPE = _descriptor.Descriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:20: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
DESCRIPTOR = _descriptor.FileDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:39: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_descriptor.FieldDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:32: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_RESOURCEHANDLEPROTO_DTYPEANDSHAPE = _descriptor.Descriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:21: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
DESCRIPTOR = _descriptor.FileDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:40: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_descriptor.FieldDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:33: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_TENSORPROTO = _descriptor.Descriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py:21: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
DESCRIPTOR = _descriptor.FileDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py:40: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
_descriptor.FieldDescriptor(
/Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow_probability/python/__init__.py:57: DeprecationWarning: distutils Version classes are deprecated. Use packaging.version instead.
if (distutils.version.LooseVersion(tf.__version__) <
2023-01-30 13:47:50,054 INFO worker.py:1538 -- Started a local Ray instance.
2023-01-30 13:47:52,454 INFO algorithm_config.py:2503 -- Your framework setting is 'tf', meaning you are using static-graph mode. Set framework='tf2' to enable eager execution with tf2.x. You may also then want to set eager_tracing=True in order to reach similar execution speed as with static-graph mode.
2023-01-30 13:47:52,455 INFO algorithm_config.py:2503 -- Your framework setting is 'tf', meaning you are using static-graph mode. Set framework='tf2' to enable eager execution with tf2.x. You may also then want to set eager_tracing=True in order to reach similar execution speed as with static-graph mode.
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:18: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:36: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _descriptor.FieldDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:29: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _TENSORSHAPEPROTO_DIM = _descriptor.Descriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:19: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:33: DeprecationWarning: Call to deprecated create function EnumValueDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _descriptor.EnumValueDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:27: DeprecationWarning: Call to deprecated create function EnumDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _DATATYPE = _descriptor.EnumDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:287: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _descriptor.FieldDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:280: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _SERIALIZEDDTYPE = _descriptor.Descriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:20: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:39: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _descriptor.FieldDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:32: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _RESOURCEHANDLEPROTO_DTYPEANDSHAPE = _descriptor.Descriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:21: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:40: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _descriptor.FieldDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:33: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _TENSORPROTO = _descriptor.Descriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py:21: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py:40: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46962) _descriptor.FieldDescriptor(
(pid=46962) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow_probability/python/__init__.py:57: DeprecationWarning: distutils Version classes are deprecated. Use packaging.version instead.
(pid=46962) if (distutils.version.LooseVersion(tf.__version__) <
(PPO pid=46962) 2023-01-30 13:47:56,215 WARNING algorithm_config.py:488 -- Cannot create PPOConfig from given `config_dict`! Property __stdout_file__ not supported.
(PPO pid=46962) 2023-01-30 13:47:56,216 INFO algorithm_config.py:2503 -- Your framework setting is 'tf', meaning you are using static-graph mode. Set framework='tf2' to enable eager execution with tf2.x. You may also then want to set eager_tracing=True in order to reach similar execution speed as with static-graph mode.
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:18: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:36: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _descriptor.FieldDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:29: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _TENSORSHAPEPROTO_DIM = _descriptor.Descriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:19: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:33: DeprecationWarning: Call to deprecated create function EnumValueDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _descriptor.EnumValueDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:27: DeprecationWarning: Call to deprecated create function EnumDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _DATATYPE = _descriptor.EnumDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:287: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _descriptor.FieldDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:280: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _SERIALIZEDDTYPE = _descriptor.Descriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:20: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:39: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _descriptor.FieldDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:32: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _RESOURCEHANDLEPROTO_DTYPEANDSHAPE = _descriptor.Descriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:21: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:40: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _descriptor.FieldDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:33: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _TENSORPROTO = _descriptor.Descriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py:21: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py:40: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46974) _descriptor.FieldDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:18: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:36: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _descriptor.FieldDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_shape_pb2.py:29: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _TENSORSHAPEPROTO_DIM = _descriptor.Descriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:19: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:33: DeprecationWarning: Call to deprecated create function EnumValueDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _descriptor.EnumValueDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:27: DeprecationWarning: Call to deprecated create function EnumDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _DATATYPE = _descriptor.EnumDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:287: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _descriptor.FieldDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/types_pb2.py:280: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _SERIALIZEDDTYPE = _descriptor.Descriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:20: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:39: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _descriptor.FieldDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/resource_handle_pb2.py:32: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _RESOURCEHANDLEPROTO_DTYPEANDSHAPE = _descriptor.Descriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:21: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:40: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _descriptor.FieldDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/tensor_pb2.py:33: DeprecationWarning: Call to deprecated create function Descriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _TENSORPROTO = _descriptor.Descriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py:21: DeprecationWarning: Call to deprecated create function FileDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) DESCRIPTOR = _descriptor.FileDescriptor(
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py:40: DeprecationWarning: Call to deprecated create function FieldDescriptor(). Note: Create unlinked descriptors is going to go away. Please use get/find descriptors from generated code or query the descriptor_pool.
(pid=46975) _descriptor.FieldDescriptor(
(pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow_probability/python/__init__.py:57: DeprecationWarning: distutils Version classes are deprecated. Use packaging.version instead.
(pid=46974) if (distutils.version.LooseVersion(tf.__version__) <
(pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/tensorflow_probability/python/__init__.py:57: DeprecationWarning: distutils Version classes are deprecated. Use packaging.version instead.
(pid=46975) if (distutils.version.LooseVersion(tf.__version__) <
(RolloutWorker pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/gym/spaces/box.py:155: UserWarning: WARN: Casting input x to numpy array.
(RolloutWorker pid=46974) logger.warn("Casting input x to numpy array.")
(RolloutWorker pid=46974) /Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/utils/pre_checks/env.py:434: DeprecationWarning: `np.bool` is a deprecated alias for the builtin `bool`. To silence this warning, use `bool` by itself. Doing this will not modify any behavior and is safe. If you specifically wanted the numpy scalar type, use `np.bool_` here.
(RolloutWorker pid=46974) Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations
(RolloutWorker pid=46974) if not isinstance(done_, (bool, np.bool, np.bool_)):
(RolloutWorker pid=46974) 2023-01-30 13:48:00,643 DEBUG rollout_worker.py:1932 -- Creating policy for agent_0
(RolloutWorker pid=46974) 2023-01-30 13:48:00,645 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/gym/spaces/box.py:155: UserWarning: WARN: Casting input x to numpy array.
(RolloutWorker pid=46975) logger.warn("Casting input x to numpy array.")
(RolloutWorker pid=46975) /Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/utils/pre_checks/env.py:434: DeprecationWarning: `np.bool` is a deprecated alias for the builtin `bool`. To silence this warning, use `bool` by itself. Doing this will not modify any behavior and is safe. If you specifically wanted the numpy scalar type, use `np.bool_` here.
(RolloutWorker pid=46975) Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations
(RolloutWorker pid=46975) if not isinstance(done_, (bool, np.bool, np.bool_)):
(RolloutWorker pid=46975) 2023-01-30 13:48:00,682 DEBUG rollout_worker.py:1932 -- Creating policy for agent_0
(RolloutWorker pid=46975) 2023-01-30 13:48:00,684 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46974) 2023-01-30 13:48:00,956 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:00,957 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:00,957 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46974) 2023-01-30 13:48:00,957 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46975) 2023-01-30 13:48:00,993 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:00,993 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:00,993 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46975) 2023-01-30 13:48:00,993 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46974) Model: "model_5"
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46974) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_0_wk1/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_0_wk1/Sequen [()] 0 ['tf_op_layer_agent_0_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46974) yer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_0_wk1/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46974) pLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_0_wk1/Sequen [(None,)] 0 ['tf_op_layer_agent_0_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46974) r)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_0_wk1/Sequen [(None, 1)] 0 ['tf_op_layer_agent_0_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46974) )
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_0_wk1/Sequen [(None, None)] 0 ['tf_op_layer_agent_0_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46974) ) 'tf_op_layer_agent_0_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Cast[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46974) (None, 256), 'h[0][0]',
(RolloutWorker pid=46974) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46974) 'tf_op_layer_agent_0_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Less[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) Total params: 535,817
(RolloutWorker pid=46974) Trainable params: 535,817
(RolloutWorker pid=46974) Non-trainable params: 0
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Model: "model_5"
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_0_wk2/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_0_wk2/Sequen [()] 0 ['tf_op_layer_agent_0_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46975) yer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_0_wk2/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46975) pLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_0_wk2/Sequen [(None,)] 0 ['tf_op_layer_agent_0_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46975) r)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_0_wk2/Sequen [(None, 1)] 0 ['tf_op_layer_agent_0_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46975) )
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_0_wk2/Sequen [(None, None)] 0 ['tf_op_layer_agent_0_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46975) ) 'tf_op_layer_agent_0_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Cast[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46975) (None, 256), 'h[0][0]',
(RolloutWorker pid=46975) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46975) 'tf_op_layer_agent_0_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Less[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) Total params: 535,817
(RolloutWorker pid=46975) Trainable params: 535,817
(RolloutWorker pid=46975) Non-trainable params: 0
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46974) 2023-01-30 13:48:01,060 INFO policy.py:1147 -- Policy (worker=1) running on CPU.
(RolloutWorker pid=46974) 2023-01-30 13:48:01,060 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46975) 2023-01-30 13:48:01,087 INFO policy.py:1147 -- Policy (worker=2) running on CPU.
(RolloutWorker pid=46975) 2023-01-30 13:48:01,087 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46974) 2023-01-30 13:48:01,126 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:01,126 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:01,127 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:01,127 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:01,127 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46975) 2023-01-30 13:48:01,152 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:01,152 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:01,153 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:01,153 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:01,153 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46974) 2023-01-30 13:48:01,344 DEBUG dynamic_tf_policy_v2.py:755 -- Initializing loss function with dummy input:
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) { 'action_dist_inputs': <tf.Tensor 'agent_0_wk1/action_dist_inputs:0' shape=(?, 8) dtype=float32>,
(RolloutWorker pid=46974) 'action_logp': <tf.Tensor 'agent_0_wk1/action_logp:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'action_prob': <tf.Tensor 'agent_0_wk1/action_prob:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'actions': <tf.Tensor 'agent_0_wk1/actions:0' shape=(?,) dtype=int64>,
(RolloutWorker pid=46974) 'advantages': <tf.Tensor 'agent_0_wk1/advantages:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'agent_index': <tf.Tensor 'agent_0_wk1/agent_index:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'dones': <tf.Tensor 'agent_0_wk1/dones:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'eps_id': <tf.Tensor 'agent_0_wk1/eps_id:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'new_obs': <tf.Tensor 'agent_0_wk1/Placeholder_1:0' shape=(?, 23236) dtype=float32>,
(RolloutWorker pid=46974) 'obs': <tf.Tensor 'agent_0_wk1/Placeholder:0' shape=(?, 23236) dtype=float32>,
(RolloutWorker pid=46974) 'prev_actions': <tf.Tensor 'agent_0_wk1/prev_actions:0' shape=(?,) dtype=int64>,
(RolloutWorker pid=46974) 'prev_rewards': <tf.Tensor 'agent_0_wk1/prev_rewards:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'rewards': <tf.Tensor 'agent_0_wk1/rewards:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'seq_lens': <tf.Tensor 'agent_0_wk1/seq_lens:0' shape=(?,) dtype=int32>,
(RolloutWorker pid=46974) 'state_in_0': <tf.Tensor 'agent_0_wk1/state_in_0:0' shape=(?, 256) dtype=float32>,
(RolloutWorker pid=46974) 'state_in_1': <tf.Tensor 'agent_0_wk1/state_in_1:0' shape=(?, 256) dtype=float32>,
(RolloutWorker pid=46974) 't': <tf.Tensor 'agent_0_wk1/t:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'unroll_id': <tf.Tensor 'agent_0_wk1/unroll_id:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'value_targets': <tf.Tensor 'agent_0_wk1/value_targets:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'vf_preds': <tf.Tensor 'agent_0_wk1/vf_preds:0' shape=(?,) dtype=float32>}
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) 2023-01-30 13:48:01,816 DEBUG tf_policy.py:783 -- These tensors were used in the loss functions:
(RolloutWorker pid=46974) { 'action_dist_inputs': <tf.Tensor 'agent_0_wk1/action_dist_inputs:0' shape=(?, 8) dtype=float32>,
(RolloutWorker pid=46974) 'action_logp': <tf.Tensor 'agent_0_wk1/action_logp:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'action_prob': <tf.Tensor 'agent_0_wk1/action_prob:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'actions': <tf.Tensor 'agent_0_wk1/actions:0' shape=(?,) dtype=int64>,
(RolloutWorker pid=46974) 'advantages': <tf.Tensor 'agent_0_wk1/advantages:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'dones': <tf.Tensor 'agent_0_wk1/dones:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'new_obs': <tf.Tensor 'agent_0_wk1/Placeholder_1:0' shape=(?, 23236) dtype=float32>,
(RolloutWorker pid=46974) 'obs': <tf.Tensor 'agent_0_wk1/Placeholder:0' shape=(?, 23236) dtype=float32>,
(RolloutWorker pid=46974) 'prev_actions': <tf.Tensor 'agent_0_wk1/prev_actions:0' shape=(?,) dtype=int64>,
(RolloutWorker pid=46974) 'rewards': <tf.Tensor 'agent_0_wk1/rewards:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'seq_lens': <tf.Tensor 'agent_0_wk1/seq_lens:0' shape=(?,) dtype=int32>,
(RolloutWorker pid=46974) 'state_in_0': <tf.Tensor 'agent_0_wk1/state_in_0:0' shape=(?, 256) dtype=float32>,
(RolloutWorker pid=46974) 'state_in_1': <tf.Tensor 'agent_0_wk1/state_in_1:0' shape=(?, 256) dtype=float32>,
(RolloutWorker pid=46974) 'value_targets': <tf.Tensor 'agent_0_wk1/value_targets:0' shape=(?,) dtype=float32>,
(RolloutWorker pid=46974) 'vf_preds': <tf.Tensor 'agent_0_wk1/vf_preds:0' shape=(?,) dtype=float32>}
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) 2023-01-30 13:48:02,076 DEBUG rollout_worker.py:1932 -- Creating policy for agent_1
(RolloutWorker pid=46974) 2023-01-30 13:48:02,077 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46975) 2023-01-30 13:48:02,109 DEBUG rollout_worker.py:1932 -- Creating policy for agent_1
(RolloutWorker pid=46975) 2023-01-30 13:48:02,111 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46974) 2023-01-30 13:48:02,361 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:02,361 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:02,362 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46974) 2023-01-30 13:48:02,362 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46974) Model: "model_5"
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46974) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_1_wk1/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_1_wk1/Sequen [()] 0 ['tf_op_layer_agent_1_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46974) yer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_1_wk1/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46974) pLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_1_wk1/Sequen [(None,)] 0 ['tf_op_layer_agent_1_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46974) r)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_1_wk1/Sequen [(None, 1)] 0 ['tf_op_layer_agent_1_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46974) )
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_1_wk1/Sequen [(None, None)] 0 ['tf_op_layer_agent_1_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46974) ) 'tf_op_layer_agent_1_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Cast[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46974) (None, 256), 'h[0][0]',
(RolloutWorker pid=46974) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46974) 'tf_op_layer_agent_1_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Less[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) Total params: 535,817
(RolloutWorker pid=46974) Trainable params: 535,817
(RolloutWorker pid=46974) Non-trainable params: 0
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46974) 2023-01-30 13:48:02,448 INFO policy.py:1147 -- Policy (worker=1) running on CPU.
(RolloutWorker pid=46974) 2023-01-30 13:48:02,448 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46975) 2023-01-30 13:48:02,385 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:02,385 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:02,385 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46975) 2023-01-30 13:48:02,385 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46975) 2023-01-30 13:48:02,469 INFO policy.py:1147 -- Policy (worker=2) running on CPU.
(RolloutWorker pid=46975) 2023-01-30 13:48:02,469 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46975) Model: "model_5"
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_1_wk2/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_1_wk2/Sequen [()] 0 ['tf_op_layer_agent_1_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46975) yer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_1_wk2/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46975) pLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_1_wk2/Sequen [(None,)] 0 ['tf_op_layer_agent_1_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46975) r)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_1_wk2/Sequen [(None, 1)] 0 ['tf_op_layer_agent_1_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46975) )
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_1_wk2/Sequen [(None, None)] 0 ['tf_op_layer_agent_1_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46975) ) 'tf_op_layer_agent_1_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Cast[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46975) (None, 256), 'h[0][0]',
(RolloutWorker pid=46975) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46975) 'tf_op_layer_agent_1_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Less[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) Total params: 535,817
(RolloutWorker pid=46975) Trainable params: 535,817
(RolloutWorker pid=46975) Non-trainable params: 0
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46974) 2023-01-30 13:48:02,511 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:02,512 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:02,512 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:02,513 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:02,513 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46975) 2023-01-30 13:48:02,533 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:02,533 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:02,533 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:02,533 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:02,534 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46974) 2023-01-30 13:48:03,339 DEBUG rollout_worker.py:1932 -- Creating policy for agent_2
(RolloutWorker pid=46974) 2023-01-30 13:48:03,340 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46975) 2023-01-30 13:48:03,358 DEBUG rollout_worker.py:1932 -- Creating policy for agent_2
(RolloutWorker pid=46975) 2023-01-30 13:48:03,359 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46974) 2023-01-30 13:48:03,632 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:03,632 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:03,632 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46974) 2023-01-30 13:48:03,632 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46974) 2023-01-30 13:48:03,716 INFO policy.py:1147 -- Policy (worker=1) running on CPU.
(RolloutWorker pid=46974) 2023-01-30 13:48:03,717 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46975) 2023-01-30 13:48:03,634 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:03,634 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:03,634 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46975) 2023-01-30 13:48:03,634 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46975) 2023-01-30 13:48:03,718 INFO policy.py:1147 -- Policy (worker=2) running on CPU.
(RolloutWorker pid=46975) 2023-01-30 13:48:03,718 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46974) Model: "model_5"
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46974) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_2_wk1/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_2_wk1/Sequen [()] 0 ['tf_op_layer_agent_2_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46974) yer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_2_wk1/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46974) pLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_2_wk1/Sequen [(None,)] 0 ['tf_op_layer_agent_2_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46974) r)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_2_wk1/Sequen [(None, 1)] 0 ['tf_op_layer_agent_2_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46974) )
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_2_wk1/Sequen [(None, None)] 0 ['tf_op_layer_agent_2_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46974) ) 'tf_op_layer_agent_2_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Cast[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46974) (None, 256), 'h[0][0]',
(RolloutWorker pid=46974) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46974) 'tf_op_layer_agent_2_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Less[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) Total params: 535,817
(RolloutWorker pid=46974) Trainable params: 535,817
(RolloutWorker pid=46974) Non-trainable params: 0
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Model: "model_5"
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_2_wk2/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_2_wk2/Sequen [()] 0 ['tf_op_layer_agent_2_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46975) yer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_2_wk2/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46975) pLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_2_wk2/Sequen [(None,)] 0 ['tf_op_layer_agent_2_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46975) r)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_2_wk2/Sequen [(None, 1)] 0 ['tf_op_layer_agent_2_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46975) )
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_2_wk2/Sequen [(None, None)] 0 ['tf_op_layer_agent_2_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46975) ) 'tf_op_layer_agent_2_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Cast[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46975) (None, 256), 'h[0][0]',
(RolloutWorker pid=46975) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46975) 'tf_op_layer_agent_2_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Less[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) Total params: 535,817
(RolloutWorker pid=46975) Trainable params: 535,817
(RolloutWorker pid=46975) Non-trainable params: 0
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46974) 2023-01-30 13:48:03,782 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:03,782 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:03,783 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:03,783 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:03,783 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46975) 2023-01-30 13:48:03,778 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:03,781 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:03,781 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:03,782 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:03,782 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46974) 2023-01-30 13:48:04,609 DEBUG rollout_worker.py:1932 -- Creating policy for agent_3
(RolloutWorker pid=46974) 2023-01-30 13:48:04,611 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46975) 2023-01-30 13:48:04,612 DEBUG rollout_worker.py:1932 -- Creating policy for agent_3
(RolloutWorker pid=46975) 2023-01-30 13:48:04,614 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46974) 2023-01-30 13:48:04,890 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:04,890 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:04,890 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46974) 2023-01-30 13:48:04,890 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46975) 2023-01-30 13:48:04,891 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:04,891 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:04,891 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46975) 2023-01-30 13:48:04,891 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46974) Model: "model_5"
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46974) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_3_wk1/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_3_wk1/Sequen [()] 0 ['tf_op_layer_agent_3_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46974) yer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_3_wk1/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46974) pLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_3_wk1/Sequen [(None,)] 0 ['tf_op_layer_agent_3_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46974) r)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_3_wk1/Sequen [(None, 1)] 0 ['tf_op_layer_agent_3_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46974) )
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_3_wk1/Sequen [(None, None)] 0 ['tf_op_layer_agent_3_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46974) ) 'tf_op_layer_agent_3_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Cast[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46974) (None, 256), 'h[0][0]',
(RolloutWorker pid=46974) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46974) 'tf_op_layer_agent_3_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Less[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) Total params: 535,817
(RolloutWorker pid=46974) Trainable params: 535,817
(RolloutWorker pid=46974) Non-trainable params: 0
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Model: "model_5"
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_3_wk2/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_3_wk2/Sequen [()] 0 ['tf_op_layer_agent_3_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46975) yer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_3_wk2/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46975) pLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_3_wk2/Sequen [(None,)] 0 ['tf_op_layer_agent_3_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46975) r)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_3_wk2/Sequen [(None, 1)] 0 ['tf_op_layer_agent_3_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46975) )
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_3_wk2/Sequen [(None, None)] 0 ['tf_op_layer_agent_3_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46975) ) 'tf_op_layer_agent_3_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Cast[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46975) (None, 256), 'h[0][0]',
(RolloutWorker pid=46975) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46975) 'tf_op_layer_agent_3_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Less[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) Total params: 535,817
(RolloutWorker pid=46975) Trainable params: 535,817
(RolloutWorker pid=46975) Non-trainable params: 0
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46974) 2023-01-30 13:48:04,973 INFO policy.py:1147 -- Policy (worker=1) running on CPU.
(RolloutWorker pid=46974) 2023-01-30 13:48:04,973 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46974) 2023-01-30 13:48:05,035 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:05,035 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:05,035 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:05,035 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:05,036 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46975) 2023-01-30 13:48:04,975 INFO policy.py:1147 -- Policy (worker=2) running on CPU.
(RolloutWorker pid=46975) 2023-01-30 13:48:04,976 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46975) 2023-01-30 13:48:05,038 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:05,039 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:05,039 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:05,039 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:05,039 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46974) 2023-01-30 13:48:05,862 DEBUG rollout_worker.py:1932 -- Creating policy for agent_4
(RolloutWorker pid=46974) 2023-01-30 13:48:05,863 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46975) 2023-01-30 13:48:05,863 DEBUG rollout_worker.py:1932 -- Creating policy for agent_4
(RolloutWorker pid=46975) 2023-01-30 13:48:05,864 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46974) 2023-01-30 13:48:06,276 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:06,277 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:06,277 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46974) 2023-01-30 13:48:06,277 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46975) 2023-01-30 13:48:06,274 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:06,274 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:06,274 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46975) 2023-01-30 13:48:06,274 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46974) Model: "model_5"
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46974) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_4_wk1/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_4_wk1/Sequen [()] 0 ['tf_op_layer_agent_4_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46974) yer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_4_wk1/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46974) pLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_4_wk1/Sequen [(None,)] 0 ['tf_op_layer_agent_4_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46974) r)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_4_wk1/Sequen [(None, 1)] 0 ['tf_op_layer_agent_4_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46974) )
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_4_wk1/Sequen [(None, None)] 0 ['tf_op_layer_agent_4_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46974) ) 'tf_op_layer_agent_4_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Cast[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46974) (None, 256), 'h[0][0]',
(RolloutWorker pid=46974) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46974) 'tf_op_layer_agent_4_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Less[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) Total params: 535,817
(RolloutWorker pid=46974) Trainable params: 535,817
(RolloutWorker pid=46974) Non-trainable params: 0
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Model: "model_5"
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_4_wk2/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_4_wk2/Sequen [()] 0 ['tf_op_layer_agent_4_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46975) yer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_4_wk2/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46975) pLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_4_wk2/Sequen [(None,)] 0 ['tf_op_layer_agent_4_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46975) r)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_4_wk2/Sequen [(None, 1)] 0 ['tf_op_layer_agent_4_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46975) )
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_4_wk2/Sequen [(None, None)] 0 ['tf_op_layer_agent_4_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46975) ) 'tf_op_layer_agent_4_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Cast[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46975) (None, 256), 'h[0][0]',
(RolloutWorker pid=46975) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46975) 'tf_op_layer_agent_4_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Less[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) Total params: 535,817
(RolloutWorker pid=46975) Trainable params: 535,817
(RolloutWorker pid=46975) Non-trainable params: 0
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46974) 2023-01-30 13:48:06,361 INFO policy.py:1147 -- Policy (worker=1) running on CPU.
(RolloutWorker pid=46974) 2023-01-30 13:48:06,361 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46974) 2023-01-30 13:48:06,423 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:06,424 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:06,424 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:06,424 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:06,424 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46975) 2023-01-30 13:48:06,359 INFO policy.py:1147 -- Policy (worker=2) running on CPU.
(RolloutWorker pid=46975) 2023-01-30 13:48:06,359 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46975) 2023-01-30 13:48:06,423 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:06,423 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:06,424 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:06,424 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:06,424 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46974) 2023-01-30 13:48:07,247 DEBUG rollout_worker.py:1932 -- Creating policy for agent_5
(RolloutWorker pid=46974) 2023-01-30 13:48:07,248 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46975) 2023-01-30 13:48:07,253 DEBUG rollout_worker.py:1932 -- Creating policy for agent_5
(RolloutWorker pid=46975) 2023-01-30 13:48:07,255 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46974) 2023-01-30 13:48:07,529 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:07,529 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:07,529 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46974) 2023-01-30 13:48:07,529 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46975) 2023-01-30 13:48:07,532 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:07,532 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:07,532 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46975) 2023-01-30 13:48:07,532 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46974) Model: "model_5"
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46974) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_5_wk1/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_5_wk1/Sequen [()] 0 ['tf_op_layer_agent_5_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46974) yer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_5_wk1/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46974) pLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_5_wk1/Sequen [(None,)] 0 ['tf_op_layer_agent_5_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46974) r)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_5_wk1/Sequen [(None, 1)] 0 ['tf_op_layer_agent_5_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46974) )
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_5_wk1/Sequen [(None, None)] 0 ['tf_op_layer_agent_5_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46974) ) 'tf_op_layer_agent_5_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Cast[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46974) (None, 256), 'h[0][0]',
(RolloutWorker pid=46974) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46974) 'tf_op_layer_agent_5_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Less[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) Total params: 535,817
(RolloutWorker pid=46974) Trainable params: 535,817
(RolloutWorker pid=46974) Non-trainable params: 0
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Model: "model_5"
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_5_wk2/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_5_wk2/Sequen [()] 0 ['tf_op_layer_agent_5_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46975) yer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_5_wk2/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46975) pLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_5_wk2/Sequen [(None,)] 0 ['tf_op_layer_agent_5_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46975) r)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_5_wk2/Sequen [(None, 1)] 0 ['tf_op_layer_agent_5_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46975) )
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_5_wk2/Sequen [(None, None)] 0 ['tf_op_layer_agent_5_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46975) ) 'tf_op_layer_agent_5_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Cast[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46975) (None, 256), 'h[0][0]',
(RolloutWorker pid=46975) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46975) 'tf_op_layer_agent_5_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Less[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) Total params: 535,817
(RolloutWorker pid=46975) Trainable params: 535,817
(RolloutWorker pid=46975) Non-trainable params: 0
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46974) 2023-01-30 13:48:07,614 INFO policy.py:1147 -- Policy (worker=1) running on CPU.
(RolloutWorker pid=46974) 2023-01-30 13:48:07,614 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46974) 2023-01-30 13:48:07,675 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:07,676 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:07,676 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:07,676 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:07,676 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46975) 2023-01-30 13:48:07,616 INFO policy.py:1147 -- Policy (worker=2) running on CPU.
(RolloutWorker pid=46975) 2023-01-30 13:48:07,616 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46975) 2023-01-30 13:48:07,676 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:07,676 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:07,676 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:07,677 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:07,677 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46974) 2023-01-30 13:48:08,505 DEBUG rollout_worker.py:1932 -- Creating policy for agent_6
(RolloutWorker pid=46974) 2023-01-30 13:48:08,507 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46975) 2023-01-30 13:48:08,498 DEBUG rollout_worker.py:1932 -- Creating policy for agent_6
(RolloutWorker pid=46975) 2023-01-30 13:48:08,500 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 2, 'inter_op_parallelism_threads': 2, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46974) 2023-01-30 13:48:08,784 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:08,785 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46974) 2023-01-30 13:48:08,785 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46974) 2023-01-30 13:48:08,785 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46975) 2023-01-30 13:48:08,784 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:08,784 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(RolloutWorker pid=46975) 2023-01-30 13:48:08,784 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(RolloutWorker pid=46975) 2023-01-30 13:48:08,784 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(RolloutWorker pid=46974) Model: "model_5"
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46974) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_6_wk1/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_6_wk1/Sequen [()] 0 ['tf_op_layer_agent_6_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46974) yer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_6_wk1/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46974) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46974) pLayer)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_6_wk1/Sequen [(None,)] 0 ['tf_op_layer_agent_6_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46974) r)
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_6_wk1/Sequen [(None, 1)] 0 ['tf_op_layer_agent_6_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46974) )
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) tf_op_layer_agent_6_wk1/Sequen [(None, None)] 0 ['tf_op_layer_agent_6_wk1/Sequenc
(RolloutWorker pid=46974) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46974) ) 'tf_op_layer_agent_6_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Cast[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46974) (None, 256), 'h[0][0]',
(RolloutWorker pid=46974) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46974) 'tf_op_layer_agent_6_wk1/Sequenc
(RolloutWorker pid=46974) eMask/Less[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) ==================================================================================================
(RolloutWorker pid=46974) Total params: 535,817
(RolloutWorker pid=46974) Trainable params: 535,817
(RolloutWorker pid=46974) Non-trainable params: 0
(RolloutWorker pid=46974) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Model: "model_5"
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46975) Layer (type) Output Shape Param # Connected to
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) seq_in (InputLayer) [(None,)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_6_wk2/Sequen [()] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/Max (TensorFlowOpLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_6_wk2/Sequen [()] 0 ['tf_op_layer_agent_6_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Maximum (TensorFlowOpLa eMask/Max[0][0]']
(RolloutWorker pid=46975) yer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_6_wk2/Sequen [(None, 1)] 0 ['seq_in[0][0]']
(RolloutWorker pid=46975) ceMask/ExpandDims (TensorFlowO
(RolloutWorker pid=46975) pLayer)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_6_wk2/Sequen [(None,)] 0 ['tf_op_layer_agent_6_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Range (TensorFlowOpLaye eMask/Maximum[0][0]']
(RolloutWorker pid=46975) r)
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_6_wk2/Sequen [(None, 1)] 0 ['tf_op_layer_agent_6_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Cast (TensorFlowOpLayer eMask/ExpandDims[0][0]']
(RolloutWorker pid=46975) )
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) inputs (InputLayer) [(None, None, 264)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) h (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) c (InputLayer) [(None, 256)] 0 []
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) tf_op_layer_agent_6_wk2/Sequen [(None, None)] 0 ['tf_op_layer_agent_6_wk2/Sequenc
(RolloutWorker pid=46975) ceMask/Less (TensorFlowOpLayer eMask/Range[0][0]',
(RolloutWorker pid=46975) ) 'tf_op_layer_agent_6_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Cast[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(RolloutWorker pid=46975) (None, 256), 'h[0][0]',
(RolloutWorker pid=46975) (None, 256)] 'c[0][0]',
(RolloutWorker pid=46975) 'tf_op_layer_agent_6_wk2/Sequenc
(RolloutWorker pid=46975) eMask/Less[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(RolloutWorker pid=46975)
(RolloutWorker pid=46975) ==================================================================================================
(RolloutWorker pid=46975) Total params: 535,817
(RolloutWorker pid=46975) Trainable params: 535,817
(RolloutWorker pid=46975) Non-trainable params: 0
(RolloutWorker pid=46975) __________________________________________________________________________________________________
(RolloutWorker pid=46974) 2023-01-30 13:48:08,869 INFO policy.py:1147 -- Policy (worker=1) running on CPU.
(RolloutWorker pid=46974) 2023-01-30 13:48:08,870 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46974) 2023-01-30 13:48:08,931 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:08,931 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:08,932 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:08,932 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46974) 2023-01-30 13:48:08,932 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(RolloutWorker pid=46975) 2023-01-30 13:48:08,868 INFO policy.py:1147 -- Policy (worker=2) running on CPU.
(RolloutWorker pid=46975) 2023-01-30 13:48:08,868 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(RolloutWorker pid=46975) 2023-01-30 13:48:08,929 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:08,929 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:08,929 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:08,930 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(RolloutWorker pid=46975) 2023-01-30 13:48:08,930 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(PPO pid=46962) 2023-01-30 13:48:09,770 INFO worker_set.py:309 -- Inferred observation/action spaces from remote worker (local worker has no env): {'agent_3': (Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), Discrete(8)), 'agent_0': (Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), Discrete(8)), 'agent_4': (Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), Discrete(8)), 'agent_2': (Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), Discrete(8)), 'agent_5': (Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), Discrete(8)), 'agent_6': (Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), Discrete(8)), 'agent_1': (Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), Discrete(8)), '__env__': (Dict(player_0:Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), player_1:Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), player_2:Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), player_3:Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), player_4:Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), player_5:Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)), player_6:Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8))), Dict(player_0:Discrete(8), player_1:Discrete(8), player_2:Discrete(8), player_3:Discrete(8), player_4:Discrete(8), player_5:Discrete(8), player_6:Discrete(8)))}
(PPO pid=46962) 2023-01-30 13:48:09,776 DEBUG rollout_worker.py:1932 -- Creating policy for agent_0
(PPO pid=46962) 2023-01-30 13:48:09,777 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:09,778 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:09,778 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:09,778 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) 2023-01-30 13:48:09,779 DEBUG catalog.py:813 -- Created preprocessor <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x1738c1270>: Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)) -> (23236,)
(PPO pid=46962) 2023-01-30 13:48:09,781 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 8, 'inter_op_parallelism_threads': 8, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(RolloutWorker pid=46974) 2023-01-30 13:48:09,753 DEBUG rollout_worker.py:841 -- Created rollout worker with env <ray.rllib.env.multi_agent_env.MultiAgentEnvWrapper object at 0x31aa5b250> (<MeltingPotEnv instance>), policies {}
(RolloutWorker pid=46975) 2023-01-30 13:48:09,761 DEBUG rollout_worker.py:841 -- Created rollout worker with env <ray.rllib.env.multi_agent_env.MultiAgentEnvWrapper object at 0x31595b040> (<MeltingPotEnv instance>), policies {}
(PPO pid=46962) 2023-01-30 13:48:10,100 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:10,100 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:10,100 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:10,100 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) Model: "model_5"
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) Layer (type) Output Shape Param # Connected to
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) seq_in (InputLayer) [(None,)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_0/SequenceMa [()] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/Max (TensorFlowOpLayer)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_0/SequenceMa [()] 0 ['tf_op_layer_agent_0/SequenceMas
(PPO pid=46962) sk/Maximum (TensorFlowOpLayer) k/Max[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_0/SequenceMa [(None, 1)] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/ExpandDims (TensorFlowOpLay
(PPO pid=46962) er)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_0/SequenceMa [(None,)] 0 ['tf_op_layer_agent_0/SequenceMas
(PPO pid=46962) sk/Range (TensorFlowOpLayer) k/Maximum[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_0/SequenceMa [(None, 1)] 0 ['tf_op_layer_agent_0/SequenceMas
(PPO pid=46962) sk/Cast (TensorFlowOpLayer) k/ExpandDims[0][0]']
(PPO pid=46962)
(PPO pid=46962) inputs (InputLayer) [(None, None, 264)] 0 []
(PPO pid=46962)
(PPO pid=46962) h (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) c (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_0/SequenceMa [(None, None)] 0 ['tf_op_layer_agent_0/SequenceMas
(PPO pid=46962) sk/Less (TensorFlowOpLayer) k/Range[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_0/SequenceMas
(PPO pid=46962) k/Cast[0][0]']
(PPO pid=46962)
(PPO pid=46962) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(PPO pid=46962) (None, 256), 'h[0][0]',
(PPO pid=46962) (None, 256)] 'c[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_0/SequenceMas
(PPO pid=46962) k/Less[0][0]']
(PPO pid=46962)
(PPO pid=46962) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) Total params: 535,817
(PPO pid=46962) Trainable params: 535,817
(PPO pid=46962) Non-trainable params: 0
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) 2023-01-30 13:48:10,195 INFO policy.py:1147 -- Policy (worker=local) running on CPU.
(PPO pid=46962) 2023-01-30 13:48:10,195 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(PPO pid=46962) 2023-01-30 13:48:10,374 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:10,375 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:10,375 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:10,375 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:10,375 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(PPO pid=46962) 2023-01-30 13:48:10,486 DEBUG dynamic_tf_policy_v2.py:755 -- Initializing loss function with dummy input:
(PPO pid=46962)
(PPO pid=46962) { 'action_dist_inputs': <tf.Tensor 'agent_0/action_dist_inputs:0' shape=(?, 8) dtype=float32>,
(PPO pid=46962) 'action_logp': <tf.Tensor 'agent_0/action_logp:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'action_prob': <tf.Tensor 'agent_0/action_prob:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'actions': <tf.Tensor 'agent_0/actions:0' shape=(?,) dtype=int64>,
(PPO pid=46962) 'advantages': <tf.Tensor 'agent_0/advantages:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'agent_index': <tf.Tensor 'agent_0/agent_index:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'dones': <tf.Tensor 'agent_0/dones:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'eps_id': <tf.Tensor 'agent_0/eps_id:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'new_obs': <tf.Tensor 'agent_0/new_obs:0' shape=(?, 23236) dtype=float32>,
(PPO pid=46962) 'obs': <tf.Tensor 'agent_0/Placeholder:0' shape=(?, 23236) dtype=float32>,
(PPO pid=46962) 'prev_actions': <tf.Tensor 'agent_0/prev_actions:0' shape=(?,) dtype=int64>,
(PPO pid=46962) 'prev_rewards': <tf.Tensor 'agent_0/prev_rewards:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'rewards': <tf.Tensor 'agent_0/rewards:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'seq_lens': <tf.Tensor 'agent_0/seq_lens:0' shape=(?,) dtype=int32>,
(PPO pid=46962) 'state_in_0': <tf.Tensor 'agent_0/state_in_0:0' shape=(?, 256) dtype=float32>,
(PPO pid=46962) 'state_in_1': <tf.Tensor 'agent_0/state_in_1:0' shape=(?, 256) dtype=float32>,
(PPO pid=46962) 't': <tf.Tensor 'agent_0/t:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'unroll_id': <tf.Tensor 'agent_0/unroll_id:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'value_targets': <tf.Tensor 'agent_0/value_targets:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'vf_preds': <tf.Tensor 'agent_0/vf_preds:0' shape=(?,) dtype=float32>}
(PPO pid=46962)
(PPO pid=46962) 2023-01-30 13:48:10,966 DEBUG tf_policy.py:783 -- These tensors were used in the loss functions:
(PPO pid=46962) { 'action_dist_inputs': <tf.Tensor 'agent_0/action_dist_inputs:0' shape=(?, 8) dtype=float32>,
(PPO pid=46962) 'action_logp': <tf.Tensor 'agent_0/action_logp:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'action_prob': <tf.Tensor 'agent_0/action_prob:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'actions': <tf.Tensor 'agent_0/actions:0' shape=(?,) dtype=int64>,
(PPO pid=46962) 'advantages': <tf.Tensor 'agent_0/advantages:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'dones': <tf.Tensor 'agent_0/dones:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'new_obs': <tf.Tensor 'agent_0/new_obs:0' shape=(?, 23236) dtype=float32>,
(PPO pid=46962) 'obs': <tf.Tensor 'agent_0/Placeholder:0' shape=(?, 23236) dtype=float32>,
(PPO pid=46962) 'prev_actions': <tf.Tensor 'agent_0/prev_actions:0' shape=(?,) dtype=int64>,
(PPO pid=46962) 'rewards': <tf.Tensor 'agent_0/rewards:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'seq_lens': <tf.Tensor 'agent_0/seq_lens:0' shape=(?,) dtype=int32>,
(PPO pid=46962) 'state_in_0': <tf.Tensor 'agent_0/state_in_0:0' shape=(?, 256) dtype=float32>,
(PPO pid=46962) 'state_in_1': <tf.Tensor 'agent_0/state_in_1:0' shape=(?, 256) dtype=float32>,
(PPO pid=46962) 'value_targets': <tf.Tensor 'agent_0/value_targets:0' shape=(?,) dtype=float32>,
(PPO pid=46962) 'vf_preds': <tf.Tensor 'agent_0/vf_preds:0' shape=(?,) dtype=float32>}
(PPO pid=46962)
(PPO pid=46962) 2023-01-30 13:48:11,221 DEBUG rollout_worker.py:1932 -- Creating policy for agent_1
(PPO pid=46962) 2023-01-30 13:48:11,222 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:11,222 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:11,223 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:11,223 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) 2023-01-30 13:48:11,224 DEBUG catalog.py:813 -- Created preprocessor <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x1738c1510>: Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)) -> (23236,)
(PPO pid=46962) 2023-01-30 13:48:11,224 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 8, 'inter_op_parallelism_threads': 8, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(PPO pid=46962) 2023-01-30 13:48:11,502 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:11,502 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:11,502 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:11,502 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) Model: "model_5"
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) Layer (type) Output Shape Param # Connected to
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) seq_in (InputLayer) [(None,)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_1/SequenceMa [()] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/Max (TensorFlowOpLayer)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_1/SequenceMa [()] 0 ['tf_op_layer_agent_1/SequenceMas
(PPO pid=46962) sk/Maximum (TensorFlowOpLayer) k/Max[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_1/SequenceMa [(None, 1)] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/ExpandDims (TensorFlowOpLay
(PPO pid=46962) er)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_1/SequenceMa [(None,)] 0 ['tf_op_layer_agent_1/SequenceMas
(PPO pid=46962) sk/Range (TensorFlowOpLayer) k/Maximum[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_1/SequenceMa [(None, 1)] 0 ['tf_op_layer_agent_1/SequenceMas
(PPO pid=46962) sk/Cast (TensorFlowOpLayer) k/ExpandDims[0][0]']
(PPO pid=46962)
(PPO pid=46962) inputs (InputLayer) [(None, None, 264)] 0 []
(PPO pid=46962)
(PPO pid=46962) h (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) c (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_1/SequenceMa [(None, None)] 0 ['tf_op_layer_agent_1/SequenceMas
(PPO pid=46962) sk/Less (TensorFlowOpLayer) k/Range[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_1/SequenceMas
(PPO pid=46962) k/Cast[0][0]']
(PPO pid=46962)
(PPO pid=46962) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(PPO pid=46962) (None, 256), 'h[0][0]',
(PPO pid=46962) (None, 256)] 'c[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_1/SequenceMas
(PPO pid=46962) k/Less[0][0]']
(PPO pid=46962)
(PPO pid=46962) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) Total params: 535,817
(PPO pid=46962) Trainable params: 535,817
(PPO pid=46962) Non-trainable params: 0
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) 2023-01-30 13:48:11,586 INFO policy.py:1147 -- Policy (worker=local) running on CPU.
(PPO pid=46962) 2023-01-30 13:48:11,586 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(PPO pid=46962) 2023-01-30 13:48:11,648 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:11,648 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:11,648 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:11,649 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:11,649 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(PPO pid=46962) 2023-01-30 13:48:12,489 DEBUG rollout_worker.py:1932 -- Creating policy for agent_2
(PPO pid=46962) 2023-01-30 13:48:12,490 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:12,490 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:12,490 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:12,490 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) 2023-01-30 13:48:12,491 DEBUG catalog.py:813 -- Created preprocessor <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x17382b520>: Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)) -> (23236,)
(PPO pid=46962) 2023-01-30 13:48:12,492 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 8, 'inter_op_parallelism_threads': 8, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(PPO pid=46962) 2023-01-30 13:48:12,770 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:12,770 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:12,770 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:12,770 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) Model: "model_5"
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) Layer (type) Output Shape Param # Connected to
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) seq_in (InputLayer) [(None,)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_2/SequenceMa [()] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/Max (TensorFlowOpLayer)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_2/SequenceMa [()] 0 ['tf_op_layer_agent_2/SequenceMas
(PPO pid=46962) sk/Maximum (TensorFlowOpLayer) k/Max[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_2/SequenceMa [(None, 1)] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/ExpandDims (TensorFlowOpLay
(PPO pid=46962) er)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_2/SequenceMa [(None,)] 0 ['tf_op_layer_agent_2/SequenceMas
(PPO pid=46962) sk/Range (TensorFlowOpLayer) k/Maximum[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_2/SequenceMa [(None, 1)] 0 ['tf_op_layer_agent_2/SequenceMas
(PPO pid=46962) sk/Cast (TensorFlowOpLayer) k/ExpandDims[0][0]']
(PPO pid=46962)
(PPO pid=46962) inputs (InputLayer) [(None, None, 264)] 0 []
(PPO pid=46962)
(PPO pid=46962) h (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) c (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_2/SequenceMa [(None, None)] 0 ['tf_op_layer_agent_2/SequenceMas
(PPO pid=46962) sk/Less (TensorFlowOpLayer) k/Range[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_2/SequenceMas
(PPO pid=46962) k/Cast[0][0]']
(PPO pid=46962)
(PPO pid=46962) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(PPO pid=46962) (None, 256), 'h[0][0]',
(PPO pid=46962) (None, 256)] 'c[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_2/SequenceMas
(PPO pid=46962) k/Less[0][0]']
(PPO pid=46962)
(PPO pid=46962) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) Total params: 535,817
(PPO pid=46962) Trainable params: 535,817
(PPO pid=46962) Non-trainable params: 0
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) 2023-01-30 13:48:12,855 INFO policy.py:1147 -- Policy (worker=local) running on CPU.
(PPO pid=46962) 2023-01-30 13:48:12,856 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(PPO pid=46962) 2023-01-30 13:48:12,917 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:12,918 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:12,918 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:12,918 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:12,918 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(PPO pid=46962) 2023-01-30 13:48:13,759 DEBUG rollout_worker.py:1932 -- Creating policy for agent_3
(PPO pid=46962) 2023-01-30 13:48:13,760 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:13,760 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:13,760 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:13,761 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) 2023-01-30 13:48:13,762 DEBUG catalog.py:813 -- Created preprocessor <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x176753cd0>: Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)) -> (23236,)
(PPO pid=46962) 2023-01-30 13:48:13,762 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 8, 'inter_op_parallelism_threads': 8, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(PPO pid=46962) 2023-01-30 13:48:14,042 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:14,042 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:14,042 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:14,042 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) Model: "model_5"
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) Layer (type) Output Shape Param # Connected to
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) seq_in (InputLayer) [(None,)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_3/SequenceMa [()] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/Max (TensorFlowOpLayer)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_3/SequenceMa [()] 0 ['tf_op_layer_agent_3/SequenceMas
(PPO pid=46962) sk/Maximum (TensorFlowOpLayer) k/Max[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_3/SequenceMa [(None, 1)] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/ExpandDims (TensorFlowOpLay
(PPO pid=46962) er)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_3/SequenceMa [(None,)] 0 ['tf_op_layer_agent_3/SequenceMas
(PPO pid=46962) sk/Range (TensorFlowOpLayer) k/Maximum[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_3/SequenceMa [(None, 1)] 0 ['tf_op_layer_agent_3/SequenceMas
(PPO pid=46962) sk/Cast (TensorFlowOpLayer) k/ExpandDims[0][0]']
(PPO pid=46962)
(PPO pid=46962) inputs (InputLayer) [(None, None, 264)] 0 []
(PPO pid=46962)
(PPO pid=46962) h (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) c (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_3/SequenceMa [(None, None)] 0 ['tf_op_layer_agent_3/SequenceMas
(PPO pid=46962) sk/Less (TensorFlowOpLayer) k/Range[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_3/SequenceMas
(PPO pid=46962) k/Cast[0][0]']
(PPO pid=46962)
(PPO pid=46962) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(PPO pid=46962) (None, 256), 'h[0][0]',
(PPO pid=46962) (None, 256)] 'c[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_3/SequenceMas
(PPO pid=46962) k/Less[0][0]']
(PPO pid=46962)
(PPO pid=46962) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) Total params: 535,817
(PPO pid=46962) Trainable params: 535,817
(PPO pid=46962) Non-trainable params: 0
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) 2023-01-30 13:48:14,126 INFO policy.py:1147 -- Policy (worker=local) running on CPU.
(PPO pid=46962) 2023-01-30 13:48:14,126 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(PPO pid=46962) 2023-01-30 13:48:14,188 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:14,189 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:14,189 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:14,189 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:14,189 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(PPO pid=46962) 2023-01-30 13:48:15,020 DEBUG rollout_worker.py:1932 -- Creating policy for agent_4
(PPO pid=46962) 2023-01-30 13:48:15,021 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:15,021 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:15,021 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:15,021 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) 2023-01-30 13:48:15,022 DEBUG catalog.py:813 -- Created preprocessor <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x3114fe950>: Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)) -> (23236,)
(PPO pid=46962) 2023-01-30 13:48:15,023 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 8, 'inter_op_parallelism_threads': 8, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(PPO pid=46962) 2023-01-30 13:48:15,439 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:15,439 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:15,439 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:15,439 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) Model: "model_5"
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) Layer (type) Output Shape Param # Connected to
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) seq_in (InputLayer) [(None,)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_4/SequenceMa [()] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/Max (TensorFlowOpLayer)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_4/SequenceMa [()] 0 ['tf_op_layer_agent_4/SequenceMas
(PPO pid=46962) sk/Maximum (TensorFlowOpLayer) k/Max[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_4/SequenceMa [(None, 1)] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/ExpandDims (TensorFlowOpLay
(PPO pid=46962) er)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_4/SequenceMa [(None,)] 0 ['tf_op_layer_agent_4/SequenceMas
(PPO pid=46962) sk/Range (TensorFlowOpLayer) k/Maximum[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_4/SequenceMa [(None, 1)] 0 ['tf_op_layer_agent_4/SequenceMas
(PPO pid=46962) sk/Cast (TensorFlowOpLayer) k/ExpandDims[0][0]']
(PPO pid=46962)
(PPO pid=46962) inputs (InputLayer) [(None, None, 264)] 0 []
(PPO pid=46962)
(PPO pid=46962) h (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) c (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_4/SequenceMa [(None, None)] 0 ['tf_op_layer_agent_4/SequenceMas
(PPO pid=46962) sk/Less (TensorFlowOpLayer) k/Range[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_4/SequenceMas
(PPO pid=46962) k/Cast[0][0]']
(PPO pid=46962)
(PPO pid=46962) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(PPO pid=46962) (None, 256), 'h[0][0]',
(PPO pid=46962) (None, 256)] 'c[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_4/SequenceMas
(PPO pid=46962) k/Less[0][0]']
(PPO pid=46962)
(PPO pid=46962) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) Total params: 535,817
(PPO pid=46962) Trainable params: 535,817
(PPO pid=46962) Non-trainable params: 0
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) 2023-01-30 13:48:15,526 INFO policy.py:1147 -- Policy (worker=local) running on CPU.
(PPO pid=46962) 2023-01-30 13:48:15,526 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(PPO pid=46962) 2023-01-30 13:48:15,588 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:15,588 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:15,589 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:15,589 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:15,589 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(PPO pid=46962) 2023-01-30 13:48:16,429 DEBUG rollout_worker.py:1932 -- Creating policy for agent_5
(PPO pid=46962) 2023-01-30 13:48:16,430 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:16,430 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:16,430 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:16,430 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) 2023-01-30 13:48:16,431 DEBUG catalog.py:813 -- Created preprocessor <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x313046590>: Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)) -> (23236,)
(PPO pid=46962) 2023-01-30 13:48:16,432 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 8, 'inter_op_parallelism_threads': 8, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(PPO pid=46962) 2023-01-30 13:48:16,712 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:16,712 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:16,712 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:16,712 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) Model: "model_5"
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) Layer (type) Output Shape Param # Connected to
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) seq_in (InputLayer) [(None,)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_5/SequenceMa [()] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/Max (TensorFlowOpLayer)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_5/SequenceMa [()] 0 ['tf_op_layer_agent_5/SequenceMas
(PPO pid=46962) sk/Maximum (TensorFlowOpLayer) k/Max[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_5/SequenceMa [(None, 1)] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/ExpandDims (TensorFlowOpLay
(PPO pid=46962) er)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_5/SequenceMa [(None,)] 0 ['tf_op_layer_agent_5/SequenceMas
(PPO pid=46962) sk/Range (TensorFlowOpLayer) k/Maximum[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_5/SequenceMa [(None, 1)] 0 ['tf_op_layer_agent_5/SequenceMas
(PPO pid=46962) sk/Cast (TensorFlowOpLayer) k/ExpandDims[0][0]']
(PPO pid=46962)
(PPO pid=46962) inputs (InputLayer) [(None, None, 264)] 0 []
(PPO pid=46962)
(PPO pid=46962) h (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) c (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_5/SequenceMa [(None, None)] 0 ['tf_op_layer_agent_5/SequenceMas
(PPO pid=46962) sk/Less (TensorFlowOpLayer) k/Range[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_5/SequenceMas
(PPO pid=46962) k/Cast[0][0]']
(PPO pid=46962)
(PPO pid=46962) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(PPO pid=46962) (None, 256), 'h[0][0]',
(PPO pid=46962) (None, 256)] 'c[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_5/SequenceMas
(PPO pid=46962) k/Less[0][0]']
(PPO pid=46962)
(PPO pid=46962) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) Total params: 535,817
(PPO pid=46962) Trainable params: 535,817
(PPO pid=46962) Non-trainable params: 0
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) 2023-01-30 13:48:16,800 INFO policy.py:1147 -- Policy (worker=local) running on CPU.
(PPO pid=46962) 2023-01-30 13:48:16,800 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(PPO pid=46962) 2023-01-30 13:48:16,861 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:16,862 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:16,862 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:16,862 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:16,862 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(PPO pid=46962) 2023-01-30 13:48:17,701 DEBUG rollout_worker.py:1932 -- Creating policy for agent_6
(PPO pid=46962) 2023-01-30 13:48:17,702 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:17,702 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:17,702 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:17,702 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) 2023-01-30 13:48:17,703 DEBUG catalog.py:813 -- Created preprocessor <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x3146ec430>: Dict(ORIENTATION:Box(-2147483648, 2147483647, (), int32), POSITION:Box(-2147483648, 2147483647, (2,), int32), READY_TO_SHOOT:Box(-inf, inf, (), float64), RGB:Box(0, 255, (88, 88, 3), uint8)) -> (23236,)
(PPO pid=46962) 2023-01-30 13:48:17,704 DEBUG worker_set.py:938 -- Creating TF session {'intra_op_parallelism_threads': 8, 'inter_op_parallelism_threads': 8, 'gpu_options': {'allow_growth': True}, 'log_device_placement': False, 'device_count': {'CPU': 1}, 'allow_soft_placement': True}
(PPO pid=46962) 2023-01-30 13:48:17,986 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (), int32)
(PPO pid=46962) 2023-01-30 13:48:17,986 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-2147483648, 2147483647, (2,), int32)
(PPO pid=46962) 2023-01-30 13:48:17,986 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(-inf, inf, (), float64)
(PPO pid=46962) 2023-01-30 13:48:17,986 DEBUG preprocessors.py:272 -- Creating sub-preprocessor for Box(0, 255, (88, 88, 3), uint8)
(PPO pid=46962) 2023-01-30 13:48:18,069 INFO policy.py:1147 -- Policy (worker=local) running on CPU.
(PPO pid=46962) 2023-01-30 13:48:18,069 INFO tf_policy.py:171 -- Found 0 visible cuda devices.
(PPO pid=46962) Model: "model_5"
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) Layer (type) Output Shape Param # Connected to
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) seq_in (InputLayer) [(None,)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_6/SequenceMa [()] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/Max (TensorFlowOpLayer)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_6/SequenceMa [()] 0 ['tf_op_layer_agent_6/SequenceMas
(PPO pid=46962) sk/Maximum (TensorFlowOpLayer) k/Max[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_6/SequenceMa [(None, 1)] 0 ['seq_in[0][0]']
(PPO pid=46962) sk/ExpandDims (TensorFlowOpLay
(PPO pid=46962) er)
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_6/SequenceMa [(None,)] 0 ['tf_op_layer_agent_6/SequenceMas
(PPO pid=46962) sk/Range (TensorFlowOpLayer) k/Maximum[0][0]']
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_6/SequenceMa [(None, 1)] 0 ['tf_op_layer_agent_6/SequenceMas
(PPO pid=46962) sk/Cast (TensorFlowOpLayer) k/ExpandDims[0][0]']
(PPO pid=46962)
(PPO pid=46962) inputs (InputLayer) [(None, None, 264)] 0 []
(PPO pid=46962)
(PPO pid=46962) h (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) c (InputLayer) [(None, 256)] 0 []
(PPO pid=46962)
(PPO pid=46962) tf_op_layer_agent_6/SequenceMa [(None, None)] 0 ['tf_op_layer_agent_6/SequenceMas
(PPO pid=46962) sk/Less (TensorFlowOpLayer) k/Range[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_6/SequenceMas
(PPO pid=46962) k/Cast[0][0]']
(PPO pid=46962)
(PPO pid=46962) lstm (LSTM) [(None, None, 256), 533504 ['inputs[0][0]',
(PPO pid=46962) (None, 256), 'h[0][0]',
(PPO pid=46962) (None, 256)] 'c[0][0]',
(PPO pid=46962) 'tf_op_layer_agent_6/SequenceMas
(PPO pid=46962) k/Less[0][0]']
(PPO pid=46962)
(PPO pid=46962) logits (Dense) (None, None, 8) 2056 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) values (Dense) (None, None, 1) 257 ['lstm[0][0]']
(PPO pid=46962)
(PPO pid=46962) ==================================================================================================
(PPO pid=46962) Total params: 535,817
(PPO pid=46962) Trainable params: 535,817
(PPO pid=46962) Non-trainable params: 0
(PPO pid=46962) __________________________________________________________________________________________________
(PPO pid=46962) 2023-01-30 13:48:18,130 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_prob` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:18,131 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_logp` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:18,131 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `action_dist_inputs` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:18,131 INFO dynamic_tf_policy_v2.py:709 -- Adding extra-action-fetch `vf_preds` to view-reqs.
(PPO pid=46962) 2023-01-30 13:48:18,131 INFO dynamic_tf_policy_v2.py:721 -- Testing `postprocess_trajectory` w/ dummy batch.
(PPO pid=46962) 2023-01-30 13:48:18,961 INFO rollout_worker.py:2004 -- Built policy map: {}
(PPO pid=46962) 2023-01-30 13:48:18,961 INFO rollout_worker.py:2005 -- Built preprocessor map: {'agent_0': <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x1738c1270>, 'agent_1': <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x1738c1510>, 'agent_2': <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x17382b520>, 'agent_3': <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x176753cd0>, 'agent_4': <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x3114fe950>, 'agent_5': <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x313046590>, 'agent_6': <ray.rllib.models.preprocessors.DictFlatteningPreprocessor object at 0x3146ec430>}
(PPO pid=46962) 2023-01-30 13:48:18,961 INFO rollout_worker.py:740 -- Built filter map: defaultdict(<class 'ray.rllib.utils.filter.NoFilter'>, {'agent_0': <ray.rllib.utils.filter.NoFilter object at 0x1738e41c0>, 'agent_1': <ray.rllib.utils.filter.NoFilter object at 0x3082d7f10>, 'agent_2': <ray.rllib.utils.filter.NoFilter object at 0x3128d83a0>, 'agent_3': <ray.rllib.utils.filter.NoFilter object at 0x313dc0940>, 'agent_4': <ray.rllib.utils.filter.NoFilter object at 0x31411e800>, 'agent_5': <ray.rllib.utils.filter.NoFilter object at 0x316d814b0>, 'agent_6': <ray.rllib.utils.filter.NoFilter object at 0x318269ae0>})
(PPO pid=46962) 2023-01-30 13:48:18,961 DEBUG rollout_worker.py:841 -- Created rollout worker with env None (None), policies {}
== Status ==
Current time: 2023-01-30 13:48:19 (running for 00:00:26.73)
Memory usage on this node: 11.4/16.0 GiB
Using FIFO scheduling algorithm.
Resources requested: 3.0/8 CPUs, 0/0 GPUs, 0.0/6.42 GiB heap, 0.0/2.0 GiB objects
Result logdir: /Users/nell/ray_results/PPO
Number of trials: 1/1 (1 RUNNING)
(PPO pid=46962) 2023-01-30 13:48:19,170 INFO algorithm_config.py:2503 -- Your framework setting is 'tf', meaning you are using static-graph mode. Set framework='tf2' to enable eager execution with tf2.x. You may also then want to set eager_tracing=True in order to reach similar execution speed as with static-graph mode.
(PPO pid=46962) 2023-01-30 13:48:19,170 INFO trainable.py:172 -- Trainable.setup took 22.915 seconds. If your trainable is slow to initialize, consider setting reuse_actors=True to reduce actor creation overheads.
(PPO pid=46962) 2023-01-30 13:48:19,171 WARNING util.py:66 -- Install gputil for GPU system monitoring.
(RolloutWorker pid=46974) 2023-01-30 13:48:19,400 INFO rollout_worker.py:894 -- Generating sample batch of size 100
(RolloutWorker pid=46974) 2023-01-30 13:48:19,400 DEBUG sampler.py:631 -- No episode horizon specified, assuming inf.
(RolloutWorker pid=46975) 2023-01-30 13:48:19,424 DEBUG sampler.py:631 -- No episode horizon specified, assuming inf.
(RolloutWorker pid=46974) 2023-01-30 13:48:19,645 INFO sampler.py:664 -- Raw obs from env: { 0: { 'player_0': { 'ORIENTATION': np.ndarray((), dtype=int32, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=7.0, max=7.0, mean=7.0),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=37.0, max=255.0, mean=161.841)},
(RolloutWorker pid=46974) 'player_1': { 'ORIENTATION': np.ndarray((), dtype=int32, min=2.0, max=2.0, mean=2.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=7.0, max=16.0, mean=11.5),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=167.564)},
(RolloutWorker pid=46974) 'player_2': { 'ORIENTATION': np.ndarray((), dtype=int32, min=2.0, max=2.0, mean=2.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=14.0, max=19.0, mean=16.5),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=67.101)},
(RolloutWorker pid=46974) 'player_3': { 'ORIENTATION': np.ndarray((), dtype=int32, min=0.0, max=0.0, mean=0.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=7.0, max=16.0, mean=11.5),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=161.451)},
(RolloutWorker pid=46974) 'player_4': { 'ORIENTATION': np.ndarray((), dtype=int32, min=3.0, max=3.0, mean=3.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=14.0, max=14.0, mean=14.0),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=131.464)},
(RolloutWorker pid=46974) 'player_5': { 'ORIENTATION': np.ndarray((), dtype=int32, min=3.0, max=3.0, mean=3.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=10.0, max=16.0, mean=13.0),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=104.322)},
(RolloutWorker pid=46974) 'player_6': { 'ORIENTATION': np.ndarray((), dtype=int32, min=0.0, max=0.0, mean=0.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=3.0, max=14.0, mean=8.5),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=136.955)}}}
(RolloutWorker pid=46974) 2023-01-30 13:48:19,646 INFO sampler.py:665 -- Info return from env: {0: {}}
(RolloutWorker pid=46974) 2023-01-30 13:48:19,646 WARNING deprecation.py:47 -- DeprecationWarning: `policy_mapping_fn(agent_id)` has been deprecated. Use `policy_mapping_fn(agent_id, episode, worker, **kwargs)` instead. This will raise an error in the future!
(RolloutWorker pid=46974) 2023-01-30 13:48:19,646 INFO sampler.py:929 -- Filtered obs: { 'ORIENTATION': np.ndarray((), dtype=int32, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=7.0, max=7.0, mean=7.0),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=37.0, max=255.0, mean=161.841)}
(RolloutWorker pid=46974) 2023-01-30 13:48:19,646 WARNING agent_collector.py:155 -- Provided tensor
(RolloutWorker pid=46974) {'READY_TO_SHOOT': array(1.), 'ORIENTATION': array(1, dtype=int32), 'POSITION': array([7, 7], dtype=int32), 'RGB': array([[[158, 194, 101],
(RolloutWorker pid=46974) [158, 194, 101],
(RolloutWorker pid=46974) [158, 194, 101],
(RolloutWorker pid=46974) ...,
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) [220, 205, 185]],
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) [[158, 194, 101],
(RolloutWorker pid=46974) [158, 194, 101],
(RolloutWorker pid=46974) [158, 194, 101],
(RolloutWorker pid=46974) ...,
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) [210, 195, 175]],
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) [[158, 194, 101],
(RolloutWorker pid=46974) [158, 194, 101],
(RolloutWorker pid=46974) [ 53, 132, 49],
(RolloutWorker pid=46974) ...,
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) [220, 205, 185]],
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) ...,
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) [[210, 195, 175],
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) ...,
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) [220, 205, 185]],
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) [[220, 205, 185],
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) ...,
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) [220, 205, 185]],
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) [[210, 195, 175],
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) ...,
(RolloutWorker pid=46974) [220, 205, 185],
(RolloutWorker pid=46974) [210, 195, 175],
(RolloutWorker pid=46974) [220, 205, 185]]], dtype=uint8)}
(RolloutWorker pid=46974) does not match space of view requirements obs.
(RolloutWorker pid=46974) Provided tensor has shape () and view requirement has shape shape None.Make sure dimensions match to resolve this warning.
(RolloutWorker pid=46974) 2023-01-30 13:48:19,648 INFO sampler.py:1187 -- Inputs to compute_actions():
(RolloutWorker pid=46974)
(RolloutWorker pid=46974) { 'agent_0': [ { 'data': { 'agent_id': 'player_0',
(RolloutWorker pid=46974) 'env_id': 0,
(RolloutWorker pid=46974) 'info': {},
(RolloutWorker pid=46974) 'obs': { 'ORIENTATION': np.ndarray((), dtype=int32, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=7.0, max=7.0, mean=7.0),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=37.0, max=255.0, mean=161.841)},
(RolloutWorker pid=46974) 'prev_action': None,
(RolloutWorker pid=46974) 'prev_reward': 0.0,
(RolloutWorker pid=46974) 'rnn_state': None},
(RolloutWorker pid=46974) 'type': '_PolicyEvalData'}],
(RolloutWorker pid=46974) 'agent_1': [ { 'data': { 'agent_id': 'player_1',
(RolloutWorker pid=46974) 'env_id': 0,
(RolloutWorker pid=46974) 'info': {},
(RolloutWorker pid=46974) 'obs': { 'ORIENTATION': np.ndarray((), dtype=int32, min=2.0, max=2.0, mean=2.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=7.0, max=16.0, mean=11.5),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=167.564)},
(RolloutWorker pid=46974) 'prev_action': None,
(RolloutWorker pid=46974) 'prev_reward': 0.0,
(RolloutWorker pid=46974) 'rnn_state': None},
(RolloutWorker pid=46974) 'type': '_PolicyEvalData'}],
(RolloutWorker pid=46974) 'agent_2': [ { 'data': { 'agent_id': 'player_2',
(RolloutWorker pid=46974) 'env_id': 0,
(RolloutWorker pid=46974) 'info': {},
(RolloutWorker pid=46974) 'obs': { 'ORIENTATION': np.ndarray((), dtype=int32, min=2.0, max=2.0, mean=2.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=14.0, max=19.0, mean=16.5),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=67.101)},
(RolloutWorker pid=46974) 'prev_action': None,
(RolloutWorker pid=46974) 'prev_reward': 0.0,
(RolloutWorker pid=46974) 'rnn_state': None},
(RolloutWorker pid=46974) 'type': '_PolicyEvalData'}],
(RolloutWorker pid=46974) 'agent_3': [ { 'data': { 'agent_id': 'player_3',
(RolloutWorker pid=46974) 'env_id': 0,
(RolloutWorker pid=46974) 'info': {},
(RolloutWorker pid=46974) 'obs': { 'ORIENTATION': np.ndarray((), dtype=int32, min=0.0, max=0.0, mean=0.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=7.0, max=16.0, mean=11.5),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=161.451)},
(RolloutWorker pid=46974) 'prev_action': None,
(RolloutWorker pid=46974) 'prev_reward': 0.0,
(RolloutWorker pid=46974) 'rnn_state': None},
(RolloutWorker pid=46974) 'type': '_PolicyEvalData'}],
(RolloutWorker pid=46974) 'agent_4': [ { 'data': { 'agent_id': 'player_4',
(RolloutWorker pid=46974) 'env_id': 0,
(RolloutWorker pid=46974) 'info': {},
(RolloutWorker pid=46974) 'obs': { 'ORIENTATION': np.ndarray((), dtype=int32, min=3.0, max=3.0, mean=3.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=14.0, max=14.0, mean=14.0),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=131.464)},
(RolloutWorker pid=46974) 'prev_action': None,
(RolloutWorker pid=46974) 'prev_reward': 0.0,
(RolloutWorker pid=46974) 'rnn_state': None},
(RolloutWorker pid=46974) 'type': '_PolicyEvalData'}],
(RolloutWorker pid=46974) 'agent_5': [ { 'data': { 'agent_id': 'player_5',
(RolloutWorker pid=46974) 'env_id': 0,
(RolloutWorker pid=46974) 'info': {},
(RolloutWorker pid=46974) 'obs': { 'ORIENTATION': np.ndarray((), dtype=int32, min=3.0, max=3.0, mean=3.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=10.0, max=16.0, mean=13.0),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=104.322)},
(RolloutWorker pid=46974) 'prev_action': None,
(RolloutWorker pid=46974) 'prev_reward': 0.0,
(RolloutWorker pid=46974) 'rnn_state': None},
(RolloutWorker pid=46974) 'type': '_PolicyEvalData'}],
(RolloutWorker pid=46974) 'agent_6': [ { 'data': { 'agent_id': 'player_6',
(RolloutWorker pid=46974) 'env_id': 0,
(RolloutWorker pid=46974) 'info': {},
(RolloutWorker pid=46974) 'obs': { 'ORIENTATION': np.ndarray((), dtype=int32, min=0.0, max=0.0, mean=0.0),
(RolloutWorker pid=46974) 'POSITION': np.ndarray((2,), dtype=int32, min=3.0, max=14.0, mean=8.5),
(RolloutWorker pid=46974) 'READY_TO_SHOOT': np.ndarray((), dtype=float64, min=1.0, max=1.0, mean=1.0),
(RolloutWorker pid=46974) 'RGB': np.ndarray((88, 88, 3), dtype=uint8, min=0.0, max=255.0, mean=136.955)},
(RolloutWorker pid=46974) 'prev_action': None,
(RolloutWorker pid=46974) 'prev_reward': 0.0,
(RolloutWorker pid=46974) 'rnn_state': None},
(RolloutWorker pid=46974) 'type': '_PolicyEvalData'}]}
(RolloutWorker pid=46974)
2023-01-30 13:48:19,693 ERROR trial_runner.py:1088 -- Trial PPO_meltingpot_4f50d_00000: Error processing event.
ray.exceptions.RayTaskError(ValueError): ray::PPO.train() (pid=46962, ip=127.0.0.1, repr=PPO)
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/tune/trainable/trainable.py", line 367, in train
raise skipped from exception_cause(skipped)
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/tune/trainable/trainable.py", line 364, in train
result = self.step()
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/algorithms/algorithm.py", line 749, in step
results, train_iter_ctx = self._run_one_training_iteration()
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/algorithms/algorithm.py", line 2623, in _run_one_training_iteration
results = self.training_step()
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/algorithms/ppo/ppo.py", line 318, in training_step
train_batch = synchronous_parallel_sample(
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/execution/rollout_ops.py", line 85, in synchronous_parallel_sample
sample_batches = worker_set.foreach_worker(
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/worker_set.py", line 696, in foreach_worker
handle_remote_call_result_errors(remote_results, self._ignore_worker_failures)
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/worker_set.py", line 73, in handle_remote_call_result_errors
raise r.get()
ray.exceptions.RayTaskError(ValueError): ray::RolloutWorker.apply() (pid=46974, ip=127.0.0.1, repr=<ray.rllib.evaluation.rollout_worker.RolloutWorker object at 0x17dd7bfd0>)
ValueError: The two structures don't have the same nested structure.
First structure: type=Tensor str=Tensor("agent_0_wk1/Placeholder:0", shape=(?, 23236), dtype=float32)
Second structure: type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'POSITION': array([[7, 7]], dtype=int32), 'RGB': array([[[[158, 194, 101],
[158, 194, 101],
[158, 194, 101],
...,
[210, 195, 175],
[210, 195, 175],
[220, 205, 185]],
[[158, 194, 101],
[158, 194, 101],
[158, 194, 101],
...,
[210, 195, 175],
[220, 205, 185],
[210, 195, 175]],
[[158, 194, 101],
[158, 194, 101],
[ 53, 132, 49],
...,
[220, 205, 185],
[220, 205, 185],
[220, 205, 185]],
...,
[[210, 195, 175],
[220, 205, 185],
[210, 195, 175],
...,
[210, 195, 175],
[210, 195, 175],
[220, 205, 185]],
[[220, 205, 185],
[210, 195, 175],
[220, 205, 185],
...,
[220, 205, 185],
[220, 205, 185],
[220, 205, 185]],
[[210, 195, 175],
[220, 205, 185],
[210, 195, 175],
...,
[220, 205, 185],
[210, 195, 175],
[220, 205, 185]]]], dtype=uint8)}
More specifically: Substructure "type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'POSITION': array([[7, 7]], dtype=int32), 'RGB': array([[[[158, 194, 101],
[158, 194, 101],
[158, 194, 101],
...,
[210, 195, 175],
[210, 195, 175],
[220, 205, 185]],
[[158, 194, 101],
[158, 194, 101],
[158, 194, 101],
...,
[210, 195, 175],
[220, 205, 185],
[210, 195, 175]],
[[158, 194, 101],
[158, 194, 101],
[ 53, 132, 49],
...,
[220, 205, 185],
[220, 205, 185],
[220, 205, 185]],
...,
[[210, 195, 175],
[220, 205, 185],
[210, 195, 175],
...,
[210, 195, 175],
[210, 195, 175],
[220, 205, 185]],
[[220, 205, 185],
[210, 195, 175],
[220, 205, 185],
...,
[220, 205, 185],
[220, 205, 185],
[220, 205, 185]],
[[210, 195, 175],
[220, 205, 185],
[210, 195, 175],
...,
[220, 205, 185],
[210, 195, 175],
[220, 205, 185]]]], dtype=uint8)}" is a sequence, while substructure "type=Tensor str=Tensor("agent_0_wk1/Placeholder:0", shape=(?, 23236), dtype=float32)" is not
During handling of the above exception, another exception occurred:
ray::RolloutWorker.apply() (pid=46974, ip=127.0.0.1, repr=<ray.rllib.evaluation.rollout_worker.RolloutWorker object at 0x17dd7bfd0>)
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/utils/actor_manager.py", line 183, in apply
raise e
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/utils/actor_manager.py", line 174, in apply
return func(self, *args, **kwargs)
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/execution/rollout_ops.py", line 86, in <lambda>
lambda w: w.sample(), local_worker=False, healthy_only=True
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/rollout_worker.py", line 900, in sample
batches = [self.input_reader.next()]
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 92, in next
batches = [self.get_data()]
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 285, in get_data
item = next(self._env_runner)
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 694, in _env_runner
eval_results = _do_policy_eval(
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 1207, in _do_policy_eval
eval_results[policy_id] = policy.compute_actions_from_input_dict(
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/policy/tf_policy.py", line 321, in compute_actions_from_input_dict
to_fetch = self._build_compute_actions(
File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/policy/tf_policy.py", line 1087, in _build_compute_actions
tree.map_structure(
File "/Users/nell/miniforge3/lib/python3.10/site-packages/tree/__init__.py", line 433, in map_structure
assert_same_structure(structures[0], other, check_types=check_types)
File "/Users/nell/miniforge3/lib/python3.10/site-packages/tree/__init__.py", line 288, in assert_same_structure
raise type(e)("%s\n"
ValueError: The two structures don't have the same nested structure.
First structure: type=Tensor str=Tensor("agent_0_wk1/Placeholder:0", shape=(?, 23236), dtype=float32)
Second structure: type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'POSITION': array([[7, 7]], dtype=int32), 'RGB': array([[[[158, 194, 101],
[158, 194, 101],
[158, 194, 101],
...,
[210, 195, 175],
[210, 195, 175],
[220, 205, 185]],
[[158, 194, 101],
[158, 194, 101],
[158, 194, 101],
...,
[210, 195, 175],
[220, 205, 185],
[210, 195, 175]],
[[158, 194, 101],
[158, 194, 101],
[ 53, 132, 49],
...,
[220, 205, 185],
[220, 205, 185],
[220, 205, 185]],
...,
[[210, 195, 175],
[220, 205, 185],
[210, 195, 175],
...,
[210, 195, 175],
[210, 195, 175],
[220, 205, 185]],
[[220, 205, 185],
[210, 195, 175],
[220, 205, 185],
...,
[220, 205, 185],
[220, 205, 185],
[220, 205, 185]],
[[210, 195, 175],
[220, 205, 185],
[210, 195, 175],
...,
[220, 205, 185],
[210, 195, 175],
[220, 205, 185]]]], dtype=uint8)}
More specifically: Substructure "type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'POSITION': array([[7, 7]], dtype=int32), 'RGB': array([[[[158, 194, 101],
[158, 194, 101],
[158, 194, 101],
...,
[210, 195, 175],
[210, 195, 175],
[220, 205, 185]],
[[158, 194, 101],
[158, 194, 101],
[158, 194, 101],
...,
[210, 195, 175],
[220, 205, 185],
[210, 195, 175]],
[[158, 194, 101],
[158, 194, 101],
[ 53, 132, 49],
...,
[220, 205, 185],
[220, 205, 185],
[220, 205, 185]],
...,
[[210, 195, 175],
[220, 205, 185],
[210, 195, 175],
...,
[210, 195, 175],
[210, 195, 175],
[220, 205, 185]],
[[220, 205, 185],
[210, 195, 175],
[220, 205, 185],
...,
[220, 205, 185],
[220, 205, 185],
[220, 205, 185]],
[[210, 195, 175],
[220, 205, 185],
[210, 195, 175],
...,
[220, 205, 185],
[210, 195, 175],
[220, 205, 185]]]], dtype=uint8)}" is a sequence, while substructure "type=Tensor str=Tensor("agent_0_wk1/Placeholder:0", shape=(?, 23236), dtype=float32)" is not
Entire first structure:
.
Entire second structure:
{'READY_TO_SHOOT': ., 'ORIENTATION': ., 'POSITION': ., 'RGB': .}
== Status ==
Current time: 2023-01-30 13:48:19 (running for 00:00:27.26)
Memory usage on this node: 11.5/16.0 GiB
Using FIFO scheduling algorithm.
Resources requested: 0/8 CPUs, 0/0 GPUs, 0.0/6.42 GiB heap, 0.0/2.0 GiB objects
Result logdir: /Users/nell/ray_results/PPO
Number of trials: 1/1 (1 ERROR)
Number of errored trials: 1
+----------------------------+--------------+----------------------------------------------------------------------------------------+
| Trial name | # failures | error file |
|----------------------------+--------------+----------------------------------------------------------------------------------------|
| PPO_meltingpot_4f50d_00000 | 1 | /Users/nell/ray_results/PPO/PPO_meltingpot_4f50d_00000_0_2023-01-30_13-47-52/error.txt |
+----------------------------+--------------+----------------------------------------------------------------------------------------+
== Status ==
Current time: 2023-01-30 13:48:19 (running for 00:00:27.27)
Memory usage on this node: 11.5/16.0 GiB
Using FIFO scheduling algorithm.
Resources requested: 0/8 CPUs, 0/0 GPUs, 0.0/6.42 GiB heap, 0.0/2.0 GiB objects
Result logdir: /Users/nell/ray_results/PPO
Number of trials: 1/1 (1 ERROR)
Number of errored trials: 1
+----------------------------+--------------+----------------------------------------------------------------------------------------+
| Trial name | # failures | error file |
|----------------------------+--------------+----------------------------------------------------------------------------------------|
| PPO_meltingpot_4f50d_00000 | 1 | /Users/nell/ray_results/PPO/PPO_meltingpot_4f50d_00000_0_2023-01-30_13-47-52/error.txt |
+----------------------------+--------------+----------------------------------------------------------------------------------------+
(PPO pid=46962) 2023-01-30 13:48:19,688 ERROR actor_manager.py:486 -- Ray error, taking actor 1 out of service. ray::RolloutWorker.apply() (pid=46974, ip=127.0.0.1, repr=<ray.rllib.evaluation.rollout_worker.RolloutWorker object at 0x17dd7bfd0>)
(PPO pid=46962) ValueError: The two structures don't have the same nested structure.
(PPO pid=46962)
(PPO pid=46962) First structure: type=Tensor str=Tensor("agent_0_wk1/Placeholder:0", shape=(?, 23236), dtype=float32)
(PPO pid=46962)
(PPO pid=46962) Second structure: type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'POSITION': array([[7, 7]], dtype=int32), 'RGB': array([[[[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [ 53, 132, 49],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) ...,
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]]]], dtype=uint8)}
(PPO pid=46962)
(PPO pid=46962) More specifically: Substructure "type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'POSITION': array([[7, 7]], dtype=int32), 'RGB': array([[[[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [ 53, 132, 49],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) ...,
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]]]], dtype=uint8)}" is a sequence, while substructure "type=Tensor str=Tensor("agent_0_wk1/Placeholder:0", shape=(?, 23236), dtype=float32)" is not
(PPO pid=46962)
(PPO pid=46962) During handling of the above exception, another exception occurred:
(PPO pid=46962)
(PPO pid=46962) ray::RolloutWorker.apply() (pid=46974, ip=127.0.0.1, repr=<ray.rllib.evaluation.rollout_worker.RolloutWorker object at 0x17dd7bfd0>)
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/utils/actor_manager.py", line 183, in apply
(PPO pid=46962) raise e
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/utils/actor_manager.py", line 174, in apply
(PPO pid=46962) return func(self, *args, **kwargs)
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/execution/rollout_ops.py", line 86, in <lambda>
(PPO pid=46962) lambda w: w.sample(), local_worker=False, healthy_only=True
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/rollout_worker.py", line 900, in sample
(PPO pid=46962) batches = [self.input_reader.next()]
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 92, in next
(PPO pid=46962) batches = [self.get_data()]
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 285, in get_data
(PPO pid=46962) item = next(self._env_runner)
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 694, in _env_runner
(PPO pid=46962) eval_results = _do_policy_eval(
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 1207, in _do_policy_eval
(PPO pid=46962) eval_results[policy_id] = policy.compute_actions_from_input_dict(
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/policy/tf_policy.py", line 321, in compute_actions_from_input_dict
(PPO pid=46962) to_fetch = self._build_compute_actions(
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/policy/tf_policy.py", line 1087, in _build_compute_actions
(PPO pid=46962) tree.map_structure(
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/tree/__init__.py", line 433, in map_structure
(PPO pid=46962) assert_same_structure(structures[0], other, check_types=check_types)
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/tree/__init__.py", line 288, in assert_same_structure
(PPO pid=46962) raise type(e)("%s\n"
(PPO pid=46962) ValueError: The two structures don't have the same nested structure.
(PPO pid=46962)
(PPO pid=46962) First structure: type=Tensor str=Tensor("agent_0_wk1/Placeholder:0", shape=(?, 23236), dtype=float32)
(PPO pid=46962)
(PPO pid=46962) Second structure: type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'POSITION': array([[7, 7]], dtype=int32), 'RGB': array([[[[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [ 53, 132, 49],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) ...,
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]]]], dtype=uint8)}
(PPO pid=46962)
(PPO pid=46962) More specifically: Substructure "type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'POSITION': array([[7, 7]], dtype=int32), 'RGB': array([[[[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [ 53, 132, 49],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) ...,
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]]]], dtype=uint8)}" is a sequence, while substructure "type=Tensor str=Tensor("agent_0_wk1/Placeholder:0", shape=(?, 23236), dtype=float32)" is not
(PPO pid=46962) Entire first structure:
(PPO pid=46962) .
(PPO pid=46962) Entire second structure:
(PPO pid=46962) {'READY_TO_SHOOT': ., 'ORIENTATION': ., 'POSITION': ., 'RGB': .}
(PPO pid=46962) 2023-01-30 13:48:19,688 ERROR actor_manager.py:486 -- Ray error, taking actor 2 out of service. ray::RolloutWorker.apply() (pid=46975, ip=127.0.0.1, repr=<ray.rllib.evaluation.rollout_worker.RolloutWorker object at 0x302a7ffd0>)
(PPO pid=46962) ValueError: The two structures don't have the same nested structure.
(PPO pid=46962)
(PPO pid=46962) First structure: type=Tensor str=Tensor("agent_0_wk2/Placeholder:0", shape=(?, 23236), dtype=float32)
(PPO pid=46962)
(PPO pid=46962) Second structure: type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'RGB': array([[[[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [ 53, 132, 49],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) ...,
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]]]], dtype=uint8), 'POSITION': array([[7, 7]], dtype=int32)}
(PPO pid=46962)
(PPO pid=46962) More specifically: Substructure "type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'RGB': array([[[[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [ 53, 132, 49],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) ...,
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]]]], dtype=uint8), 'POSITION': array([[7, 7]], dtype=int32)}" is a sequence, while substructure "type=Tensor str=Tensor("agent_0_wk2/Placeholder:0", shape=(?, 23236), dtype=float32)" is not
(PPO pid=46962)
(PPO pid=46962) During handling of the above exception, another exception occurred:
(PPO pid=46962)
(PPO pid=46962) ray::RolloutWorker.apply() (pid=46975, ip=127.0.0.1, repr=<ray.rllib.evaluation.rollout_worker.RolloutWorker object at 0x302a7ffd0>)
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/utils/actor_manager.py", line 183, in apply
(PPO pid=46962) raise e
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/utils/actor_manager.py", line 174, in apply
(PPO pid=46962) return func(self, *args, **kwargs)
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/execution/rollout_ops.py", line 86, in <lambda>
(PPO pid=46962) lambda w: w.sample(), local_worker=False, healthy_only=True
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/rollout_worker.py", line 900, in sample
(PPO pid=46962) batches = [self.input_reader.next()]
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 92, in next
(PPO pid=46962) batches = [self.get_data()]
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 285, in get_data
(PPO pid=46962) item = next(self._env_runner)
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 694, in _env_runner
(PPO pid=46962) eval_results = _do_policy_eval(
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/evaluation/sampler.py", line 1207, in _do_policy_eval
(PPO pid=46962) eval_results[policy_id] = policy.compute_actions_from_input_dict(
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/policy/tf_policy.py", line 321, in compute_actions_from_input_dict
(PPO pid=46962) to_fetch = self._build_compute_actions(
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/ray/rllib/policy/tf_policy.py", line 1087, in _build_compute_actions
(PPO pid=46962) tree.map_structure(
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/tree/__init__.py", line 433, in map_structure
(PPO pid=46962) assert_same_structure(structures[0], other, check_types=check_types)
(PPO pid=46962) File "/Users/nell/miniforge3/lib/python3.10/site-packages/tree/__init__.py", line 288, in assert_same_structure
(PPO pid=46962) raise type(e)("%s\n"
(PPO pid=46962) ValueError: The two structures don't have the same nested structure.
(PPO pid=46962)
(PPO pid=46962) First structure: type=Tensor str=Tensor("agent_0_wk2/Placeholder:0", shape=(?, 23236), dtype=float32)
(PPO pid=46962)
(PPO pid=46962) Second structure: type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'RGB': array([[[[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [ 53, 132, 49],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) ...,
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]]]], dtype=uint8), 'POSITION': array([[7, 7]], dtype=int32)}
(PPO pid=46962)
(PPO pid=46962) More specifically: Substructure "type=dict str={'READY_TO_SHOOT': array([1.]), 'ORIENTATION': array([1], dtype=int32), 'RGB': array([[[[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175]],
(PPO pid=46962)
(PPO pid=46962) [[158, 194, 101],
(PPO pid=46962) [158, 194, 101],
(PPO pid=46962) [ 53, 132, 49],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) ...,
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [220, 205, 185]],
(PPO pid=46962)
(PPO pid=46962) [[210, 195, 175],
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) ...,
(PPO pid=46962) [220, 205, 185],
(PPO pid=46962) [210, 195, 175],
(PPO pid=46962) [220, 205, 185]]]], dtype=uint8), 'POSITION': array([[7, 7]], dtype=int32)}" is a sequence, while substructure "type=Tensor str=Tensor("agent_0_wk2/Placeholder:0", shape=(?, 23236), dtype=float32)" is not
(PPO pid=46962) Entire first structure:
(PPO pid=46962) .
(PPO pid=46962) Entire second structure:
(PPO pid=46962) {'READY_TO_SHOOT': ., 'ORIENTATION': ., 'RGB': ., 'POSITION': .}
2023-01-30 13:48:20,055 ERROR tune.py:758 -- Trials did not complete: [PPO_meltingpot_4f50d_00000]
2023-01-30 13:48:20,056 INFO tune.py:762 -- Total run time: 27.62 seconds (27.27 seconds for the tuning loop).
<ray.tune.result_grid.ResultGrid object at 0x1780dbc10>
Traceback (most recent call last):
File "/Users/nell/Documents/GitHub/norm-games/examples/rllib/self_play_train.py", line 159, in <module>
main()
File "/Users/nell/Documents/GitHub/norm-games/examples/rllib/self_play_train.py", line 155, in main
assert results.num_errors == 0
AssertionError
Editor is loading...