(v2.2.5) - Multi-Objective Reward compatibility in Sinergym #303

AlejandroCN7 · 2023-03-09T16:07:11Z

Description

To make Sinergym compatible with multi-objective reinforcement learning (MORL), this PR adds a wrapper that returns a vector of reward components instead of a scalar value.
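
A minimal sketch of the idea, not the actual code added to wrappers.py: the wrapper reads the named reward terms from the info dictionary and returns them as a list in place of the scalar reward. The `DummyEnv` class, the `MultiObjectiveRewardSketch` name, the `reward_terms` parameter, and the info keys `reward_energy` and `reward_comfort` are assumptions made for illustration only.

```python
from typing import Any, Dict, Sequence, Tuple


class DummyEnv:
    """Hypothetical stand-in for an environment; not a Sinergym class."""

    def step(self, action: Any) -> Tuple[Any, float, bool, Dict[str, Any]]:
        # Assumed layout: (observation, scalar reward, done flag, info dict).
        info = {"reward": -1.5, "reward_energy": -1.0, "reward_comfort": -0.5}
        return [20.0, 21.5], info["reward"], False, info


class MultiObjectiveRewardSketch:
    """Replace the scalar reward with a vector of the selected reward terms."""

    def __init__(self, env: Any, reward_terms: Sequence[str]) -> None:
        self.env = env
        self.reward_terms = list(reward_terms)

    def step(self, action: Any) -> tuple:
        result = list(self.env.step(action))
        info = result[-1]  # the info dict is assumed to be the last element
        # Build the reward vector in the order requested by the caller.
        result[1] = [info[term] for term in self.reward_terms]
        return tuple(result)


env = MultiObjectiveRewardSketch(DummyEnv(), ["reward_energy", "reward_comfort"])
obs, reward_vector, done, info = env.step(action=None)
print(reward_vector)
```

Because the vector is built from info, the scalar reward stays available under its own key for code that still expects it.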

Motivation and Context

We are interested in enabling the use of MORL algorithms within Sinergym. This involves some minor changes and enhancements, such as a logger update (see changelog).

I have raised an issue to propose this change (required for new features and bug fixes)

Why is this change required? What problem does it solve? Please, reference issue or issues opened previously.

Fixes #301

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Improvement (of an existing feature)
Others

Checklist:

I've read the CONTRIBUTION guide (required)
My change requires a change to the documentation.
I have updated the tests.
I have updated the documentation accordingly.
I have reformatted the code using autopep8 second level aggressive.
I have reformatted the code using isort.
I have ensured cd docs && make spelling && make html pass (required if documentation has been updated.)
I have ensured pytest tests/ -vv pass. (required).
I have ensured pytype -d import-error sinergym/ pass. (required)

Changelog:

Update version from 2.2.4 to 2.2.5.
Updated reward terms dictionary in rewards.py.
Improved the info dictionary returned by the env layer (it now includes the reward and all reward terms automatically, regardless of the reward class used).
Added MultiObjectiveReward in wrappers.py.
Enhanced the Sinergym logger: it now reads reward information from info, which makes it more general.
Fixed logger wrapper according to CSVLogger update.
Added timestep (value 0) and time_elapsed (value 0) in simulator info construction (reset method).
Updated fixture and test in order to cover this new functionality.
Added documentation and notebook information.
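
The info and logger items above can be sketched as follows, under the assumption that every reward term is stored in info under a key starting with `reward`. The function name `reward_columns` and all key names and values are hypothetical and are not taken from the Sinergym code.

```python
from typing import Any, Dict, List, Tuple


def reward_columns(info: Dict[str, Any]) -> Tuple[List[str], List[Any]]:
    """Collect reward-related entries from info without knowing the reward class."""
    # Assumption: reward terms are the info keys that start with "reward".
    names = sorted(key for key in info if key.startswith("reward"))
    return names, [info[name] for name in names]


# Hypothetical info dict as it might look after one step.
info = {"timestep": 1, "time_elapsed": 900, "reward": -1.5,
        "reward_energy": -1.0, "reward_comfort": -0.5}
header, row = reward_columns(info)
print(header)
print(row)
```

A logger written this way does not need a code change when a reward class adds or renames a term, which is the kind of generality the changelog entry describes.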


AlejandroCN7 added 15 commits March 9, 2023 14:23

Updated Sinergym version from 2.2.4 to 2.2.5

05e249e

Updated reward terms dictionary

5ee81fe

Improved environment layer info dict construction

e80757c

Added MultiObjectiveReward in wrappers.py

ed73247

Enhanced Sinergym logger: Using info for reward information in order to be more general

44fd7ad

Fixed loggerWrapper to logger update

07b42dc

Added timestep and time_elapsed (both 0) to simulator info in reset

0a370b5

Added fixture with multiobjective wrapper

8d33f41

Added tests for multiobjective wrapper

8857f07

Fix simulator test to new info dict dimension

6bac8c7

Fixed pytype errors

a079f12

Update documentation modules for API reference documentation

d0cb2b3

Documentation: Updated wrapper section

0a1ece1

Update wrapper notebook example

f36b000

Documentation: Updated reward section and added issue reference un multiObjective wrapper section

aec5a5c

AlejandroCN7 marked this pull request as ready for review March 10, 2023 10:48

AlejandroCN7 merged commit f33f130 into main Mar 10, 2023

AlejandroCN7 deleted the feat/issue-301 branch March 10, 2023 10:49
