Hello,
during evaluation of trained RL models I noticed that the observation values of features that are not present in a scenario (e.g. lane_based_rel_1) take constant values which are different from zero. These values are independent from the evaluated sceanrio, but dependent of the RL model used for the evaluation.
Edit: The default values for absent features represent 0.0 in the model’s normalized input space.
Can this be interpreted to characterize the trained model? E.g. as a bias, that the model has for each individual feature?
Best,
Annabel