return_calculator module
Author: Banafsheh Rafiee
Description: ReturnCalculator samples a set of time steps from the trajectory generated by the behavior policy and computes the return for each of them. To compute the return at a sampled time step, it switches from the behavior policy to the target policy at that step.
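The idea described above can be illustrated with a minimal sketch. It is not the module's actual implementation: the `ToyChain` environment, the `estimate_returns` function, and all parameters (`sample_steps`, `gamma`, `horizon`) are hypothetical names chosen for illustration, and the sketch assumes the environment can be copied so that a target-policy rollout does not disturb the behavior trajectory.

```python
import copy
import random


class ToyChain:
    """Hypothetical 1-D chain environment, used only for illustration."""

    def __init__(self, size=5):
        self.size = size
        self.state = 0

    def step(self, action):
        # action is +1 or -1; reaching the right end gives reward 1 and ends the episode
        self.state = max(0, min(self.size - 1, self.state + action))
        done = self.state == self.size - 1
        return self.state, (1.0 if done else 0.0), done


def estimate_returns(env, behavior_policy, target_policy, num_behavior_steps,
                     sample_steps, gamma=0.9, horizon=100):
    """Return {time_step: discounted return obtained by following target_policy}."""
    returns = {}
    state = env.state
    for t in range(num_behavior_steps):
        if t in sample_steps:
            # Switch to the target policy on a copy of the environment (assumption:
            # the environment is copyable), leaving the behavior trajectory intact.
            branch = copy.deepcopy(env)
            s, g, discount = state, 0.0, 1.0
            for _ in range(horizon):
                s, reward, done = branch.step(target_policy(s))
                g += discount * reward
                discount *= gamma
                if done:
                    break
            returns[t] = g
        # Continue the behavior trajectory with the behavior policy.
        state, _, done = env.step(behavior_policy(state))
        if done:
            break
    return returns


# Usage: a random behavior policy, and a target policy that always moves right.
sampled = estimate_returns(
    ToyChain(size=5),
    behavior_policy=lambda s: random.choice([-1, 1]),
    target_policy=lambda s: 1,
    num_behavior_steps=10,
    sample_steps={0, 3, 6},
)
```

Branching on a copy is one possible design choice; an environment that supports saving and restoring its state would serve the same purpose.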