The Author Online Book Forums are Moving

The Author Online Book Forums will soon redirect to Manning's liveBook and liveVideo. All book forum content will migrate to liveBook's discussion forum and all video forum content will migrate to liveVideo. Log in to liveBook or liveVideo with your Manning credentials to join the discussion!

Thank you for your engagement in the AoF over the years! We look forward to offering you a more enhanced forum experience.

57334 (1) [Avatar] Offline
#1
page 139
action_q_vals[0, next_action_idx] = reward + self.gamma * next_action_q_vals[0,
next_action_idx]

I think action_q_vals[0, next_action_idx] should be action_q_vals[0,current_action_idx] ???

Thanks
367062 (2) [Avatar] Offline
#2
I also wonder if this is right an makes sense (see my topic: https://forums.manning.com/posts/list/41375.page)
Nishant Shukla (52) [Avatar] Offline
#3
Indeed it should be current_action_idx. Thank you!