Studying Sutton and Barto's RL book and its connections to RL for LLMs (e.g., tool use, math reasoning, agents)? [D]
Hi everyone, I graduated from a master's program in math last summer. In recent months, I have been trying to understand more about ML/DL a...