TY  - BOOK
AU  - Lapan, Maxim
TI  - Deep reinforcement learning hands-on: Apply modern RL methods to practical problems of chatbots, robotics, discrete optimization, web automation, and more
SN  - 978-1-83882-699-4 (paperback)
U1  - 006.31 L299d 2020 
PY  - 2020///
CY  - United Kingdom
PB  - Packt Publishing
KW  - DEEP LEARNING (MACHINE LEARNING)
KW  - MACHINE LEARNING (ARTIFICIAL INTELLIGENCE)
KW  - SOFTWARE ENGINEERING
KW  - NATURAL LANGUAGE PROCESSING (COMPUTER SCIENCE)
N1  - Includes index; Preface -- Chapter 1: What is reinforcement learning? -- Chapter 2: OpenAI Gym -- Chapter 3: Deep learning with PyTorch -- Chapter 4: The cross-entropy method -- Chapter 5: Tabular learning and the Bellman equation -- Chapter 6: Deep Q-networks -- Chapter 7: Higher-level RL libraries -- Chapter 8: DQN extensions -- Chapter 9: Ways to speed up RL -- Chapter 10: Stock trading using RL -- Chapter 11: Policy gradients - an alternative -- Chapter 12: The actor-critic method -- Chapter 13: Asynchronous advantage actor-critic -- Chapter 14: Training chatbots with RL -- Chapter 15: The TextWorld environment -- Chapter 16: Web navigation -- Chapter 17: Continuous action space -- Chapter 18: RL in robotics -- Chapter 19: Trust regions - PPO, TRPO, ACKTR, and SAC -- Chapter 20: Black-box optimization in RL -- Chapter 21: Advanced exploration -- Chapter 22: Beyond model-free - imagination -- Chapter 23: AlphaGo Zero -- Chapter 24: RL in discrete optimization -- Chapter 25: Multi-agent RL
ER  -