Hernandez-Lerma | Adaptive Markov Control Processes | Book | 978-1-4612-6454-5 | sack.de

Book, English, Volume 79, 148 pages, Format (W × H): 155 mm × 235 mm, Weight: 265 g

Series: Applied Mathematical Sciences

Hernandez-Lerma

Adaptive Markov Control Processes

Softcover reprint of the original 1st edition 1989
ISBN: 978-1-4612-6454-5
Publisher: Springer

Adaptive Markov Control Processes
Publication year 2012, 978-1-4419-8714-3, eBook, PDF, 1 - PDF Watermark

53,49 €

(incl. VAT)

Free delivery
Delivery time: up to 10 days

Free shipping on books

Free returns

This book is concerned with a class of discrete-time stochastic control processes known as controlled Markov processes (CMP's), also known as Markov decision processes or Markov dynamic programs. Starting in the mid-1950s with Richard Bellman, many contributions to CMP's have been made, and applications to engineering, statistics and operations research, among other areas, have also been developed. The purpose of this book is to present some recent developments on the theory of adaptive CMP's, i.e., CMP's that depend on unknown parameters. Thus at each decision time, the controller or decision-maker must estimate the true parameter values, and then adapt the control actions to the estimated values. We do not intend to describe all aspects of stochastic adaptive control; rather, the selection of material reflects our own research interests. The prerequisite for this book is a knowledge of real analysis and probability theory at the level of, say, Ash (1972) or Royden (1968), but no previous knowledge of control or decision processes is required. The presentation, on the other hand, is meant to be self-contained, in the sense that whenever a result from analysis or probability is used, it is usually stated in full and references are supplied for further discussion, if necessary. Several appendices are provided for this purpose. The material is divided into six chapters. Chapter 1 contains the basic definitions about the stochastic control problems we are interested in; a brief description of some applications is also provided.
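The estimate-then-adapt loop described above can be illustrated with a minimal sketch. It is not taken from the book: the toy inventory model, the reward numbers, the names `solve` and `run_adaptive`, and the assumption that demand is observed even when stock is empty are all hypothetical choices made for illustration. The unknown parameter `theta` is the probability of a unit demand; at each step the controller re-solves the discounted problem for its current estimate and acts on the resulting policy.

```python
import random

MAX_STOCK = 2                                  # states are stock levels 0..MAX_STOCK
BETA = 0.9                                     # discount factor (illustrative)
PRICE, ORDER_COST, HOLD_COST = 1.0, 0.3, 0.1   # illustrative reward numbers


def actions(x):
    """Feasible order quantities (0 or 1) that keep stock within capacity."""
    return [a for a in (0, 1) if x + a <= MAX_STOCK]


def q_value(x, a, v, theta):
    """One-step expected reward plus discounted value, given P(demand = 1) = theta."""
    stock = x + a
    total = 0.0
    for d, prob in ((0, 1.0 - theta), (1, theta)):
        sold = min(stock, d)
        r = PRICE * sold - ORDER_COST * a - HOLD_COST * stock
        total += prob * (r + BETA * v[stock - sold])
    return total


def solve(theta, sweeps=100):
    """Value iteration for the model with parameter theta; returns a greedy policy."""
    states = range(MAX_STOCK + 1)
    v = [0.0] * (MAX_STOCK + 1)
    for _ in range(sweeps):
        v = [max(q_value(x, a, v, theta) for a in actions(x)) for x in states]
    return [max(actions(x), key=lambda a: q_value(x, a, v, theta)) for x in states]


def run_adaptive(true_theta, steps=500, seed=0):
    """Simulate the adaptive controller; returns the final estimate and policy."""
    rng = random.Random(seed)
    x, demand_sum, n = 0, 0, 0
    theta_hat = 0.5                      # arbitrary initial guess
    for _ in range(steps):
        policy = solve(theta_hat)        # adapt: optimal policy for the estimate
        a = policy[x]
        d = 1 if rng.random() < true_theta else 0   # demand from the true model
        x = x + a - min(x + a, d)        # stock after ordering and sales
        demand_sum += d                  # assumption: demand is always observed
        n += 1
        theta_hat = demand_sum / n       # estimate: empirical demand frequency
    return theta_hat, solve(theta_hat)
```

Calling `run_adaptive(0.7)` returns the final estimate of `theta` together with the policy the controller would use for it. In this toy model the demand is observed whatever action is taken, so the estimate does not depend on the control; models where the data depend on the actions chosen need more care.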

Target audience

Research

Authors/Editors

Hernandez-Lerma, Onesimo

Subject areas

Mathematics | Computer Science, Mathematics, Stochastics

Further information & material

Table of contents

1 Controlled Markov Processes
  1.1 Introduction
  1.2 Stochastic Control Problems
  1.3 Examples
  1.4 Further Comments
2 Discounted Reward Criterion
  2.1 Introduction
  2.2 Optimality Conditions
  2.3 Asymptotic Discount Optimality
  2.4 Approximation of MCM’s
  2.5 Adaptive Control Models
  2.6 Nonparametric Adaptive Control
  2.7 Comments and References
3 Average Reward Criterion
  3.1 Introduction
  3.2 The Optimality Equation
  3.3 Ergodicity Conditions
  3.4 Value Iteration
  3.5 Approximating Models
  3.6 Nonstationary Value Iteration
  3.7 Adaptive Control Models
  3.8 Comments and References
4 Partially Observable Control Models
  4.1 Introduction
  4.2 PO-CM: Case of Known Parameters
  4.3 Transformation into a CO Control Problem
  4.4 Optimal I-Policies
  4.5 PO-CM’s with Unknown Parameters
  4.6 Comments and References
5 Parameter Estimation in MCM’s
  5.1 Introduction
  5.2 Contrast Functions
  5.3 Minimum Contrast Estimators
  5.4 Comments and References
6 Discretization Procedures
  6.1 Introduction
  6.2 Preliminaries
  6.3 The Non-Adaptive Case
  6.4 Adaptive Control Problems
  6.5 Proofs
  6.6 Comments and References
Appendix A. Contraction Operators
Appendix B. Probability Measures
  Total Variation Norm
  Weak Convergence
Appendix C. Stochastic Kernels
Appendix D. Multifunctions and Measurable Selectors
  The Hausdorff Metric
  Multifunctions
References
Author Index

Webcode: sack.de/8y6hf