osnaDocs: Relevance-based Online Planning in Complex POMDPs

Please use this identifier to cite or link to this resource:
https://osnadocs.ub.uni-osnabrueck.de/handle/urn:nbn:de:gbv:700-202007173302

Full metadata record

DC Element	Value	Language
dc.contributor.advisor	Prof. Dr. Joachim Hertzberg	ger
dc.creator	Saborío Morales, Juan Carlos	-
dc.date.accessioned	2020-07-17T10:08:18Z	-
dc.date.available	2020-07-17T10:08:18Z	-
dc.date.issued	2020-07-17T10:08:20Z	-
dc.identifier.uri	https://osnadocs.ub.uni-osnabrueck.de/handle/urn:nbn:de:gbv:700-202007173302	-
dc.description.abstract	Planning under uncertainty is a central topic at the intersection of disciplines such as artificial intelligence, cognitive science and robotics, and its aim is to enable artificial agents to solve challenging problems through a systematic approach to decision-making. Some of these challenges include generating expectations about different outcomes governed by a probability distribution and estimating the utility of actions based only on partial information. In addition, an agent must incorporate observations or information from the environment into its deliberation process and produce the next best action to execute, based on an updated understanding of the world. This process is commonly modeled as a POMDP, a discrete stochastic system that becomes intractable very quickly. Many real-world problems, however, can be simplified following cues derived from contextual information about the relative expected value of actions. Based on an intuitive approach to problem solving, and relying on ideas related to attention and relevance estimation, we propose a new approach to planning supported by our two main contributions: PGS grants an agent the ability to generate internal preferences and biases to guide action selection, and IRE allows the agent to reduce the dimensionality of complex problems while planning online. Unlike existing work that improves the performance of planning on POMDPs, PGS and IRE do not rely on detailed heuristics or domain knowledge, explicit action hierarchies or manually designed dependencies for state factoring. Our results show that this level of autonomy is important for solving increasingly challenging problems, where manually designed simplifications scale poorly.	eng
dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Germany	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/de/	*
dc.subject	Planning under uncertainty	eng
dc.subject	POMDP planning	eng
dc.subject	Monte Carlo Tree Search	eng
dc.subject.ddc	004 - Informatik	ger
dc.title	Relevance-based Online Planning in Complex POMDPs	eng
dc.type	Dissertation or Habilitation [doctoralThesis]	-
thesis.location	Osnabrück	-
thesis.institution	University	-
thesis.type	Dissertation [thesis.doctoral]	-
thesis.date	2020-06-25	-
orcid.creator	https://orcid.org/0000-0003-3625-0661	-
dc.contributor.referee	Prof. Dr. Marc Toussaint	ger
dc.subject.bk	54.72 - Künstliche Intelligenz	ger
dc.subject.msc	60-08 - Computational methods	ger
dc.subject.ccs	I.2.8 - Problem Solving, Control Methods, and Search	ger
Appears in collections:	FB06 - E-Dissertationen

Files in this resource:

File	Description	Size	Format
thesis_saborio_morales.pdf	Presentation format	1.02 MB	Adobe PDF

This resource is published under the following copyright terms: Creative Commons license