Withdraw
Loading…
On informational nudging and control of payoff-based learning
Guers, Robin
Loading…
Permalink
https://hdl.handle.net/2142/45433
Description
- Title
- On informational nudging and control of payoff-based learning
- Author(s)
- Guers, Robin
- Issue Date
- 2013-08-22T16:40:02Z
- Director of Research (if dissertation) or Advisor (if thesis)
- Langbort, Cedric
- Department of Study
- Aerospace Engineering
- Discipline
- Aerospace Engineering
- Degree Granting Institution
- University of Illinois at Urbana-Champaign
- Degree Name
- M.S.
- Degree Level
- Thesis
- Keyword(s)
- Stochastic Averaging
- Payoff Learning
- Nonlinear Systems
- Abstract
- This thesis investigates a model of informational nudging. It focuses on a situation where a decision maker faces a set of alternatives. Each alternative brings a certain stochastic reward/payoff to the decision maker. The decision maker/user repeatedly chooses from the set of alternatives so as to maximize the reward she obtains. The user remembers her past experiments and builds an estimate of the reward of each alternative to make her future decision. The reward estimate is built with the assumption that the user averages the reward of the alternative she just chose with her past reward estimate, using a non summable, square summable sequence of averaging factors, while leaving the estimate of the alternative she did not choose unchanged. The decision process is repeated over an infinite time horizon and the relative importance she gives to new experiment compared to her past experiment decreases as time goes on. This is a key assumption to study the asymptotic behavior of the process, since we use stochastic averaging techniques. At each step of the process the user chooses the alternative using her payoff estimate and a logit rule. With this model the user can only gather information about one alternative at each step of the process, hence the estimate of a rarely chosen alternative is not often updated. Therefore we introduce a recommender who provides information about the unchosen alternatives at every step, making it possible for the user to update the payoff estimate of all alternatives at every step of the process. This modifies the payoff estimate, modifies the subsequent choice of the users, i.e. the whole decision process. We are particularly interested in studying the situation where the recommender provides incorrect or misleading information to influence the decision maker behavior, as a way to achieve more desirable equilibria. Building on the theory of stochastic averaging, control strategies are derived to enforce a desired equilibrium.
- Graduation Semester
- 2013-08
- Permalink
- http://hdl.handle.net/2142/45433
- Copyright and License Information
- Copyright 2013 Robin Guers
Owning Collections
Graduate Dissertations and Theses at Illinois PRIMARY
Graduate Theses and Dissertations at IllinoisManage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…