Replies: 2 comments
-
Thinking about this model, the main questions: what are they trying to express? I think it's something like the agent is an approximately rational agent. If it were the agent were a rational agent, then it's actions would be determinsitic, the arg_max E(utility). But they want to say it is approximately optimal, but then the question is in what way. But what about their particular construction? As described, given the uniform prior, it makes it so that the probability of any action is something like
Why don't we just allow Omega to express this directly? Perhaps if we had the general ability to construct random variables from their densities, we wouldn't need any machinery. This deserves an issue by itself. There are two advantages that
|
Beta Was this translation helpful? Give feedback.
-
For completeness, let me just say what I don't like about the factor interface:
|
Beta Was this translation helpful? Give feedback.
-
Many PPLs, such as WebPPL, have a factor statement. When included in a random procedure,
factor(x)
addsx
to the logpdf of the trace.This is used for all kinds of interesting things That said, the design of it worries me. The main question I'd like to consider is whether Omega needs a factor statement, and if not, what it should have instead.
Example:
Here, factor is used to make actions with high expected utility more probable. It implements a kind of planning as inference.
How should we think about this?
Beta Was this translation helpful? Give feedback.
All reactions