Skip to content

Commit

Permalink
Pluralise "problem" (#218)
Browse files Browse the repository at this point in the history
In the section "Probabilistic modeling with Bayesian networks", I believe the intended formulation might be "which will often be the case in the problems we will consider" 🙂
  • Loading branch information
nilq authored May 19, 2023
1 parent 22a23d0 commit a3008e2
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion representation/directed/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,7 +32,7 @@ $$ p(x_i \mid x_{i-1}, \dotsc, x_1) = p(x_i \mid x_{A_i}). $$

For example, in a model with five variables, we may choose to approximate the factor $$p(x_5 \mid x_4, x_3, x_2, x_1)$$ with $$p(x_5 \mid x_4, x_3)$$. In this case, we write $$x_{A_5} = \{x_4, x_3\}$$.

When the variables are discrete (which will often be the case in the problem we will consider), we may think of the factors $$p(x_i\mid x_{A_i})$$ as *probability tables*, in which rows correspond to assignments to $$x_{A_i}$$ and columns correspond to values of $$x_i$$; the entries contain the actual probabilities $$p(x_i\mid x_{A_i})$$. If each variable takes $$d$$ values and has at most $$k$$ ancestors, then the entire table will contain at most $$O(d^{k+1})$$ entries. Since we have one table per variable, the entire probability distribution can be compactly described with only $$O(nd^{k+1})$$ parameters (compared to $$O(d^n)$$ with a naive approach).
When the variables are discrete (which will often be the case in the problems we will consider), we may think of the factors $$p(x_i\mid x_{A_i})$$ as *probability tables*, in which rows correspond to assignments to $$x_{A_i}$$ and columns correspond to values of $$x_i$$; the entries contain the actual probabilities $$p(x_i\mid x_{A_i})$$. If each variable takes $$d$$ values and has at most $$k$$ ancestors, then the entire table will contain at most $$O(d^{k+1})$$ entries. Since we have one table per variable, the entire probability distribution can be compactly described with only $$O(nd^{k+1})$$ parameters (compared to $$O(d^n)$$ with a naive approach).

### Graphical representation.

Expand Down

0 comments on commit a3008e2

Please sign in to comment.