Finish docs

penelopeysm · penelopeysm · commit 16793e0aa933 · 2025-04-18T01:16:23.000+01:00
diff --git a/docs/Project.toml b/docs/Project.toml
@@ -1,4 +1,5 @@
 [deps]
+AbstractPPL = "7a57a42e-76ec-4ea3-a279-07e840d6d9cf"
 Accessors = "7d9f7c33-5ae7-4f3b-8dc6-eff91059b697"
 DataStructures = "864edb3b-99cc-5e75-8d2d-829cb0a9cfe8"
 Distributions = "31c24e10-a181-5473-b8eb-7969acd0382f"
diff --git a/docs/src/api.md b/docs/src/api.md
@@ -78,7 +78,7 @@ decondition
 
 ## Fixing and unfixing
 
-We can also _fix_ a collection of variables in a [`Model`](@ref) to certain using [`fix`](@ref).
+We can also _fix_ a collection of variables in a [`Model`](@ref) to certain using [`DynamicPPL.fix`](@ref).
 
 This might seem quite similar to the aforementioned [`condition`](@ref) and its siblings,
 but they are indeed different operations:
@@ -89,19 +89,19 @@ but they are indeed different operations:
   - `fix`ed variables are considered to be _constant_, and are thus not included
     in any log-probability computations.
 
-The differences are more clearly spelled out in the docstring of [`fix`](@ref) below.
+The differences are more clearly spelled out in the docstring of [`DynamicPPL.fix`](@ref) below.
 
 ```@docs
-fix
+DynamicPPL.fix
 DynamicPPL.fixed
 ```
 
-The difference between [`fix`](@ref) and [`condition`](@ref) is described in the docstring of [`fix`](@ref) above.
+The difference between [`DynamicPPL.fix`](@ref) and [`DynamicPPL.condition`](@ref) is described in the docstring of [`DynamicPPL.fix`](@ref) above.
 
-Similarly, we can [`unfix`](@ref) variables, i.e. return them to their original meaning:
+Similarly, we can revert this with [`DynamicPPL.unfix`](@ref), i.e. return the variables to their original meaning:
 
 ```@docs
-unfix
+DynamicPPL.unfix
 ```
 
 ## Predicting
diff --git a/docs/src/internals/submodel_condition.md b/docs/src/internals/submodel_condition.md
@@ -35,16 +35,9 @@ keys(vi)
     In this case, where `to_submodel` is called without any other arguments, the prefix to be used is automatically inferred from the name of the variable on the left-hand side of the tilde.
     We will return to the 'manual prefixing' case later.
 
-What does it really mean to 'become' a different variable?
-We can see this from [the definition of `tilde_assume`, for example](https://github.com/TuringLang/DynamicPPL.jl/blob/60ee68e2ce28a15c6062c243019e6208d16802a5/src/context_implementations.jl#L87-L89):
-
-```
-function tilde_assume(context::PrefixContext, right, vn, vi)
-    return tilde_assume(context.context, right, prefix(context, vn), vi)
-end
-```
-
-Functionally, this means that even though the _initial_ entry to the tilde-pipeline has `vn` as `x` and `y`, once the `PrefixContext` has been applied, the later functions will see `a.x` and `a.y` instead.
+The phrase 'becoming' a different variable is a little underspecified: it is useful to pinpoint the exact location where the prefixing occurs, which is `tilde_assume`.
+The method responsible for it is `tilde_assume(::PrefixContext, right, vn, vi)`: this attaches the prefix in the context to the `VarName` argument, before recursively calling `tilde_assume` with the new prefixed `VarName`.
+This means that even though a statement `x ~ dist` still enters the tilde pipeline at the top level as `x`, if the model evaluation context contains a `PrefixContext`, any function from `tilde_assume` onwards will see `a.x` instead.
 
 ## ConditionContext
 
@@ -205,29 +198,158 @@ DynamicPPL.hasconditioned_nested(inner_ctx_with_outer_cond, @varname(a.x))
 DynamicPPL.hasconditioned_nested(inner_ctx_with_inner_cond, @varname(a.x))
 ```
 
-Essentially, our job is threefold:
+This allows us to finally specify our task as follows:
 
-  - Firstly, given the correct arguments, we need to make sure that `hasconditioned_nested` and `getconditioned_nested` behave correctly.
+(1) Given the correct arguments, we need to make sure that `hasconditioned_nested` and `getconditioned_nested` behave correctly.
 
-  - Secondly, we need to make sure that both the correct arguments are supplied. In order to do so:
-    
-      + We need to make sure that when evaluating a submodel, the context stack is arranged such that prefixes are applied _inside_ the parent model's context, but _outside_ the submodel's own context.
-      + We also need to make sure that the `VarName` passed to it is prefixed correctly. This is, in fact, _not_ handled by `tilde_assume`, because `contextual_isassumption` is much higher in the call stack than `tilde_assume` is. So, we need to explicitly prefix it.
+(2) We need to make sure that both the correct arguments are supplied. In order to do so:
+
+  - (2a) We need to make sure that when evaluating a submodel, the context stack is arranged such that `PrefixContext` is applied _inside_ the parent model's context, but _outside_ the submodel's own context.
+
+  - (2b) We also need to make sure that the `VarName` passed to it is prefixed correctly.
 
 ## How do we do it?
 
-`hasconditioned_nested` accomplishes this by doing the following:
+(1) `hasconditioned_nested` and `getconditioned_nested` accomplish this by first 'collapsing' the context stack, i.e. they go through the context stack, remove all `PrefixContext`s, and apply those prefixes to any conditioned variables below it in the stack.
+Once the `PrefixContext`s have been removed, one can then iterate through the context stack and check if any of the `ConditionContext`s contain the variable, or get the value itself.
+For more details the reader is encouraged to read the source code.
+
+(2a) We ensure that the context stack is correctly arranged by relying on the behaviour of `make_evaluate_args_and_kwargs`.
+This function is called whenever a model (which itself contains a context) is evaluated with a separate ('external') context, and makes sure to arrange both of these contexts such that _the model's context is nested inside the external context_.
+Thus, as long as prefixing is implemented by applying a `PrefixContext` on the outermost layer of the _inner_ model context, this will be correctly combined with an external context to give the behaviour seen above.
+
+(2b) At first glance, it seems like `tilde_assume` can take care of the `VarName` prefixing for us (as described in the first section).
+However, this is not actually the case: `contextual_isassumption`, which is the function that calls `hasconditioned_nested`, is much higher in the call stack than `tilde_assume` is.
+So, we need to explicitly prefix it before passing it to `contextual_isassumption`.
+This is done inside the `@model` macro, or technically, its subsidiary function `isassumption`.
+
+## Nested submodels
+
+Just in case the above wasn't complicated enough, we need to also be very careful when dealing with nested submodels, which have multiple layers of `PrefixContext`s which may be interspersed with `ConditionContext`s.
+For example, in this series of nested submodels,
+
+```@example
+@model function charlie()
+    x ~ Normal()
+    y ~ Normal()
+    return z ~ Normal()
+end
+@model function bravo()
+    return b ~ to_submodel(charlie() | (@varname(x) => 1.0))
+end
+@model function alpha()
+    return a ~ to_submodel(bravo() | (@varname(b.y) => 1.0))
+end
+```
+
+we expect that the only variable to be sampled should be `z` inside `charlie`, or rather, `a.b.z` once it has been through the prefixes.
+
+```@example
+keys(VarInfo(alpha()))
+```
+
+The general strategy that we adopt is similar to above.
+Following the principle that `PrefixContext` should be nested inside the outer context, but outside the inner submodel's context, we can infer that the correct context inside `charlie` should be:
+
+```@example
+big_ctx = PrefixContext{:a}(
+    ConditionContext(
+        Dict(@varname(b.y) => 1.0),
+        PrefixContext{:b}(ConditionContext(Dict(@varname(x) => 1.0))),
+    ),
+)
+```
+
+We need several things to work correctly here: we need the `VarName` prefixing to behave correctly, and then we need to implement `hasconditioned_nested` and `getconditioned_nested` on the resulting prefixed `VarName`.
+It turns out that the prefixing itself is enough to illustrate the most important point in this section, namely, the need to traverse the context stack in a _different direction_ to what most of DynamicPPL does.
+
+Let's work with a function called `myprefix(::AbstractContext, ::VarName)` (to avoid confusion with any existing DynamicPPL function).
+We should like `myprefix(big_ctx, @varname(x))` to return `@varname(a.b.x)`.
+Consider the following naive implementation, which mirrors a lot of code in the tilde-pipeline:
+
+```@example
+using DynamicPPL: NodeTrait, IsLeaf, IsParent, childcontext, AbstractContext
+using AbstractPPL: AbstractPPL
+
+function myprefix(ctx::DynamicPPL.AbstractContext, vn::VarName)
+    return myprefix(NodeTrait(ctx), ctx, vn)
+end
+function myprefix(::IsLeaf, ::AbstractContext, vn::VarName)
+    return vn
+end
+function myprefix(::IsParent, ctx::AbstractContext, vn::VarName)
+    return myprefix(childcontext(ctx), vn)
+end
+function myprefix(ctx::DynamicPPL.PrefixContext{Prefix}, vn::VarName) where {Prefix}
+    # The functionality to actually manipulate the VarNames is in AbstractPPL
+    new_vn = AbstractPPL.prefix(vn, VarName{Prefix}())
+    # Then pass to the child context
+    return myprefix(childcontext(ctx), new_vn)
+end
+
+myprefix(big_ctx, @varname(x))
+```
+
+This implementation clearly is not correct, because it applies the _inner_ `PrefixContext` before the outer one.
+
+The right way to implement `myprefix` is to, essentially, reverse the order of two lines above:
+
+```@example
+function myprefix(ctx::DynamicPPL.PrefixContext{Prefix}, vn::VarName) where {Prefix}
+    # Pass to the child context first
+    new_vn = myprefix(childcontext(ctx), vn)
+    # Then apply this context's prefix
+    return AbstractPPL.prefix(new_vn, VarName{Prefix}())
+end
+
+myprefix(big_ctx, @varname(x))
+```
+
+This is a much better result!
+The implementation of related functions such as `hasconditioned_nested` and `getconditioned_nested`, under the hood, use a similar recursion scheme, so you will find that this is a common pattern when reading the source code of various prefixing-related functions, you will find that this is a common pattern
+When editing this code, it is worth being mindful of this as a potential source of incorrectness.
+
+!!! info
+    
+    If you have encountered left and right folds, the above discussion illustrates the difference between them: the wrong implementation of `myprefix` uses a left fold (which collects prefixes in the opposite order from which they are encountered), while the correct implementation uses a right fold.
 
-  - If the outermost layer is a `ConditionContext`, it checks whether the variable is contained in its values.
-  - If the outermost layer is a `PrefixContext`, it goes through the `PrefixContext`'s child context and prefixes any inner conditioned variables, before checking whether the variable is contained.
+## Loose ends 1: Manual prefixing
 
-We ensure that the context stack is correctly arranged by relying on the behaviour of `make_evaluate_args_and_kwargs`.
-This function is called whenever a model (which itself contains a context) is evaluated with a separate ('outer') context, and makes sure to arrange it such that the model's context is nested inside the outer context.
-Thus, as long as prefixing is implemented by applying a `PrefixContext` on the outermost layer of the _inner_ model context, this will be correctly combined with an outer context to give the behaviour seen above.
+Sometimes users may want to manually prefix a model, for example:
 
-And finally, we ensure that the `VarName` is correctly prefixed by modifying the `@model` macro (or, technically, its subsidiary `isassumption`) to explicitly prefix the variable before passing it to `contextual_isassumption`.
+```@example
+@model function inner_manual()
+    x ~ Normal()
+    return y ~ Normal()
+end
+
+@model function outer_manual()
+    return _unused ~ to_submodel(prefix(inner_manual(), :a), false)
+end
+```
+
+In this case, the `VarName` on the left-hand side of the tilde is not used, and the prefix is instead specified using the `prefix` function.
+
+The way to deal with this follows on from the previous discussion.
+Specifically, we said that:
+
+> [...] as long as prefixing is implemented by applying a `PrefixContext` on the outermost layer of the _inner_ model context, this will be correctly combined [...]
+
+When automatic prefixing is used, this application of `PrefixContext` occurs inside the `tilde_assume!!` method.
+In the manual prefixing case, we need to make sure that `prefix(submodel::Model, ::Symbol)` does the same thing, i.e. it inserts a `PrefixContext` at the outermost layer of `submodel`'s context.
+We can see that this is precisely what happens:
+
+```@example
+@model f() = x ~ Normal()
+
+model = f()
+prefixed_model = prefix(model, :a)
+
+(model.context, prefixed_model.context)
+```
 
-## FixedContext
+## Loose ends 2: FixedContext
 
 Finally, note that all of the above also applies to the interaction between `PrefixContext` and `FixedContext`, except that the functions have different names.
 (`FixedContext` behaves the same way as `ConditionContext`, except that unlike conditioned variables, fixed variables do not contribute to the log probability density.)
+This generally results in a large amount of code duplication, but the concepts that underlie both contexts are exactly the same.
diff --git a/src/context_implementations.jl b/src/context_implementations.jl
@@ -85,17 +85,15 @@ function tilde_assume(rng::Random.AbstractRNG, ::LikelihoodContext, sampler, rig
 end
 
 function tilde_assume(context::PrefixContext, right, vn, vi)
-    # The slightly tricky thing about PrefixContext is that they are applied
-    # from the outside in, so `PrefixContext{:a}(PrefixContext{:b}(ctx))` means
-    # that variables get prefixed like `a.b.x`.
-    # This motivates the implementation shown here, where the function
-    # `prefix_and_strip_contexts` is responsible for not only adding the
-    # prefixes, but also removing the `PrefixContext`s from the context stack
-    # so that they don't get applied twice when recursing.
-    # TODO(penelopeysm): It would be nice to switch this round, but it's a very
-    # tricky task. Essentially it forces us to use a foldr inside
-    # `prefix_and_strip_contexts`, rather than a foldl which is what most of
-    # DynamicPPL uses.
+    # Note that we can't use something like this here:
+    #     new_vn = prefix(context, vn)
+    #     return tilde_assume(childcontext(context), right, new_vn, vi)
+    # This is because `prefix` applies _all_ prefixes in a given context to a
+    # variable name. Thus, if we had two levels of nested prefixes e.g.
+    # `PrefixContext{:a}(PrefixContext{:b}(DefaultContext()))`, then the
+    # first call would apply the prefix `a.b._`, and the recursive call
+    # would apply the prefix `b._`, resulting in `b.a.b._`.
+    # This is why we need a special function, `prefix_and_strip_contexts`.
     new_vn, new_context = prefix_and_strip_contexts(context, vn)
     return tilde_assume(new_context, right, new_vn, vi)
 end
diff --git a/src/contexts.jl b/src/contexts.jl
@@ -281,6 +281,19 @@ end
 
 Same as `prefix`, but additionally returns a new context stack that has all the
 PrefixContexts removed.
+
+NOTE: This does _not_ modify any variables in any `ConditionContext` and
+`FixedContext` that may be present in the context stack. This is because this
+function is only used in `tilde_assume`, which is lower in the tilde-pipeline
+than `contextual_isassumption` and `contextual_isfixed` (the functions which
+actually use the `ConditionContext` and `FixedContext` values). Thus, by this
+time, any `ConditionContext`s and `FixedContext`s present have already served
+their purpose.
+
+If you call this function, you must therefore be careful to ensure that you _do
+not_ need to modify any inner `ConditionContext`s and `FixedContext`s. If you
+_do_ need to modify them, then you may need to use
+`prefix_cond_and_fixed_variables` instead.
 """
 function prefix_and_strip_contexts(ctx::PrefixContext{Prefix}, vn::VarName) where {Prefix}
     child_context = childcontext(ctx)

Original file line number	Diff line number	Diff line change
`@@ -1,4 +1,5 @@`
`1`	`1`	`[deps]`
	`2`	`+AbstractPPL = "7a57a42e-76ec-4ea3-a279-07e840d6d9cf"`
`2`	`3`	`Accessors = "7d9f7c33-5ae7-4f3b-8dc6-eff91059b697"`
`3`	`4`	`DataStructures = "864edb3b-99cc-5e75-8d2d-829cb0a9cfe8"`
`4`	`5`	`Distributions = "31c24e10-a181-5473-b8eb-7969acd0382f"`