# BayesFlow

Welcome to our BayesFlow library for amortized simulation-based Bayesian inference.

For starters, check out some of our walk-through notebooks:

1. [Basic amortized posterior estimation](docs/source/tutorial_notebooks/Intro_Amortized_Posterior_Estimation.ipynb)
2. [Intermediate posterior estimation](docs/source/tutorial_notebooks/Covid19_Initial_Posterior_Estimation.ipynb)
3. [Posterior estimation for ODEs](docs/source/tutorial_notebooks/Linear%20ODE%20system.ipynb)

## Project Documentation

The project documentation is available at <http://bayesflow.readthedocs.io>

## Conceptual Overview

A cornerstone idea of amortized Bayesian inference is to employ generative neural networks for parameter estimation, model comparison, and model validation when working with intractable simulators whose behavior as a whole is too complex to be described analytically. The figure below presents a high-level overview of neurally bootstrapped Bayesian inference.

<img src="img/high_level_framework.png" width=80% height=80%>

## Parameter Estimation

The original BayesFlow approach for amortized parameter estimation is based on our paper:

Radev, S. T., Mertens, U. K., Voss, A., Ardizzone, L., & Köthe, U. (2020). BayesFlow: Learning complex stochastic models with invertible neural networks. <em>IEEE Transactions on Neural Networks and Learning Systems</em>, available for free at: https://arxiv.org/abs/2003.06281.

However, we have since substantially extended the BayesFlow library, so that it is now much more general and cleaner than what we describe in the above paper.

### Minimal Example

```python
import numpy as np
import bayesflow as bf
```

To introduce you to the basic workflow of the library, let's consider a simple 2D Gaussian model for which we want to perform posterior inference. We assume a Gaussian simulator (likelihood) and a Gaussian prior for the means of the two components, which are our only model parameters in this example:

```python
def simulator(theta, n_obs=50, scale=1.0):
    return np.random.default_rng().normal(loc=theta, scale=scale, size=(n_obs, theta.shape[0]))

def prior(D=2, mu=0., sigma=1.0):
    return np.random.default_rng().normal(loc=mu, scale=sigma, size=D)
```
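
Both functions are plain NumPy; for instance, one draw from the joint model looks like this (the shape comments follow directly from the definitions above):

```python
theta = prior()          # one parameter vector, shape (2,)
x = simulator(theta)     # one simulated data set, shape (50, 2)
print(theta.shape, x.shape)
```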

Then, we connect the `prior` with the `simulator` using a `GenerativeModel` wrapper:

```python
generative_model = bf.simulation.GenerativeModel(prior, simulator)
```
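
Calling the resulting object with a batch size simulates that many parameter-data pairs at once (we use this below with `generative_model(200)`). As a quick sanity check, we can inspect a small batch; the exact structure of the returned container is an assumption here and may vary across versions:

```python
sims = generative_model(16)  # simulate 16 parameter-data pairs from the joint model
print(type(sims))            # inspect the returned container of prior draws and data
```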

Next, we create our BayesFlow setup consisting of a summary and an inference network:

```python
summary_net = bf.networks.InvariantNetwork()
inference_net = bf.networks.InvertibleNetwork(num_params=2)
amortizer = bf.amortizers.AmortizedPosterior(inference_net, summary_net)
```
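
Here, the `InvariantNetwork` learns fixed-size summary statistics from data sets of variable size, and the `InvertibleNetwork` approximates the posterior over the two model parameters given these summaries. Both constructors take further optional arguments; as an illustration, one might deepen the invertible network (the keyword below is an assumption about the constructor, not confirmed API):

```python
# Hypothetical sketch: a deeper invertible network; the keyword name is an assumption
inference_net = bf.networks.InvertibleNetwork(num_params=2, num_coupling_layers=8)
```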

Finally, we connect the networks with the generative model via a `Trainer` instance:

```python
trainer = bf.trainers.Trainer(amortizer=amortizer, generative_model=generative_model)
```

We are now ready to train an amortized posterior approximator. For instance, to run online training, we simply call:

```python
losses = trainer.train_online(epochs=10, iterations_per_epoch=500, batch_size=32)
```
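
The returned `losses` object records the training history, which is worth inspecting for convergence before trusting the approximator. A minimal sketch, assuming a loss-plotting helper in the diagnostics module (the function name is an assumption):

```python
# Hypothetical helper name; alternatively, plot the recorded losses manually
fig = bf.diagnostics.plot_losses(losses)
```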

Before inference, we can use simulation-based calibration (SBC, https://arxiv.org/abs/1804.06788) to check the computational faithfulness of the model-amortizer combination:

```python
fig = trainer.diagnose_sbc_histograms()
```

<img src="img/showcase_sbc.png" width=65% height=65%>

The histograms are roughly uniform and lie within the expected range for well-calibrated inference algorithms, as indicated by the shaded gray areas. Accordingly, our amortizer seems to have converged to the intended target.

Amortized inference on new (real or simulated) data is then easy and fast. For example, we can simulate 200 new data sets and generate 500 posterior draws per data set:

```python
new_sims = trainer.configurator(generative_model(200))
posterior_draws = amortizer.sample(new_sims, n_samples=500)
```
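
Since the draws are returned as a NumPy array, common posterior summaries reduce to one-liners (here we assume the axes are ordered as data sets, draws, parameters):

```python
post_means = posterior_draws.mean(axis=1)  # posterior means, shape (200, 2)
post_stds = posterior_draws.std(axis=1)    # posterior standard deviations, shape (200, 2)
```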

We can then quickly inspect how well the model can recover its parameters across the simulated data sets:

```python
fig = bf.diagnostics.plot_recovery(posterior_draws, new_sims['parameters'])
```

<img src="img/showcase_recovery.png" width=65% height=65%>
78 | 116 |
|
79 | | -Or we can look at single posteriors in relation to the prior: |
| 117 | +For any individual data set, we can also compare the parameters' posteriors with |
| 118 | +their corresponding priors: |
80 | 119 |
|
81 | 120 | ```python |
82 | 121 | fig = bf.diagnostics.plot_posterior_2d(posterior_draws[0], prior=generative_model.prior) |
83 | 122 | ``` |
84 | 123 |
|
85 | | -<img src="https://github.com/stefanradev93/BayesFlow/blob/Future/img/showcase_posterior.png" width=45% height=45%> |
| 124 | +<img src="img/showcase_posterior.png" width=45% height=45%> |
86 | 125 |
|
87 | | -### Further Reading |
| 126 | +We see clearly how the posterior shrinks relative to the prior for both |
| 127 | +model parameters as a result of conditioning on the data. |
88 | 128 |
|
89 | | -Coming soon... |
| 129 | +### References and Further Reading |

- Radev, S. T., Mertens, U. K., Voss, A., Ardizzone, L., & Köthe, U. (2020). BayesFlow: Learning complex stochastic models with invertible neural networks. <em>IEEE Transactions on Neural Networks and Learning Systems</em>, available for free at: https://arxiv.org/abs/2003.06281

## Model Misspecification

What if we are dealing with misspecified models? That is, how faithful is our amortized inference if the generative model is a poor representation of reality? A modified loss function optimizes the learned summary statistics towards a unit Gaussian and reliably detects model misspecification at inference time.

<img src="img/model_misspecification_amortized_sbi.png" width=100%>

### References and Further Reading

- Schmitt, M., Bürkner, P. C., Köthe, U., & Radev, S. T. (2022). Detecting model misspecification in amortized Bayesian inference with neural networks. <em>ArXiv preprint</em>.

## Model Comparison

Coming soon...

### References and Further Reading

- Radev, S. T., D’Alessandro, M., Mertens, U. K., Voss, A., Köthe, U., & Bürkner, P. C. (2021). Amortized Bayesian model comparison with evidential deep learning. <em>IEEE Transactions on Neural Networks and Learning Systems</em>. doi:10.1109/TNNLS.2021.3124052, available for free at: https://arxiv.org/abs/2004.10629

## Likelihood Emulation

Coming soon...