For an in-depth look at the different batch notions in BoTorch, take a look at
the [Batching in BoTorch](batching) section.

## Optimizing Acquisition Functions

While BoTorch tries to align as closely with PyTorch as possible, optimization
of acquisition functions requires a somewhat different approach. We now describe
this discrepancy and explain in detail why we made this design decision.

In PyTorch, modules typically map (batches of) data to an output, where the
mapping is parameterized by the parameters of the modules (often the weights

optimizing a model with these algorithms is by extracting the module's
parameters (e.g. using `parameters()`), and writing a manual optimization loop
that calls `step()` on a torch `Optimizer` object.
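
For contrast, here is a minimal sketch of that standard pattern (the module,
data, and hyperparameters below are illustrative placeholders, not from this
document):

```python
import torch

# Placeholder module and a mini-batch of toy data.
model = torch.nn.Linear(4, 1)
X, y = torch.randn(64, 4), torch.randn(64, 1)

# Standard first-order loop: extract parameters() and step a torch Optimizer.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-2)
for _ in range(100):
    optimizer.zero_grad()
    loss = torch.nn.functional.mse_loss(model(X), y)
    loss.backward()
    optimizer.step()
```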

Optimizing acquisition functions is different since the problem dimensionality
is often much smaller. Indeed, optimizing over $q$ design points in a
$d$-dimensional feature space results in $qd$ scalar parameters to optimize
over. Both $q$ and $d$ are often quite small, and hence so is the dimensionality
of the problem.
Moreover, the optimization problem can be cast as a deterministic one (either
because an analytic acquisition function is used, or because the
reparameterization trick is employed to render the Monte-Carlo-based evaluation
of the acquisition function deterministic in terms of the input tensor $X$).
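
To illustrate (a minimal sketch, not from this document: the toy data and the
unfitted `SingleTaskGP` are placeholders, and the sampler constructor follows
recent BoTorch releases, where older releases take `num_samples` instead of
`sample_shape`), fixing the sampler's base samples via a seed makes the
Monte-Carlo estimate of qEI a deterministic, differentiable function of the
candidate tensor $X$:

```python
import torch
from botorch.models import SingleTaskGP
from botorch.acquisition.monte_carlo import qExpectedImprovement
from botorch.sampling import SobolQMCNormalSampler

# Toy data and an (unfitted) GP surrogate -- placeholders for illustration only.
train_X = torch.rand(10, 2, dtype=torch.double)
train_Y = train_X.sum(dim=-1, keepdim=True).sin()
model = SingleTaskGP(train_X, train_Y)

# A fixed seed fixes the base samples, so the QMC estimate of qEI becomes a
# deterministic function of X.
sampler = SobolQMCNormalSampler(sample_shape=torch.Size([256]), seed=0)
qEI = qExpectedImprovement(model=model, best_f=train_Y.max(), sampler=sampler)

# q = 3 design points in d = 2 dimensions -> q * d = 6 scalar decision variables.
X = torch.rand(1, 3, 2, dtype=torch.double, requires_grad=True)
value = qEI(X)    # deterministic in X given the fixed base samples
value.backward()  # exact gradients w.r.t. all six entries of X via autograd
```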
As a result, optimization algorithms that are typically inadmissible for
problems such as training Neural Networks become promising alternatives to
standard first-order methods. In particular, this includes quasi-second order
methods (such as L-BFGS or SLSQP) that approximate local curvature of the
acquisition function by using past gradient information.
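
In practice, these optimizers are usually invoked through BoTorch's
`optimize_acqf` helper, which multi-starts a `scipy.optimize` routine (L-BFGS-B
for box-constrained problems) on the acquisition surface. A minimal sketch, with
a toy analytic EI on a placeholder (unfitted) GP standing in for a real fitted
model:

```python
import torch
from botorch.models import SingleTaskGP
from botorch.acquisition.analytic import ExpectedImprovement
from botorch.optim import optimize_acqf

# Placeholder surrogate on toy data (in practice the GP hyperparameters
# would be fit first).
train_X = torch.rand(10, 2, dtype=torch.double)
train_Y = train_X.sum(dim=-1, keepdim=True).sin()
model = SingleTaskGP(train_X, train_Y)

EI = ExpectedImprovement(model=model, best_f=train_Y.max())

bounds = torch.tensor([[0.0, 0.0], [1.0, 1.0]], dtype=torch.double)  # 2 x d box bounds
candidate, acq_value = optimize_acqf(
    acq_function=EI,
    bounds=bounds,
    q=1,              # one design point, so q * d = 2 decision variables
    num_restarts=10,  # multi-start to mitigate local optima
    raw_samples=128,  # raw candidates used to pick the restart points
)
```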
These methods are currently not well supported in the `torch.optim` package,
which is why BoTorch provides a custom interface that wraps the optimizers from
the `scipy.optimize` module.

BoTorch is a library for Bayesian Optimization research built on top of
[PyTorch](https://pytorch.org/), and is part of the PyTorch ecosystem.

Bayesian Optimization (BayesOpt) is an established technique for sequential
optimization of costly-to-evaluate black-box functions. It can be applied to a
wide variety of problems, including hyperparameter optimization for machine
learning algorithms, A/B testing, as well as many scientific and engineering
problems.

BoTorch is best used in tandem with [Ax](https://ax.dev), Facebook's open-source
adaptive experimentation platform, which provides an easy-to-use interface for
defining, managing and running sequential experiments, while handling
(meta-)data management, transformations, and systems integration. Users who just
want an easy-to-use suite for Bayesian Optimization
[should start with Ax](https://ax.dev/docs/bayesopt).

## Why BoTorch?

### Improved Developer Efficiency

BoTorch provides a modular and easily extensible interface for composing
Bayesian Optimization primitives, including probabilistic models, acquisition
functions, and optimizers.

It significantly improves developer efficiency by utilizing quasi-Monte-Carlo
acquisition functions (by way of the "re-parameterization trick"
[^AutoEncVarBayes], [^ReparamAcq]), which makes it straightforward to implement
new ideas without having to impose restrictive assumptions about the underlying
model. Specifically, it avoids pen and paper math to derive analytic expressions