Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 4 additions & 4 deletions docs-gb/deployments/abtests.md
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ In Seldon we provide two current examples on how to run Iter8 experiments.

The first option is to create an AB Test for the candidate model with an updated Seldon Deployment and run an Iter8 experiment to progressively rollout the candidate based on a set of metrics. The architecture is show below:

![seldonIter8Single](seldon-iter8-single.png)
![seldonIter8Single](../images/seldon-iter8-single.png)


We begin by updating our default model to start an AB test as shown below:
Expand Down Expand Up @@ -332,15 +332,15 @@ quickstart-exp A/B iris Completed 15 ExperimentCo

In the above examples a final stage promotion action is defined for the successful candidate to be updated to the new default Seldon deployment.

As a next step [run the notebook running through this example](../examples/iter8-single.html).
As a next step [run the notebook running through this example](../notebooks/iter8-single.md).

## Seldon/Iter8 Experiment over separate Seldon Deployments

We can also run experiments over separate Seldon Deployments. This though would require the creation in your service mesh of choice for a routing rule that Iter8 can modify to push traffic to each Seldon Deployment.

The architecture for this type of experiment is shown below:

![seldonIter8Separate](seldon-iter8-separate.png)
![seldonIter8Separate](../images/seldon-iter8-separate.png)

The difference here is we have two Seldon Deployments. A baseline:

Expand Down Expand Up @@ -498,4 +498,4 @@ spec:

The progression of the experiment is very similar with in this case the best model be promoted onto of the existing default baseline.

As a next step [run the notebook running through this example](../examples/iter8-separate.html).
As a next step [run the notebook running through this example](../notebooks/iter8-separate.md).
4 changes: 2 additions & 2 deletions docs-gb/deployments/metadata.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# Model and Deployment Metadata

![metadata](./metadata.svg)
![metadata](../images//metadata.svg)


## Examples
Expand Down Expand Up @@ -122,7 +122,7 @@ Model metadata allow you to specify metadata for each of the components (nodes)
New orchestrator engine will probe all nodes for their metadata and derive global `inputs` and `outputs` of your graph.
It will then expose them together with all nodes' metadata at a single endpoint `/api/v1.0/metadata/` of your deployment.

![graph-metadata](./graph-metadata.svg)
![graph-metadata](../images/graph-metadata.svg)

Example response:
```json
Expand Down
Binary file added docs-gb/images/adversarial-attack.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/anchors.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/batch-processor.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/batch-workflow-managers.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/cicd-seldon.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/core-logo-large.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/core-logo-small.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/core-logo.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/core-play-logo.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/deploy-logo.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/e2e-model-serving.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions docs-gb/images/e2e-model-serving.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/inf-graph.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/jaeger-ui-grpc-example.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/jaeger-ui-rest-example.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/kibana-custom-search.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions docs-gb/images/language-wrappers-1.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions docs-gb/images/language-wrappers-2.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions docs-gb/images/metadata.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/metrics.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions docs-gb/images/model-servers.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions docs-gb/images/operator.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions docs-gb/images/orchestrator.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/prometheus-graph.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs-gb/images/prometheus-targets.png
Binary file added docs-gb/images/rest-openapi.jpg
Binary file added docs-gb/images/seldon-core-high-level.jpg
Binary file added docs-gb/images/seldon-iter8-separate.png
Binary file added docs-gb/images/seldon-iter8-single.png
1 change: 1 addition & 0 deletions docs-gb/images/seldondeployment-crd.svg

Large diffs are not rendered by default.

Binary file added docs-gb/images/stream-processing-knative.jpg
Binary file added docs-gb/images/svcOrch1.png
1 change: 1 addition & 0 deletions docs-gb/images/tracing.svg

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions docs-gb/images/why_not_flask.svg

Large diffs are not rendered by default.

5 changes: 3 additions & 2 deletions docs-gb/install/advanced-helm-chart-configuration.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,8 @@

This page provides a detailed overview of the installation parameters available for the Seldon Core installation when using Helm 3.x. The high level workflows to install Seldon Core can be found in the [Installation Page](../workflow/install.md).

You can find the `values.yaml` file of the [seldon-core-operator Helm chart](https://github.com/SeldonIO/seldon-core/tree/master/helm-charts/seldon-core-operator), which contains basically all the values that you can configure in your installation by using the `set` flag in the format `--set value.path=YOUR_VALUE`.
You can find the `values.yaml` file of the [seldon-core-operator Helm chart](https://github.com/SeldonIO/seldon-core/helm-charts/seldon-core-operator), which contains basically all the values that you can configure in your installation by using the `set` flag in the format `--set value.path=YOUR_VALUE`.

The file has been written to be self documented, and has information on all the core parameters. Further information is referenced in the file to specific documentation pages.
{% @github-files/github-code-block url="https://github.com/SeldonIO/seldon-core/tree/master/helm-charts/seldon-core-operator/values.yaml" %}

{% @github-files/github-code-block url="https://github.com/SeldonIO/seldon-core/blob/master/helm-charts/seldon-core-operator/values.yaml" %}
20 changes: 10 additions & 10 deletions docs-gb/overview.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@ Keep reading to learn more!

With `Seldon Core` you can take your model and put it directly into the production using our flexible `Model Servers`.

![](../images/e2e-model-serving.svg)
![](../docs-gb/images/e2e-model-serving.svg)

Using the so-called `Reusable Model Servers` you can deploy your models into Kubernetes cluster in just a few steps:

Expand All @@ -48,7 +48,7 @@ Each of these are useful depending on the context and the actual use case.
Does not require the central repository but requires a build of a new image for every model.


![](../images/model-servers.svg)
![](../docs-gb//images/model-servers.svg)

Read more about our pre-packaged `Model Servers` on their dedicated documentation pages:
- [MLflow Server](../servers/mlflow.html)
Expand All @@ -64,7 +64,7 @@ Language wrappers allows Seldon Core users to build `Reusable` and `Non-Reusable
As you will see, the whole process is very simple and requires user to only define logic that
loads models and perform inference prediction as well as the required runtime dependencies.

![](../images/language-wrappers-1.svg)
![](../docs-gb/images/language-wrappers-1.svg)


Model loading and inference logic is defined in `Model.py` file:
Expand Down Expand Up @@ -103,7 +103,7 @@ $ curl http://localhost:9000/api/v1.0/predictions \
}
```

![](../images/language-wrappers-2.svg)
![](../docs-gb/images/language-wrappers-2.svg)

To complete containerisation process you need two more components:
- `requirements.txt` file that describes your runtime dependencies
Expand All @@ -130,7 +130,7 @@ In Seldon Core we use CRDs to define the inference graph through the manifest ya
The manifest file that you write is very powerful yet simple.
You can easily define what models do you want in your deployment and how they are connected in the inference graph.

![](../images/seldondeployment-crd.svg)
![](../docs-gb//images/seldondeployment-crd.svg)

You can think about the CRD as an abstraction around the actual deployment and services that are created in the cluster.
Once the manifest is applied to the cluster, Seldon Core `Operator` creates all Kubernetes objects required to serve the inference requests.
Expand All @@ -149,7 +149,7 @@ It works according to the common Kubernetes operator pattern - in a continues lo
- `diff` against desired state
- if necessary `act` to apply desired state

![](../images/operator.svg)
![](../docs-gb/images/operator.svg)


## Service Orchestrator
Expand All @@ -159,7 +159,7 @@ It reads the inference graph structure from the `CRD` and when inference request

It is because of the presence of `Service Orchestrator` that complex graph components like `routers`, `combiners` and output/input `transformers` are available in the `Seldon` world.

![](../images/orchestrator.svg)
![](../docs-gb/images/orchestrator.svg)

`Service Orchestrator` is also responsible for providing many advance features out of the box:
- `Jaeger` tracing
Expand All @@ -179,7 +179,7 @@ You can easily version your model and describe its expected inputs and outputs.
These allow you to make connection to the platform you trained your model with (DVC, Pachyderm, ...)
and know what inputs / outputs you can expect from your inference graph.

![](../images/metadata.svg)
![](../docs-gb/images/metadata.svg)

Read more about [metadata provenance on its dedicated documentation page](../reference/apis/metadata.html).

Expand All @@ -189,7 +189,7 @@ Read more about [metadata provenance on its dedicated documentation page](../ref
Metrics is important aspect of serving ML inference models in production.
Out of the box Seldon Core deployments expose standard metrics to [Prometheus](https://prometheus.io/) on the `Service Orchestrator`.

![](../images/metrics.png)
![](../docs-gb/images/metrics.png)

Read more about [metrics on its dedicated documentation page](../analytics/analytics.html).

Expand All @@ -198,7 +198,7 @@ Read more about [metrics on its dedicated documentation page](../analytics/analy

By default, we support [Jaeger](https://www.jaegertracing.io/) for Distributed Tracing.

![](../images/tracing.svg)
![](../docs-gb/images/tracing.svg)

Read more about [tracing on its dedicated documentation page](../graph/distributed-tracing.html).

Expand Down
2 changes: 1 addition & 1 deletion docs-gb/routing/inference-graph.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ Seldon Core extends Kubernetes with its own custom resource SeldonDeployment whe

A SeldonDeployment is a JSON or YAML file that allows you to define your graph of component images and the resources each of those images will need to run (using a Kubernetes PodTemplateSpec). The parts of a SeldonDeployment are shown below:

![inference-graph](./inf-graph.png)
![inference-graph](../images/inf-graph.png)

A minimal example for a single model, this time in YAML, is shown below:
```yaml
Expand Down
5 changes: 3 additions & 2 deletions docs-gb/servers/huggingface.md
Original file line number Diff line number Diff line change
Expand Up @@ -94,5 +94,6 @@ spec:
replicas: 1
```

.. note::
As a next step, why not try running a larger-scale model? You can find one in gs://seldon-models/v1.18.0/huggingface/text-gen-custom-gpt2. However, you may need to request more memory!
{% hint style="note" %}
As a next step, why not try running a larger-scale model? You can find one in gs://seldon-models/v1.18.0/huggingface/text-gen-custom-gpt2. However, you may need to request more memory!
{% endhint %}
Loading