---
title: 'Processors'
description: |
    Processors pre-process the data collected by the receivers before they are exported by exporters. Processors can modify, batch or
    filter the data flowing through the pipeline.
path: '/docs/components/processors'
---

import SectionSeparator from "components/MdxSectionSeparator/sectionSeparator.jsx"

Processors are used in several stages of an OpenTelemetry collector pipeline to pre-process the data passing through it. In a processor, data can be modified, batched, filtered, or sampled. The ADOT collector supports a selected list of processors.
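
For context, a processor only takes effect when it is declared under `processors` *and* listed in a pipeline under `service`. A minimal sketch, assuming an OTLP receiver and the AWS X-Ray exporter (both illustrative):

```yaml
processors:
  batch:                       # declare the processor and its settings
    timeout: 5s

service:
  pipelines:
    traces:
      receivers: [otlp]        # illustrative receiver
      processors: [batch]      # the processor runs only if listed here
      exporters: [awsxray]     # illustrative exporter
```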

<SectionSeparator />

## Processors supported by ADOT collector

The ADOT collector supports the following processors:

* [Attributes processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/attributesprocessor#attributes-processor)
* [Batch processor](https://github.com/open-telemetry/opentelemetry-collector/tree/main/processor/batchprocessor#batch-processor)
* [Delta to Rate processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/deltatorateprocessor#delta-to-rate-processor)
* [Filter processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/filterprocessor#filter-processor)
* [Group by Trace processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/main/processor/groupbytraceprocessor/README.md)
* [Memory Limiter processor](https://github.com/open-telemetry/opentelemetry-collector/tree/main/processor/memorylimiterprocessor#memory-limiter-processor)
* [Metrics Generation processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/metricsgenerationprocessor#metrics-generation-processor)
* [Metrics Transform processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/metricstransformprocessor#metrics-transform-processor)
* [Probabilistic Sampling processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/probabilisticsamplerprocessor#probabilistic-sampling-processor)
* [Resource Detection processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/resourcedetectionprocessor#resource-detection-processor)
* [Resource processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/resourceprocessor#resource-processor)
* [Span processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/spanprocessor#span-processor)
* [Tail Sampling processor](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/tailsamplingprocessor/README.md)

## Notes on Group by Trace and Tail Sampling processors

To get the desired results from the Tail Sampling and Group by Trace processors, **do not place a Batch processor before these components in a pipeline**. A Batch processor placed before them can split spans belonging to the same trace across batches. This matters because these components try to collect all the spans of a trace; in the case of the Tail Sampling processor, this lets a single sampling decision apply to every span of the trace, producing a complete picture of the trace if it is sampled. A Batch processor immediately after these components causes no problems and is recommended to properly pre-process data for subsequent exporters.
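
As a compact illustration of this ordering (receiver and exporter names are illustrative):

```yaml
service:
  pipelines:
    traces:
      receivers: [otlp]
      # Group and sample first, then batch for the exporters.
      # Avoid [batch, groupbytrace, tail_sampling]: batching first can split
      # spans of the same trace before they are grouped.
      processors: [groupbytrace, tail_sampling, batch]
      exporters: [awsxray]
```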

You also need to make sure that all the spans for a given trace are processed by the same collector instance. This is especially important for a collector running in gateway mode.
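
One way to achieve this, assuming your collector distribution includes the trace-ID-aware load-balancing exporter from opentelemetry-collector-contrib, is a routing tier of collectors that forwards each trace to a fixed sampling-tier collector. A sketch (hostnames and endpoints are illustrative):

```yaml
exporters:
  loadbalancing:
    routing_key: "traceID"   # route by trace ID so all spans of a trace reach the same backend
    protocol:
      otlp:
        tls:
          insecure: true     # illustrative; configure TLS to match your environment
    resolver:
      static:
        hostnames:           # illustrative sampling-tier collector endpoints
          - sampling-collector-1:4317
          - sampling-collector-2:4317

service:
  pipelines:
    traces:
      receivers: [otlp]
      exporters: [loadbalancing]
```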

In addition, tune the `wait_duration` parameter of the Group by Trace processor and the `decision_wait` parameter of the Tail Sampling processor to be greater than or equal to the maximum expected latency of a trace in your system, plus a grace period for network latency between the application and the collector. This helps guarantee that the spans of the same trace are processed in the same batch.

Finally, to limit the number of traces kept in memory, we recommend placing the Group by Trace processor before the Tail Sampling processor. The Group by Trace processor implements a limit on the number of traces kept in memory, while this is not fully implemented in the Tail Sampling processor.

The Group by Trace processor drops the oldest trace when the `num_traces` limit is exceeded. `wait_duration` and `num_traces` should be scaled to the expected traffic of the monitored applications.

### Examples

If the maximum expected latency for a request in your application is 10s and the application handles at most 1000 requests per second, `wait_duration` should be set to 10s and `num_traces` should be set to at least 10000 (10 seconds * 1000 requests per second). We highly recommend monitoring the `otelcol_processor_groupbytrace_traces_evicted` metric from the collector's [self telemetry](https://opentelemetry.io/docs/collector/configuration/#service). If this metric is greater than zero, the collector is receiving more traffic than it can handle and you should increase `num_traces` accordingly.

Example configuration for the scenario above:
```yaml
processors:
  groupbytrace:
    wait_duration: 10s
    num_traces: 20000 # Double the max expected traffic (2 * 10 * 1000 requests per second)
  tail_sampling:
    decision_wait: 1s # This value should be smaller than wait_duration
    policies:
      - ..... # Applicable policies
  batch/tracesampling:
    timeout: 0s # No need to wait more since this will happen in previous processors
    send_batch_max_size: 8196 # This will still allow us to limit the size of the batches sent to subsequent exporters

service:
  pipelines:
    traces/tailsampling:
      receivers: [otlp]
      processors: [groupbytrace, tail_sampling, batch/tracesampling]
      exporters: [awsxray]
```

The Tail Sampling processor can also combine sampling policies. For example, to sample traces from a specific path in case of errors, you could use the following configuration:

```yaml
processors:
  tail_sampling:
    decision_wait: 1s
    policies:
      - name: and-policy
        type: and
        and:
          and_sub_policy:
            - name: path-policy
              type: string_attribute
              string_attribute:
                key: http.url
                values: ["\/users"]
                enabled_regex_matching: true
            - name: error-policy
              type: status_code
              status_code:
                status_codes: ["ERROR", "UNSET"]
```

In the next example, we sample 20% of the traces that contain an error:

```yaml
processors:
  tail_sampling:
    decision_wait: 1s
    policies:
      - name: and-policy
        type: and
        and:
          and_sub_policy:
            - name: error-policy
              type: status_code
              status_code:
                status_codes: ["ERROR", "UNSET"]
            - name: probabilistic-policy
              type: probabilistic
              probabilistic:
                sampling_percentage: 20
```

To see the full set of policy options available to the Tail Sampling processor, please refer to its [README](https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/processor/tailsamplingprocessor/README.md).