Improve memory consumption of the log sampler

Hi!

The [zapcore/sampler.go](https://github.com/uber-go/zap/blob/07077a697f639389cc998ff91b8885feb25f520d/zapcore/sampler.go) allocates ~0.5MB per instance. 

That's quite a bit considering that some binaries instantiate quite a few. For example, [OpenTelemetry collectors monitoring Kubernetes cluters](https://opentelemetry.io/docs/platforms/kubernetes/collector/) can instantiate tens of thousands of those.

Here I suggest an update that divides the memory consumption by 7 and improves performance, with very little drawback.

### Details

The sampler allocates `numberOfLogLevels * 4096` instances of the `counter` structure (128 bits). There are [7 different log levels](https://github.com/uber-go/zap/blob/master/zapcore/level.go#L34-L55). Once an log message is emitted, it's hashed to one of those 7*4096 buckets.

In practice, all 7 log levels are not used uniformly (far from it), and some (e.g. DPanic, Panic, Fatal) re barely used at all.

I thus propose to only instantiate 4096 instances of the `counter` structure (dividing the memory footprint in 7) and include the log level as an input to the hash function. Since the 7 log levels are use very unevenly, this only marginally impact the risks of collision while improving cache locality.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve memory consumption of the log sampler #1516

Details

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Improve memory consumption of the log sampler #1516

Description

Details

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions