Prometheus configuration and optimitzation for Netapp Harvest 2.0 #2563
Replies: 1 comment 6 replies
-
|
For long-term storage, consider integrating with Thanos, Cortex, or VictoriaMetrics. These systems are designed for durable, scalable, long-term storage of Prometheus metrics. Prometheus Sizing reference is available here VictoriaMetrics provides long-term storage capabilities and is designed to handle large amounts of data efficiently. It also supports downsampling, which reduces the amount of data stored over time, and it's compatible with Prometheus querying API, which means you can use Grafana or any other tool that works with Prometheus to visualize the data. Thanos and Cortex, on the other hand, are not just long-term storage solutions. They provide a horizontally scalable, highly available, multi-tenant, long-term storage for Prometheus. Thanos extends Prometheus for long-term storage while preserving Prometheus's query language. It's a good choice if you have multiple Prometheus instances and want to query them as if they were a single global instance. Thanos also supports downsampling and replication, which can improve query performance and reliability. Cortex provides horizontally scalable, multi-tenant, long-term storage for Prometheus. It's designed to support very large metric volumes across multiple tenants. Cortex allows for scaling the metric ingestion rate and query load by adding more nodes to the cluster. It's a good choice if you need to support a large number of users or teams, each with their own Prometheus-style API. NABox is a virtual appliance which facilitates the deployment of Harvest. The choice of deployment method ultimately depends on your personal preference. If you're more comfortable with Kubernetes, you can certainly opt for that. Some of our customers have found it necessary to expand their disk space in order to accommodate two years' worth of historical data in NABox. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I'm excited about Netapp Harvest 2.0 and with my team are planning to put it into production with the goal of renewing the version of Harvest 1.0 that we still have based on graphite and deployed on a server.
We want to take the opportunity to decommission the server and be able to run Netapp Harvest 2.0 on Kubernetes.
One of our concerns is the Prometheus deployment, since we don't have a lot of knowledge in this product and I'm worried about how we should size and configure it correctly so we don't have problems with memory.
Deploying a pooler for one of our clusters, I have seen that it has generated an exporter of 12MB, about 70000 series. Not bad at all! Extrapolating to all our clusters we would have to multiply this by 4.
Doing calculations of how much storage I need for my prometheus, I have thought that I need to do data aggregations in time to be able to maintain at least one year.
Something like this:
If I do the above, I might need about 200GB of local disk (otherwise, to store a year, I would need several TB of disk).
However, the problem that worries me is the memory, because I have been told that prometheus is not intended to store more than one or two weeks and if I were to store more I would need a lot of memory, not only to process the data but also to make queries.
How could I estimate the memory I would need to run Netapp Harvest? Do you have any example of prometheus configuration to optimize the storage (like the aggregation than I explained)? How is it usually done in other installations? Do you usually use a third-party (Thanos or whatever) to overflow more than two weeks?
On the other hand, I see that with Nabox, with the memory and disk requirements that has managed, you could save a history of 2 years!!! How do you do it? Is it worth more to install the NABOX OVA instead of deploying in kubernetes directly?
Am I worrying too much with a hypothetical prometheus monster when it should not be a problem? What is your opinion?
Best regards,
Oscar.
Beta Was this translation helpful? Give feedback.
All reactions