OpenTSDB should reliably operate on a single node #796

@IzakMarais

We run single-node OpenTSDB, with HBase writing to the local filesystem (RAID-backed) instead of HDFS, when deploying to smaller clusters. OpenTSDB easily handles the ingestion rate (about 7000 dps).
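
For context, a standalone HBase that stores its data on the local filesystem is typically configured roughly as in the sketch below. This is only an illustration of that kind of setup, not a copy of our actual file: the directory `/data/hbase` is a hypothetical placeholder.

```xml
<!-- hbase-site.xml: minimal sketch of a standalone, local-filesystem setup -->
<configuration>
  <property>
    <!-- file:// URI keeps HBase data on local disk instead of HDFS.
         /data/hbase is a placeholder path, assumed for illustration. -->
    <name>hbase.rootdir</name>
    <value>file:///data/hbase</value>
  </property>
  <property>
    <!-- false = standalone mode (all HBase daemons in a single JVM) -->
    <name>hbase.cluster.distributed</name>
    <value>false</value>
  </property>
</configuration>
```

Moving to HDFS would mean pointing `hbase.rootdir` at an `hdfs://` URI instead of a `file://` one.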

However, we have had repeated file-level corruption problems. Over the last few months, on our 2 test systems, an HBase 'tsdb' region has become stuck in a FAILED_OPEN state 5 times. The only way I could recover was to delete the region file from disk.
[Screenshot: regions_in_transition]

Is there something we can improve in our setup to avoid these errors? I am thinking about moving to HDFS. Is it possible and worthwhile to run a single-node HDFS (with multiple JBOD disks for reliability)?
