Skip to content

[BUG] Persistent Hard Crashing with XFS Temp Drive in MDADM RAID0Β #6350

@andrewseid

Description

@andrewseid

Describe the bug
XFS is known to be the fastest format for plotting, and testing proves this out. However, it also seems to result in reliable hard crashes on Ubuntu, usually within the first day of beginning plotting. I have experienced this issue about 15 times, on a mix of Ubuntu GUI 20.04, Ubuntu GUI 21.04, and Ubuntu Server 21.04. I've experienced it on three different systems, two AMD builds (3960X and 3990X), and one Intel build (i7-11700K). All systems have been using between two and four Samsung 980 Pro NVMe drives in MDADM RAID0.

The issue seems to go away when I format the temp drive RAID0 array with ext4.

To Reproduce

  1. Create an XFS MDADM RAID0 array on Ubuntu 20.04 or 21.04 (GUI or server, doesn't matter), using 2-4 NVMe drives (in my case, Samsung 980 Pro 2TB, running on PCIe Gen 4).
  2. Start 10+ plotting queues with -n 5 -r, depending on system specs.
  3. Let system plot for 24-48 hours.

Expected behavior
Observe eventual hard crash.

Screenshots
On Ubuntu GUI, the desktop just completely freezes wherever it is.
On Ubuntu Server, I got this:
IMG_8562

Desktop:

  • OS: Ubuntu GUI 20.04, Ubuntu GUI 21.04, Ubuntu Server 21.04
  • CPU: AMD 3990X, AMD 3960X, Intel i7-11700K
  • NVMe: 2-4 Samsung 980 Pro 2TB in MDADM RAID0

Additional context
Random theory that you can feel free to ignore: since this is an extremely high performance setup, maybe it's hitting some kind of performance threshold or race condition during plotting? Or maybe it's something else entirely XD Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions