|
| 1 | +========================== |
| 2 | + Beginner's Guide |
| 3 | +========================== |
| 4 | + |
| 5 | +The purpose of A Beginner's Guide to Ceph is to make Ceph comprehensible. |
| 6 | + |
| 7 | +Ceph is a clustered and distributed storage manager. If that's too cryptic, |
| 8 | +then just think of Ceph as a computer program that stores data and uses a |
| 9 | +network to make sure that there is a backup copy of the data. |
| 10 | + |
| 11 | +Ceph offers several "storage interfaces" (ways of storing data). These include: |
| 12 | + |
| 13 | +- CephFS (a file system) |
| 14 | +- RBD (block devices) |
| 15 | +- RADOS (an object store). |
| 16 | + |
| 17 | +Deep down, though, all three of these are really RADOS object stores. CephFS |
| 18 | +and RBD are just presenting themselves as file systems and block devices. |
| 19 | + |
| 20 | +Ceph is a clustered and distributed storage manager that offers data |
| 21 | +redundancy. This sentence might be too cryptic for first-time readers of the |
| 22 | +Ceph Beginner's Guide, so let's explain all of the terms in it: |
| 23 | + |
| 24 | +- **Storage Manager.** Ceph is a storage manager. This means that Ceph is |
| 25 | + software that helps storage resources store data. Storage resources come in |
| 26 | + several forms: hard disk drives (HDD), solid-state drives (SSD), magnetic |
| 27 | + tape, floppy disks, punched tape, Hollerith-style punch cards, and magnetic |
| 28 | + drum memory are all forms of storage resources. In this beginner's guide, |
| 29 | + we'll focus on hard disk drives (HDD) and solid-state drives (SSD). |
| 30 | +- **Clustered storage manager.** Ceph is a clustered storage manager. That means |
| 31 | + that the storage manager is installed not just on a single machine but on |
| 32 | + several machines that work together as a system. |
| 33 | +- **Distributed storage manager.** Ceph is a clustered and distributed storage |
| 34 | + manager. That means that the data that is stored and the infrastructure that |
| 35 | + supports it are spread across multiple machines and are not centralized in a |
| 36 | + single machine. To better understand what distributed means in this context, |
| 37 | + it might be helpful to describe what it is not: it is not a system like iSCSI, |
| 38 | + which exposes a single logical disk over the network in a |
| 39 | + 1:1 (one-to-one) mapping. |
| 40 | +- **Data Redundancy.** Data redundancy means having a second copy of your data somewhere. |
| 41 | + |
| 42 | +Ceph Monitor |
| 43 | +------------ |
| 44 | + |
| 45 | +The Ceph Monitor is one of the daemons essential to the functioning of a Ceph |
| 46 | +cluster. Monitors know the location of all the data in the Ceph cluster. |
| 47 | +Monitors maintain maps of the cluster state, and those maps make it possible |
| 48 | +for Ceph daemons to work together. These maps include the monitor map, the OSD |
| 49 | +map, the MDS map, and the CRUSH map. Quorum is a state that is necessary for |
| 50 | +a Ceph cluster to work properly; it means that a majority of the monitors are |
| 51 | +in the "up" state. At least three monitors are recommended for redundancy. |
| 52 | + |
| 53 | +Manager |
| 54 | +------- |
| 55 | +The manager balances the data in the Ceph cluster, distributing load evenly so |
| 56 | +that no part of the cluster gets overloaded. The manager is one of the daemons |
| 57 | +essential to the functioning of the Ceph cluster. Managers keep track of |
| 58 | +runtime metrics, system utilization, CPU performance, and disk load, and they |
| 59 | +host the Ceph dashboard web GUI. |
| 60 | + |
| 61 | +OSD |
| 62 | +--- |
| 63 | + |
| 64 | +Object Storage Daemons (OSDs) store objects. |
| 65 | + |
| 66 | +An OSD is a process that runs on a storage server. The OSD is responsible for |
| 67 | +managing a single unit of storage, which is usually a single disk. |
| 68 | + |
| 69 | +Pools |
| 70 | +----- |
| 71 | + |
| 72 | +A pool is an abstraction that can be designated as either "replicated" or |
| 73 | +"erasure coded". In Ceph, the method of data protection is set at the pool |
| 74 | +level. Ceph supports two types of data protection: replication and |
| 75 | +erasure coding. Objects are stored in pools. "A storage pool is a collection of |
| 76 | +storage volumes. A storage volume is the basic unit of storage, such as |
| 77 | +allocated space on a disk or a single tape cartridge. The server uses the |
| 78 | +storage volumes to store backed-up, archived, or space-managed files." (IBM |
| 79 | +Tivoli Storage Manager, Version 7.1, "Storage Pools") |
| 80 | + |
| 81 | +Placement Groups |
| 82 | +---------------- |
| 83 | + |
| 84 | +Placement groups are a part of pools. |
| 85 | + |
| 86 | +MDS |
| 87 | +--- |
| 88 | +A metadata server (MDS) is necessary for the proper functioning of CephFS. |
| 89 | +See :ref:`orchestrator-cli-cephfs` and :ref:`arch-cephfs`. |
| 90 | + |
| 91 | + |
| 92 | + |