Storage¶
- Storage description
- Storage types
- Storage capacity
- Fault tolerance
- Volume description
- Available volume operations
Storage description
Data storage system
The storage system in the SIM-Cloud is built from software-defined storage (SDS). The best solution for SIMCloud tasks is a Ceph-based solution.
Ceph is a block and file access object storage area network (SAN) that is managed by free and open source software.
The Ceph distributed file system provides the following capabilities:
- replication (storage of spare duplicates of each data block in different network nodes);
- distribution of workload between nodes;
- creating duplicate data blocks (instead of unavailable ones) on other nodes, in the event of a disk, node or node group failure;
- rebalancing of data arrays when deleting (failure) nodes or adding new ones to the network (restoration);
- increase in performance due to caching (transfer of the most requested data and file system logs to the fastest disks).
All this guarantees high availability and reliability of the data storage system.
Storage types
There are 2 storage types (cs1 and gs1) available in SIM-Cloud
- gs1 – general storage, universal storage; it is used for most services, it
- is also suitable for high-load applications with intensive data exchange.
cs1 – storage (cold storage), intended for low load tasks.
Storage capacity
For the convenience of comparing storages, we have placed their qualitative characteristics in a tabular form:
Storage type | Bandwith | IOPS | Volume Size |
cs1 | 200MB/s | 750/s | 1GB up to 2000GB |
gs1 | 300MB/s | 10000/s | 1GB up to 2000GB |
- Carrying capacity - “Bandwidth”;
- The number of input-output operations - “IOPS”;
- Volume - “Volume Size”.
As can be seen from the table, gs1 storage has a higher throughput and significantly outperforms cs1 storage in terms of the number of I/O operations, which makes it sought after for heavily loaded systems with intensive data exchange.
Fault tolerance
All data in the SIM-Cloud storage system is replicated with a x2 factor (each identical data block is located on two different physical nodes placed in two different racks in the data centre).
Each project is assigned a specific quota for storage type cs1 or gs1. The amount of the required quota is determined by the owner of the project. Additionally, the user may increase the storage capacity.
Within the available quota of a particular storage type, a project user may create volumes, and create and launch instances on their basis.
Note
Please note that the documentation does not use the term “disk” as a physical device (hard disk), but instead use more precise definitions - “storage” or “volume” depending on the context. First of all, this is due to the fact that the main qualitative characteristics of a physical disk (bandwidth, IOPS) differ from the characteristics of a volume as understood by the openstack service. In addition, the terms “volume” or “storage” already implies the use of redundancy, due to which the fault tolerance and reliability of the block storage device is achieved.
Hereafter we will use only the terms “storage” or “volume”, and in exceptional cases, for better understanding, it is possible to use the term “disk”, please take note of this when reading our documentation.
Volume description
A volume is a block device based on cs1 or gs1 storage. Depending on the storage in which block devices are created, volumes inherit their performance metrics. This means that the volume created in the cs1 storage will have the following characteristics: bandwidth: 200MB/s, the number of IOPS is 750 IOPS, and the volume created in the gs1 storage will have the throughput: 300MB/s, the number of input-output operations is 10,000 IOPS.
One volume may be attached to only one instance at a time, while multiple volumes may be attached to the same instance. This means that you may mount multiple volumes and distribute your data between them to increase I/O resources and increase throughput. This is especially useful for database applications that often experience a large number of random read and write operations to a dataset.
The user may create both a boot (system) volume, from which the instance will be started in the future, or a “clean” volume - an empty volume that does not contain a file system or partition table. An “empty” volume may be attached to an instance as an additional volume.
Available volume operations
The project user has access to the following volume operations:
- Creating a volume
- Editing a volume
- Attaching a volume to an instance
- Disabling a volume
- Deleting a volume
- Extending a volume
- Changing a volume type
- Transferring a volume
- Taking a snapshot of a volume
- Deleting a snapshot
- Creating an image from a snapshot
You can learn more about the above volume operations in our article.