A Technical Analysis of Six Cloud Storage System Technologies

As the surveillance field develops rapidly, new technologies keep emerging, and cloud storage is among the most welcomed of them. It rests on the following major technologies.

Cloud storage is a new concept that extends the concept of cloud computing. It refers to a system that uses application software, together with functions such as cluster applications, grid technology, or distributed file systems, to aggregate a large number of different types of storage devices on the network so that they work together to provide data storage and business access functions. When the core task of a cloud computing system is storing and managing large amounts of data, the system must be configured with a large number of storage devices, and it effectively becomes a cloud storage system. Cloud storage is therefore a cloud computing system with data storage and management at its core.

Compared with a general cloud computing system, a cloud storage system can be regarded as a cloud computing system configured with large storage capacity. It has the following characteristics: data security, high scalability, usage-based charging, automatic failover across different applications, and easy management. Cloud storage is mainly used in four areas: backup, archiving, distribution, and shared collaboration.

A cloud storage system is a collection of multiple devices, applications, and services working together, and its implementation builds on several technologies. Given the characteristics of cloud storage and its application areas, the main technologies involved are storage virtualization, distributed file systems, cluster storage, centralized storage management, heterogeneous platform collaboration, and automatic tiered storage, along with deduplication, data compression, and other supporting technologies.

Storage virtualization

Storage virtualization is most commonly understood as the abstraction of storage hardware resources. It provides a comprehensive functional service by integrating one or more target services or functions with additional functions. Typical uses of virtualization include masking system complexity, adding or integrating new features, and emulating, integrating, or decomposing existing service functions. Virtualization is applied to one or more entities that provide storage resources or services.

Storage virtualization is a technology that runs through the entire IT environment to simplify the underlying infrastructure that might otherwise be relatively complex. The idea of storage virtualization is to separate the logical image of the resource from the physical storage, providing a simplified, seamless view of the resources for the system and administrators.

For the user, the virtualized storage resource is like a huge "storage pool". Users don't see the specific disk or tape, and they don't have to care which path the data goes through to which specific storage device.
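The "storage pool" idea can be illustrated with a small sketch. The class and field names below (`Device`, `StoragePool`, extents) are hypothetical and chosen only for illustration; real virtualization layers are far more elaborate. The point is that a caller asks the pool for a volume and never chooses a physical device.

```python
from dataclasses import dataclass, field


@dataclass
class Device:
    """A physical storage device with a number of free extents (hypothetical)."""
    name: str
    free_extents: int


@dataclass
class StoragePool:
    """Hides which physical device backs each logical extent of a volume."""
    devices: list
    # volume name -> list of (device name, extent index on that device)
    mapping: dict = field(default_factory=dict)
    _next_index: dict = field(default_factory=dict)

    def capacity(self) -> int:
        """Total free extents across all devices in the pool."""
        return sum(d.free_extents for d in self.devices)

    def create_volume(self, volume: str, extents: int) -> None:
        """Allocate extents for a volume, filling devices in order."""
        if extents > self.capacity():
            raise ValueError("pool does not have enough free extents")
        placed = []
        for device in self.devices:
            while device.free_extents > 0 and len(placed) < extents:
                index = self._next_index.get(device.name, 0)
                placed.append((device.name, index))
                self._next_index[device.name] = index + 1
                device.free_extents -= 1
        self.mapping[volume] = placed

    def locate(self, volume: str, logical_extent: int):
        """Translate a logical extent into its physical location."""
        return self.mapping[volume][logical_extent]


pool = StoragePool([Device("disk-a", 2), Device("disk-b", 4)])
pool.create_volume("vol1", 3)
```

Only `locate` knows that the third logical extent of `vol1` spilled over onto the second disk; the user of the volume sees a single contiguous resource.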

Distributed file system

The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware. It has much in common with existing distributed file systems, but the differences are significant. HDFS is highly fault-tolerant and is designed to be deployed on inexpensive hardware. It provides high-throughput access to application data and suits applications with large data sets. HDFS relaxes some POSIX requirements to allow streaming access to file system data.

HDFS has a master/slave architecture. A cluster has a single name node, the master server, which manages the file system namespace and regulates client access to files. There are also a number of data nodes, usually one per physical node, which manage the storage attached to the nodes they run on. HDFS exposes a file system namespace and allows user data to be stored in files. Internally, a file is split into one or more data blocks that are stored on a set of data nodes. The name node performs file system namespace operations such as opening, closing, and renaming files and directories, and it also determines the mapping of data blocks to data nodes. The data nodes serve read and write requests from clients. They also create, delete, and replicate data blocks as instructed by the name node.
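The division of labor between the name node and the data nodes can be sketched as a toy in-memory model. This is not the HDFS API: the classes, the 4-byte block size, the replication factor of 2, and the round-robin placement are all assumptions made for illustration.

```python
import itertools

BLOCK_SIZE = 4   # bytes; tiny on purpose (real HDFS blocks are far larger)
REPLICATION = 2  # copies kept of each block (illustrative value)


class DataNode:
    """Stores block contents; knows nothing about files."""

    def __init__(self, name):
        self.name = name
        self.blocks = {}  # block id -> bytes

    def store(self, block_id, data):
        self.blocks[block_id] = data

    def read(self, block_id):
        return self.blocks[block_id]


class NameNode:
    """Holds metadata only: file -> block ids, block id -> data nodes."""

    def __init__(self, data_nodes):
        self.files = {}
        self.block_locations = {}
        self._ids = itertools.count()
        self._round_robin = itertools.cycle(data_nodes)

    def allocate_block(self, path):
        """Assign a new block id and pick the data nodes that will hold it."""
        block_id = next(self._ids)
        targets = [next(self._round_robin) for _ in range(REPLICATION)]
        self.files.setdefault(path, []).append(block_id)
        self.block_locations[block_id] = targets
        return block_id, targets


def write_file(name_node, path, data):
    """Client side: split data into blocks and send each to its data nodes."""
    for offset in range(0, len(data), BLOCK_SIZE):
        block_id, targets = name_node.allocate_block(path)
        for node in targets:
            node.store(block_id, data[offset:offset + BLOCK_SIZE])


def read_file(name_node, path):
    """Client side: ask the name node for block locations, then read blocks."""
    chunks = []
    for block_id in name_node.files[path]:
        replica = name_node.block_locations[block_id][0]
        chunks.append(replica.read(block_id))
    return b"".join(chunks)


nodes = [DataNode("dn1"), DataNode("dn2"), DataNode("dn3")]
name_node = NameNode(nodes)
write_file(name_node, "/logs/a.txt", b"hello world")
```

Note that file contents never pass through `NameNode`; it only hands out block ids and locations, which mirrors the metadata-only role described above.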

Cluster storage

Cluster storage aggregates the storage space of multiple storage devices into a single storage pool that offers application servers a unified access interface and management interface. Through the access interface, applications can transparently access and use the disks on all storage devices, making full use of device performance and disk capacity. Data is written to and read from multiple storage devices according to defined rules, which yields higher concurrent access performance.

The advantages of cluster storage lie mainly in improving the overall performance of parallel or partitioned I/O, especially for workflow, read-intensive, and large-file access, and in lowering overall cost by using less expensive servers. There are two ways to implement cluster storage: one combines a hardware infrastructure with software; the other is dedicated cluster storage, which is built on top of a NAS infrastructure but implements clustering through the operating system.
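One simple example of a placement rule for spreading data over several devices is round-robin striping, sketched below. The function names and the in-memory lists standing in for devices are hypothetical; actual cluster storage products use their own placement schemes.

```python
def stripe(data: bytes, device_count: int, stripe_size: int) -> list:
    """Split data into fixed-size stripes and deal them out round-robin."""
    devices = [[] for _ in range(device_count)]
    for i, offset in enumerate(range(0, len(data), stripe_size)):
        devices[i % device_count].append(data[offset:offset + stripe_size])
    return devices


def reassemble(devices: list) -> bytes:
    """Read stripes back in the same round-robin order."""
    out = []
    longest = max(len(d) for d in devices)
    for row in range(longest):
        for device in devices:
            if row < len(device):
                out.append(device[row])
    return b"".join(out)


layout = stripe(b"abcdefghij", device_count=3, stripe_size=2)
```

Because consecutive stripes sit on different devices, a large sequential read can be served by all devices at once, which is where the concurrency gain comes from.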

Centralized management of storage

The cloud storage management platform must support deployment and management across data centers, including user access scheduling, data migration, and offsite backup across data centers.

Management is centralized. The cloud storage management platform is deployed in the central computer room of the cloud computing system, while storage nodes can be deployed in individual computer rooms. The platform manages and schedules the storage devices in every computer room.

Through the cloud storage management platform, users can easily check the service status of each node in the cloud storage system, including its capacity and performance (read IOPS, write IOPS, read traffic, write traffic). This lets users see resource information and operational status within the domain in real time and act on those resources. It also lets them detect resource anomalies promptly and, where necessary, take appropriate measures to keep the system running normally.
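A minimal sketch of such a status view might aggregate per-node reports into one summary and flag nodes that look abnormal. The `NodeStatus` fields and the 90% usage threshold are assumptions for illustration, not part of any particular management platform.

```python
from dataclasses import dataclass


@dataclass
class NodeStatus:
    """One status report from a storage node (hypothetical schema)."""
    node: str
    room: str
    capacity_gb: int
    used_gb: int
    read_iops: int
    write_iops: int


def summarize(statuses, usage_alert=0.9):
    """Combine node reports into a domain-wide view and list nodes to check."""
    return {
        "capacity_gb": sum(s.capacity_gb for s in statuses),
        "used_gb": sum(s.used_gb for s in statuses),
        "read_iops": sum(s.read_iops for s in statuses),
        "write_iops": sum(s.write_iops for s in statuses),
        # Nodes whose usage ratio is at or above the alert threshold.
        "alerts": [s.node for s in statuses
                   if s.used_gb / s.capacity_gb >= usage_alert],
    }


reports = [
    NodeStatus("n1", "room-a", 1000, 950, 1200, 300),
    NodeStatus("n2", "room-b", 2000, 500, 800, 200),
]
summary = summarize(reports)
```

Here `n1` would be flagged because it is 95% full, giving an administrator the chance to migrate data before the node runs out of space.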

Heterogeneous platform collaboration

At present, storage solutions and technologies are numerous and complex. In practice, one enterprise may run several different types of storage devices at the same time, and the storage environments of different vendors have always had compatibility problems. As a result, storage consolidation, though called for over many years, still struggles to meet enterprise needs, and this is the biggest obstacle to the spread of storage virtualization and cloud storage.

While it is easy to adopt the storage cloud in some ways (such as online storage and backup), achieving comprehensive storage consolidation through private cloud storage is much harder. To reach the goal of storage virtualization, an enterprise must improve its existing IT storage environment, focusing on a shared storage architecture, a compatible environment, a simple operation interface, and efficient storage solutions. Among these, a single operation interface or a unified standard API is one of the keys to solving the coordination problem between different storage devices.

CDMI (Cloud Data Management Interface) is a standard interface for cloud storage developed by the Storage Networking Industry Association (SNIA). For cloud computing, CDMI provides a common management infrastructure, reflecting the gradual shift in information management from storage management to data management. The CDMI standard lets users tag data with special metadata that tells the storage provider which data services to apply to that data (such as backup, archiving, or encryption). By implementing the CDMI standard interface, users can move data between different cloud providers without the pain of recoding for different interfaces.
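To make the metadata idea concrete, the sketch below builds the JSON body and headers that a client might send when creating a CDMI data object. Nothing is sent over the network. The helper name is hypothetical, the metadata key `cdmi_data_redundancy` is modeled on CDMI data system metadata, `department` is an invented user-defined key, and the version string is an assumption; all of these should be checked against the SNIA specification version in use.

```python
import json


def build_cdmi_object(value: str, metadata: dict) -> str:
    """Return a JSON request body for a CDMI-style data object (sketch only)."""
    body = {
        "mimetype": "text/plain",
        "metadata": metadata,  # tells the provider how to treat the data
        "value": value,
    }
    return json.dumps(body, indent=2)


# Headers a CDMI client would typically set; the version is an assumption.
headers = {
    "Content-Type": "application/cdmi-object",
    "Accept": "application/cdmi-object",
    "X-CDMI-Specification-Version": "1.1",
}

request_body = build_cdmi_object(
    "quarterly report",
    {
        "cdmi_data_redundancy": "2",  # assumed key: desired number of copies
        "department": "finance",      # hypothetical user-defined metadata
    },
)
```

Because the data service requirements travel with the object as metadata, the same request body could in principle be sent to any provider that implements the interface.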

Automatic tiered storage

Improving storage management efficiency has become a primary concern for many enterprises, and automatic tiered storage is one of the most effective basic technologies for it. It refers to migrating data blocks between different disk types and RAID levels to strike the right balance between performance and space usage, quickly placing data where it belongs and avoiding so-called hot spots.

As this technology has gained wide attention, the expectation is that data moves between storage tiers, such as FC disks and SATA disks, through a fully automated migration process.

In a hierarchical data storage structure, the storage devices generally include tape libraries and disks or disk arrays. Disks can be further classified by performance into FC disks, SCSI disks, SATA disks, and so on, and flash storage media (non-volatile random-access memory, NVRAM) occupy a higher tier in the hierarchy because of their higher performance. In general, high-cost, fast devices such as disks or disk arrays store important information that is accessed frequently, while low-cost storage resources such as tape libraries store information that is accessed rarely.
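The tiering decision itself can be sketched as a simple policy: blocks accessed often move one tier up, blocks accessed rarely move one tier down. The tier names follow the media mentioned above, but the thresholds, the one-step-at-a-time rule, and the function name are assumptions for illustration rather than how any specific product works.

```python
# Tiers ordered from fastest and most expensive to slowest and cheapest.
TIERS = ["flash", "fc", "sata", "tape"]


def plan_migrations(blocks: dict, hot: int = 100, cold: int = 10) -> dict:
    """Decide which blocks should change tier.

    blocks maps a block id to (current tier, accesses in the last window).
    Returns a map of block id -> destination tier for blocks that should move.
    """
    moves = {}
    for block_id, (tier, accesses) in blocks.items():
        level = TIERS.index(tier)
        if accesses >= hot and level > 0:
            moves[block_id] = TIERS[level - 1]   # promote hot data
        elif accesses <= cold and level < len(TIERS) - 1:
            moves[block_id] = TIERS[level + 1]   # demote cold data
    return moves


observed = {
    "b1": ("sata", 500),   # hot block on a slow tier
    "b2": ("fc", 3),       # cold block on a fast tier
    "b3": ("flash", 900),  # hot, but already on the top tier
    "b4": ("fc", 50),      # neither hot nor cold
}
moves = plan_migrations(observed)
```

Running such a policy on a schedule, with no administrator in the loop, is what makes the migration "automatic".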