Using InfiniBand RDMA to separate storage and compute

Introduction:
RDMA is a protocol used mainly on InfiniBand networks for high-throughput, low-latency remote memory and storage access. It consumes essentially no CPU time and does not require copying data between application memory and kernel memory.
It is commonly used for large-scale network storage access, and it is also a good fit for separating compute resources from storage resources: RDMA provides low latency and high throughput while keeping CPU overhead down.
Below is an introduction to RDMA.
In computing, remote direct memory access (RDMA) is a direct memory access from the memory of one computer into that of another without involving either one's operating system. This permits high-throughput, low-latency networking, which is especially useful in massively parallel computer clusters.

RDMA supports zero-copy networking by enabling the network adapter to transfer data directly to or from application memory, eliminating the need to copy data between application memory and the data buffers in the operating system. Such transfers require no work to be done by CPUs, caches, or context switches, and transfers continue in parallel with other system operations. When an application performs an RDMA Read or Write request, the application data is delivered directly to the network, reducing latency and enabling fast message transfer.

However, this strategy presents several problems related to the fact that the target node is not notified of the completion of the request (1-sided communications).

Much like other high performance computing (HPC) interconnects, RDMA has achieved limited acceptance as of 2013 due to the need to install a different networking infrastructure. However, new standards enable Ethernet RDMA implementation at the physical layer using TCP/IP as the transport, thus combining the performance and latency advantages of RDMA with a low-cost, standards-based solution. The RDMA Consortium and the DAT Collaborative[1] have played key roles in the development of RDMA protocols and APIs for consideration by standards groups such as the Internet Engineering Task Force and the Interconnect Software Consortium.[2]

Hardware vendors have started working on higher-capacity RDMA-based network adapters, with rates of 40Gbit/s reported.[3][4] Software vendors such as Red Hat and Oracle Corporation support these APIs in their latest products, and as of 2013 engineers have started developing network adapters that implement RDMA over Ethernet. Both Red Hat Enterprise Linux and Red Hat Enterprise MRG[5] have support for RDMA. Microsoft supports RDMA in Windows Server 2012 via SMB Direct.

Common RDMA implementations include the Virtual Interface Architecture, RDMA over Converged Ethernet (RoCE),[6] InfiniBand, and iWARP.
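To verify that an RDMA-capable adapter is visible and to get a feel for the throughput and latency figures mentioned above, the ibv_devinfo utility (from libibverbs-utils) lists the RDMA devices, and the perftest tools can run a quick benchmark between two hosts. This is only a sketch; the hostname server-node below is a placeholder.

On the server node:
    # ibv_devinfo
    # ib_write_bw
On the client node:
    # ib_write_bw server-node
    # ib_write_lat server-node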

If your system has an RDMA-capable adapter, you can try using it; for example, NFS over RDMA is supported starting with Red Hat Enterprise Linux 6.
This is a way of accessing remote storage that is transparent to applications.
The configuration is also straightforward.
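Before following the procedure below, you can optionally check that the kernel modules used by NFS over RDMA are present. The module names xprtrdma (client side) and svcrdma (server side) are those used by the mainline Linux NFS/RDMA code, so treat them as an assumption for your particular kernel:
    # modinfo xprtrdma
    # modinfo svcrdma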

9.7.5. NFS over RDMA

To enable the RDMA transport in the Linux kernel NFS server, use the following procedure:

Procedure 9.2. Enable RDMA from server

  1. Ensure the RDMA rpm is installed and the RDMA service is enabled with the following command:
    # yum install rdma; chkconfig --level 2345 rdma on
  2. Ensure the package that provides the nfs-rdma service is installed and the service is enabled with the following command:
    # yum install rdma; chkconfig --level 345 nfs-rdma on
  3. Ensure that the RDMA port is set to the preferred port (the default for Red Hat Enterprise Linux 6 is 2050). To do so, edit the /etc/rdma/rdma.conf file to set NFSoRDMA_LOAD=yes and NFSoRDMA_PORT to the desired port.
  4. Set up the exported filesystem as normal for NFS mounts (see the example configuration after this procedure).
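As a sketch of what the resulting server configuration might look like (the export path and client subnet below are made-up examples), the relevant pieces on a Red Hat Enterprise Linux 6 server would be roughly:

In /etc/rdma/rdma.conf:
    NFSoRDMA_LOAD=yes
    NFSoRDMA_PORT=2050
In /etc/exports:
    /export/data 192.168.1.0/24(rw,sync,no_root_squash)
Then re-export and restart the NFS service:
    # exportfs -r
    # service nfs restart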
On the client side, use the following procedure:

Procedure 9.3. Enable RDMA from client

  1. Ensure the RDMA rpm is installed and the RDMA service is enabled with the following command:
    # yum install rdma; chkconfig --level 2345 rdma on
  2. Mount the NFS exported partition using the RDMA option on the mount call. The port option can optionally be added to the call.
    # mount -t nfs -o rdma,port=port_number
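For example, assuming a server named nfs-server exporting /export/data on the default Red Hat Enterprise Linux 6 port of 2050 (the hostname, export path, and mount point are placeholders):
    # mount -t nfs -o rdma,port=2050 nfs-server:/export/data /mnt/data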
Another option is SDP (Sockets Direct Protocol), which is likewise transparent to applications.
See the sketch below for reference.
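As a rough sketch of the SDP approach (the library path and the application name below are placeholders; libsdp is shipped with OFED, and /etc/libsdp.conf controls which connections are converted), an unmodified TCP application can be redirected onto SDP by preloading the library:
    # LD_PRELOAD=/usr/lib64/libsdp.so ./my_tcp_app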
