這幾天,在HPC上安裝調試Linux版的STAR-CCM+ 11.06.010-R8, 以串行模式加載算例時,能夠成功計算。但是以並行方式加載算例時,會報出一大堆錯誤信息:
Design STAR-CCM+ simulation completed
Server process ended unexpectedly (return code 1)
starccm+: Rank 0:1: MPI_Init: multiple pkey found in partition key table, please choose one via MPI_IB_PKEY
starccm+: Rank 0:1: MPI_Init: pkey table:
starccm+: Rank 0:1: MPI_Init: 0x8001
starccm+: Rank 0:1: MPI_Init: 0x7fff
starccm+: Rank 0:1: MPI_Init: 0xffff
starccm+: Rank 0:1: MPI_Init: ibv_get_pkey() failed
starccm+: Rank 0:1: MPI_Init: Can't initialize RDMA device
starccm+: Rank 0:1: MPI_Init: Internal Error: Cannot initialize RDMA protocol
starccm+: Rank 0:0: MPI_Init: multiple pkey found in partition key table, please choose one via MPI_IB_PKEY
starccm+: Rank 0:0: MPI_Init: pkey table:
starccm+: Rank 0:0: MPI_Init: 0x8001
starccm+: Rank 0:0: MPI_Init: 0x7fff
starccm+: Rank 0:0: MPI_Init: 0xffff
starccm+: Rank 0:0: MPI_Init: ibv_get_pkey() failed
starccm+: Rank 0:0: MPI_Init: Can't initialize RDMA device
starccm+: Rank 0:0: MPI_Init: Internal Error: Cannot initialize RDMA protocol
MPI Application rank 1 exited before MPI_Init() with status 1
mpirun: Broken pipe
緊緊圍繞着“”multiple pkey found in partition key table, please choose one via MPI_IB_PKEY“”這句話,通過"http://guge.firstguo.com/"一路來到Mellanox的社區論壇:
https://community.mellanox.com/docs/DOC-2232
裏面有這麼一句話:For example, the default PKEY will either have a value of 0x7FFF (limited) or 0xFFFF (full).
果斷地在用戶環境變量文件.bashrc中添加環境變量:export MPI_IB_PKEY=0x7FFF
再次啓動starccm+, 以並行方式加載算例,成功加載,然後順利開始並行計算!!