multiple pkey found in partition key table, please choose one via MPI_IB_PKEY問題

這幾天,在HPC上安裝調試Linux版的STAR-CCM+ 11.06.010-R8, 以串行模式加載算例時,能夠成功計算。但是以並行方式加載算例時,會報出一大堆錯誤信息:

Design STAR-CCM+ simulation completed
Server process ended unexpectedly (return code 1)
 starccm+: Rank 0:1: MPI_Init: multiple pkey found in partition key table, please choose one via MPI_IB_PKEY
starccm+: Rank 0:1: MPI_Init: pkey table: 
starccm+: Rank 0:1: MPI_Init: 0x8001 
starccm+: Rank 0:1: MPI_Init: 0x7fff 
starccm+: Rank 0:1: MPI_Init: 0xffff 
starccm+: Rank 0:1: MPI_Init: ibv_get_pkey() failed
starccm+: Rank 0:1: MPI_Init: Can't initialize RDMA device
starccm+: Rank 0:1: MPI_Init: Internal Error: Cannot initialize RDMA protocol


starccm+: Rank 0:0: MPI_Init: multiple pkey found in partition key table, please choose one via MPI_IB_PKEY
starccm+: Rank 0:0: MPI_Init: pkey table: 
starccm+: Rank 0:0: MPI_Init: 0x8001 
starccm+: Rank 0:0: MPI_Init: 0x7fff 
starccm+: Rank 0:0: MPI_Init: 0xffff 
starccm+: Rank 0:0: MPI_Init: ibv_get_pkey() failed
starccm+: Rank 0:0: MPI_Init: Can't initialize RDMA device
starccm+: Rank 0:0: MPI_Init: Internal Error: Cannot initialize RDMA protocol


MPI Application rank 1 exited before MPI_Init() with status 1
mpirun: Broken pipe


緊緊圍繞着“”multiple pkey found in partition key table, please choose one via MPI_IB_PKEY“”這句話,通過"http://guge.firstguo.com/"一路來到Mellanox的社區論壇:

https://community.mellanox.com/docs/DOC-2232

裏面有這麼一句話:For example, the default PKEY will either have a value of 0x7FFF (limited) or 0xFFFF (full).


果斷地在用戶環境變量文件.bashrc中添加環境變量:export MPI_IB_PKEY=0x7FFF


再次啓動starccm+, 以並行方式加載算例,成功加載,然後順利開始並行計算!!

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章