SystemRequirementsBitfusionGuideWHITEPAPER–OCTOBER2019WHITEPAPER|2VMwareBitfusion:SystemRequirementsTableofContentsSystemRequirements3HardwareRequirements3Networking4VerifyingSystemHealth5SoftwareDependencies6Ubuntu6CentOSandRedHat7ListCUDALibraries.
7VerifytheCUDAInstallation7CUDAInstallationonCPUNodes/VMs/Containers7WHITEPAPER|3SystemRequirementsFlexDirectcanbeinstalledfordifferentmodesofoperation.
Somemodesinclude,inwholeorinpart,theabilitiesofothermodes.
Thisonedocumentliststherequirementsforeachmode.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGERUbuntuLTS16.
04CentOS7orRHEL7.
4+NVIDIAToolkitCUDA7.
5CUDA8CUDA9CUDA10UbuntuLTS16.
04CentOS7orRHEL7.
4+UbuntuLTS16.
04CentOS7orRHEL7.
4+UbuntuLTS16.
04CentOS7orRHEL7.
4+NVIDIAGPUdriverversion372orhigherNVIDIAGPUdriverversion372orhigherNVIDIAGPUdriverversion372orhigherNote:CanalsorunApplicationsunderFlexDirectClient,inwhichcaseNVIDIAToolkitprerequisitesareneeded.
Note:CanalsorunApplicationsunderFlexDirectClient,inwhichcaseNVIDIAToolkitprerequisitesareneeded.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGERN/AAnygenerationCUDA-enabledNVIDIAGPU(s)AnygenerationCUDA-enabledNVIDIAGPU(s)AnygenerationCUDA-enabledNVIDIAGPU(s)10GbsormoreEthernet(TCP/IP)RoCEorInfinibandNetworkConnectivity10GbsormoreEthernet(TCP/IP)RoCEorInfiniband10GbsormoreEthernet(TCP/IP)RoCEorInfinibandSystemmemoryontheclientCPUmachinesshouldbe1.
5xthetotalaggregateGPUmemoryacrossalltheGPUsinthelargestGPUmachine.
N/AN/AN/AHardwareRequirementsWHITEPAPER|4NetworkingFlexDirectmakesuseofseveralportsorrangesofportsforprocesstoprocesscommunication.
Pleaseensurethatyourfirewallsdonotblocktheportsbelow.
Pleaseensuretherearenoportconflictswithotherapplications.
Someoftherangescanbemodifiedwithcommand-linearguments.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGER56001ServerCommunicationOutboundto:–Servernodes–Managernodes56001ServerCommunicationInboundfrom:–Analyticsnodes–ManagernodesOutboundto:–Analyticsnodes–Managernodes56001ServerCommunicationInboundfrom:–Clientnodes56001ServerCommunicationInboundfrom:–Clientnodes–Analyticsnodes–ManagernodesOutboundto:–Analyticsnodes–ManagernodesNote:Overridewith--srs_port56008,forexample.
Note:Overridewith--srs_port56008,forexample.
Note:Overridewith--srs_port56008,forexample.
55001-55100DispatcherCommunicationOutboundto:–Servernodes–Managernodes55001-55100DispatcherCommunicationN/A55001-55100DispatcherCommunicationInboundfrom:–Clientnodes55001-55100DispatcherCommunicationInboundfrom:–Clientnodes45201-46225CUDACommunicationOutboundto:–Servernodes–Managernodes45201-46225CUDACommunicationN/A45201-46225CUDACommunicationInboundfrom:–Clientnodes45201-46225CUDACommunicationInboundfrom:–Clientnodes54000WebServerN/A54000WebServerInboundfrom:–Anyonewithabrowser54000WebServerN/A54000WebServerInboundfrom:–AnyonewithabrowserNote:Overridewith--web_port12345,forexample.
Note:Overridewith--web_port12345,forexample.
WHITEPAPER|5FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGER54001StatisticsCollectionN/A54001StatisticsCollectionInbound/Outbound–localhostonly54001StatisticsCollectionN/A54001StatisticsCollectionInbound/Outbound–localhostonlyNote:Overridewith--collection_port54321,forexample.
Note:Overridewith--collection_port54321,forexample.
Ensureyourmachineshaveingress/egressinternetaccessforaccesstodownloads,licensing,etc.
Checkforothernetworkingpolicies,suchasoutboundproxyorinternalDNS,staticversusdynamicIPs.
ConsultwiththeBitfusionteamifyouareusingdynamicIPsorhaveanyproxiessetup.
VerifyingSystemHealthOnceyouhaveinstalledFlexDirect,youcanvalidateyourenvironmentforthebestresultsbyrunningtheFlexDirecthealthcheck.
Shell#Assumesflexdirectisinstalled;seeinstallationguide.
flexdirecthealthThehealthcheckwillrunchecksonthenodesofyourclusterappropriatetotheirhardware(e.
g.
,GPUs)ortotheirconfiguration(e.
g.
RoCE).
Soyouroutputmaydifferfromthatshownbelow.
Checkstrytofindsettingsorproblemsthatwilllimitorpreventthehigh-bandwidth,low-latencycommunicationneededforthebestperformance.
Theresultsshouldbeselfexplanatory.
CheckswillbeperformedonthelocalhostandonallconfiguredGPUservers.
Shownbelowistheoutputfromthecheckonthelocalnode.
WHITEPAPER|6Output$flexdirecthealthHealthreportforserver:192.
168.
10.
41:56001SoftwareVersionschecks:[PASS]Checklibrarydependency[PASS]CheckOFEDVersion:4.
3[PASS]CheckCUDAversion>=7050:Currentversion:9020[PASS]CheckGPUdriverversion>=367.
0:Currentversion:396.
37SystemResourceschecks:[PASS]Checkflexdirectinstall[PASS]CheckexternalconnectivitytoBitfusionlicenseserver.
Ignoreforon-premiseinstallations.
[PASS]Checkshadowmemory192076MBandtotalgpumem64640MBPerformancechecks:[PASS]CheckMTUSize:hi-speedinterfacesMTU>=4K[PASS]Checkmemops[PASS]Checkulimit-n>=4096:4096[MARGINAL]Checkmultinodesupport:GPUdirectnotsupported[MARGINAL]Checknv_peer_mem:nv_peer_memmodulenotloaded[SKIPPED]CannotperformPCIediagnosiswithoutrootaccess.
RunthebinarywithsudotogetdiagnosisStabilitychecks:[PASS]CheckNetworkErrors/Drops:foundnoerrorsorpacketdrops[PASS]CheckIBphysicallyup[PASS]CheckIBSMup[PASS]CheckMADagentregistrationerrors[PASS]CheckGPUAPImismatch[PASS]CheckGPUXiderrors[PASS]Checktemperaturecom/compute/cuda/repos/ubuntu1604/x86_64/cuda-repo-ubuntu1604_9.
1.
85-1_amd64.
deb$sudodpkg-icuda-repo-ubuntu1604_9.
1.
85-1_amd64.
deb$sudoapt-keyadv--fetch-keyshttp://developer.
download.
nvidia.
com/compute/cuda/repos/ubuntu1604/x86_64/7fa2af80.
pub$sudoapt-getupdate$sudoapt-getinstall-ycuda-toolkit-9-1$rmcuda-repo-ubuntu1604_9.
1.
85-1_amd64.
debListCUDALibrariesCUDAlibsmightbemissingifyourinstallationisaminimalone.
Copy/pastethefilesfrom/usr/local/cuda/lib64:VerifytheCUDAinstallationTheeasiestwaytodothiswillbetoinstalltheCUDAsamples.
HereistheRPMforCUDAsamples.
ItistheRPMforRHEL7andCUDA9.
1.
Afterinstallingthesamples,makesuredeviceQuerycanberuneitherdirectly(ifitisaGPUmachine)orwithFlexDirect(ifitisaClientCPUmachine)ScheduleatestjobusingCUDAdeviceQuery:CUDAInstallationonCPUNodes/VMs/ContainersInordertorunGPUapplicationswithFlexDirectonCPUnodes,VMs,orincontainerswhichdonothavedirectaccesstophysicalGPUhardware,youstillneedtoinstallCUDA—butthereisnoneedtoinstallanyNvidiadrivercomponents.
TheexamplebelowillustrateshowtotoinstallCUDA9.
1inafeweasysteps.
TheCUDAversionthatyouinstallontheClientshouldmatchtheCUDAversionwhichisrunningontheserver.
VMware,Inc.
3401HillviewAvenuePaloAltoCA94304USATel877-486-9273Fax650-427-5001vmware.
comCopyright2019VMware,Inc.
Allrightsreserved.
ThisproductisprotectedbyU.
S.
andinternationalcopyrightandintellectualpropertylaws.
VMwareproductsarecoveredbyoneormorepatentslistedatvmware.
com/go/patents.
VMwareisaregisteredtrademarkortrademarkofVMware,Inc.
anditssubsidiariesintheUnitedStatesandotherjurisdictions.
Allothermarksandnamesmentionedhereinmaybetrademarksoftheirrespectivecompanies.
ItemNo:VMW-0518-1843_VMW_CPBUTechnicalWhitePapers_BitfusionDocs_05SystemRequirements_1.
4_YC8/19
螢光云官網萤光云成立于2002年,是一家自有IDC的云厂商,主打高防云服务器产品。在国内有福州、北京、上海、台湾、香港CN2节点,还有华盛顿、河内、曼谷等海外节点。萤光云的高防云服务器自带50G防御,适合高防建站、游戏高防等业务。本次萤光云中秋云活动简单无套路,直接在原有价格上砍了一大刀,最低价格16元/月,而且有没有账户限制,新老客户都可以买,就是直接满满的诚意给大家送优惠了!官网首页:www....
香港站群多ip服务器多少钱?想做好站群的SEO优化,最好给每个网站都分配一个独立IP,这样每个网站之间才不会受到影响。对做站群的站长来说,租用一家性价比高且提供多IP的香港多ip站群服务器很有必要。零途云推出的香港多ip站群云服务器多达256个IP,可以满足站群的优化需求,而且性价比非常高。那么,香港多ip站群云服务器价格多少钱一个月?选择什么样的香港多IP站群云服务器比较好呢?今天,小编带大家一...
ATCLOUD.NET怎么样?ATCLOUD.NET主要提供KVM架构的VPS产品、LXC容器化产品、权威DNS智能解析、域名注册、SSL证书等海外网站建设服务。 其大部分数据中心是由OVH机房提供,其节点包括美国(俄勒冈、弗吉尼亚)、加拿大、英国、法国、德国以及新加坡。 提供超过480Gbps的DDoS高防保护,杜绝DDoS攻击骚扰,比较适合海外建站等业务。官方网站:点击访问ATCLOUD官网活...
WWW YC8 COM为你推荐
湖南商标注册在湖南搞商标注册是代理好还是自己去好一点?湖南商标注册的流程又是什么样的呢?中国电信互联星空中国电信宽带于互联星空的区别显卡温度多少正常显卡温度是多少才算正常的?网店推广网站什么平台适合做淘宝店铺推广数据库损坏数据库坏了,怎么修复?lockdowndios8.1能用gpp3to2吗?型号A1429网管工具做技术网管需要哪些工具?具体做些什么?怎么把网页的字变大怎么使网页字体变大recovery教程Recovery是什么?怎样使用Recovery刷机?nokia最新手机诺基亚最新款手机都那些
com域名注册 万网域名 免费二级域名 景安vps 淘宝二级域名 花生壳域名贝锐 lamp安装 狗爹 diahosting Vultr z.com sub-process 好看的桌面背景大图 12306抢票助手 长沙服务器 网站挂马检测工具 linux空间 腾讯云分析 秒杀预告 me空间社区 更多