SystemRequirementsBitfusionGuideWHITEPAPER–OCTOBER2019WHITEPAPER|2VMwareBitfusion:SystemRequirementsTableofContentsSystemRequirements3HardwareRequirements3Networking4VerifyingSystemHealth5SoftwareDependencies6Ubuntu6CentOSandRedHat7ListCUDALibraries.
7VerifytheCUDAInstallation7CUDAInstallationonCPUNodes/VMs/Containers7WHITEPAPER|3SystemRequirementsFlexDirectcanbeinstalledfordifferentmodesofoperation.
Somemodesinclude,inwholeorinpart,theabilitiesofothermodes.
Thisonedocumentliststherequirementsforeachmode.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGERUbuntuLTS16.
04CentOS7orRHEL7.
4+NVIDIAToolkitCUDA7.
5CUDA8CUDA9CUDA10UbuntuLTS16.
04CentOS7orRHEL7.
4+UbuntuLTS16.
04CentOS7orRHEL7.
4+UbuntuLTS16.
04CentOS7orRHEL7.
4+NVIDIAGPUdriverversion372orhigherNVIDIAGPUdriverversion372orhigherNVIDIAGPUdriverversion372orhigherNote:CanalsorunApplicationsunderFlexDirectClient,inwhichcaseNVIDIAToolkitprerequisitesareneeded.
Note:CanalsorunApplicationsunderFlexDirectClient,inwhichcaseNVIDIAToolkitprerequisitesareneeded.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGERN/AAnygenerationCUDA-enabledNVIDIAGPU(s)AnygenerationCUDA-enabledNVIDIAGPU(s)AnygenerationCUDA-enabledNVIDIAGPU(s)10GbsormoreEthernet(TCP/IP)RoCEorInfinibandNetworkConnectivity10GbsormoreEthernet(TCP/IP)RoCEorInfiniband10GbsormoreEthernet(TCP/IP)RoCEorInfinibandSystemmemoryontheclientCPUmachinesshouldbe1.
5xthetotalaggregateGPUmemoryacrossalltheGPUsinthelargestGPUmachine.
N/AN/AN/AHardwareRequirementsWHITEPAPER|4NetworkingFlexDirectmakesuseofseveralportsorrangesofportsforprocesstoprocesscommunication.
Pleaseensurethatyourfirewallsdonotblocktheportsbelow.
Pleaseensuretherearenoportconflictswithotherapplications.
Someoftherangescanbemodifiedwithcommand-linearguments.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGER56001ServerCommunicationOutboundto:–Servernodes–Managernodes56001ServerCommunicationInboundfrom:–Analyticsnodes–ManagernodesOutboundto:–Analyticsnodes–Managernodes56001ServerCommunicationInboundfrom:–Clientnodes56001ServerCommunicationInboundfrom:–Clientnodes–Analyticsnodes–ManagernodesOutboundto:–Analyticsnodes–ManagernodesNote:Overridewith--srs_port56008,forexample.
Note:Overridewith--srs_port56008,forexample.
Note:Overridewith--srs_port56008,forexample.
55001-55100DispatcherCommunicationOutboundto:–Servernodes–Managernodes55001-55100DispatcherCommunicationN/A55001-55100DispatcherCommunicationInboundfrom:–Clientnodes55001-55100DispatcherCommunicationInboundfrom:–Clientnodes45201-46225CUDACommunicationOutboundto:–Servernodes–Managernodes45201-46225CUDACommunicationN/A45201-46225CUDACommunicationInboundfrom:–Clientnodes45201-46225CUDACommunicationInboundfrom:–Clientnodes54000WebServerN/A54000WebServerInboundfrom:–Anyonewithabrowser54000WebServerN/A54000WebServerInboundfrom:–AnyonewithabrowserNote:Overridewith--web_port12345,forexample.
Note:Overridewith--web_port12345,forexample.
WHITEPAPER|5FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGER54001StatisticsCollectionN/A54001StatisticsCollectionInbound/Outbound–localhostonly54001StatisticsCollectionN/A54001StatisticsCollectionInbound/Outbound–localhostonlyNote:Overridewith--collection_port54321,forexample.
Note:Overridewith--collection_port54321,forexample.
Ensureyourmachineshaveingress/egressinternetaccessforaccesstodownloads,licensing,etc.
Checkforothernetworkingpolicies,suchasoutboundproxyorinternalDNS,staticversusdynamicIPs.
ConsultwiththeBitfusionteamifyouareusingdynamicIPsorhaveanyproxiessetup.
VerifyingSystemHealthOnceyouhaveinstalledFlexDirect,youcanvalidateyourenvironmentforthebestresultsbyrunningtheFlexDirecthealthcheck.
Shell#Assumesflexdirectisinstalled;seeinstallationguide.
flexdirecthealthThehealthcheckwillrunchecksonthenodesofyourclusterappropriatetotheirhardware(e.
g.
,GPUs)ortotheirconfiguration(e.
g.
RoCE).
Soyouroutputmaydifferfromthatshownbelow.
Checkstrytofindsettingsorproblemsthatwilllimitorpreventthehigh-bandwidth,low-latencycommunicationneededforthebestperformance.
Theresultsshouldbeselfexplanatory.
CheckswillbeperformedonthelocalhostandonallconfiguredGPUservers.
Shownbelowistheoutputfromthecheckonthelocalnode.
WHITEPAPER|6Output$flexdirecthealthHealthreportforserver:192.
168.
10.
41:56001SoftwareVersionschecks:[PASS]Checklibrarydependency[PASS]CheckOFEDVersion:4.
3[PASS]CheckCUDAversion>=7050:Currentversion:9020[PASS]CheckGPUdriverversion>=367.
0:Currentversion:396.
37SystemResourceschecks:[PASS]Checkflexdirectinstall[PASS]CheckexternalconnectivitytoBitfusionlicenseserver.
Ignoreforon-premiseinstallations.
[PASS]Checkshadowmemory192076MBandtotalgpumem64640MBPerformancechecks:[PASS]CheckMTUSize:hi-speedinterfacesMTU>=4K[PASS]Checkmemops[PASS]Checkulimit-n>=4096:4096[MARGINAL]Checkmultinodesupport:GPUdirectnotsupported[MARGINAL]Checknv_peer_mem:nv_peer_memmodulenotloaded[SKIPPED]CannotperformPCIediagnosiswithoutrootaccess.
RunthebinarywithsudotogetdiagnosisStabilitychecks:[PASS]CheckNetworkErrors/Drops:foundnoerrorsorpacketdrops[PASS]CheckIBphysicallyup[PASS]CheckIBSMup[PASS]CheckMADagentregistrationerrors[PASS]CheckGPUAPImismatch[PASS]CheckGPUXiderrors[PASS]Checktemperaturecom/compute/cuda/repos/ubuntu1604/x86_64/cuda-repo-ubuntu1604_9.
1.
85-1_amd64.
deb$sudodpkg-icuda-repo-ubuntu1604_9.
1.
85-1_amd64.
deb$sudoapt-keyadv--fetch-keyshttp://developer.
download.
nvidia.
com/compute/cuda/repos/ubuntu1604/x86_64/7fa2af80.
pub$sudoapt-getupdate$sudoapt-getinstall-ycuda-toolkit-9-1$rmcuda-repo-ubuntu1604_9.
1.
85-1_amd64.
debListCUDALibrariesCUDAlibsmightbemissingifyourinstallationisaminimalone.
Copy/pastethefilesfrom/usr/local/cuda/lib64:VerifytheCUDAinstallationTheeasiestwaytodothiswillbetoinstalltheCUDAsamples.
HereistheRPMforCUDAsamples.
ItistheRPMforRHEL7andCUDA9.
1.
Afterinstallingthesamples,makesuredeviceQuerycanberuneitherdirectly(ifitisaGPUmachine)orwithFlexDirect(ifitisaClientCPUmachine)ScheduleatestjobusingCUDAdeviceQuery:CUDAInstallationonCPUNodes/VMs/ContainersInordertorunGPUapplicationswithFlexDirectonCPUnodes,VMs,orincontainerswhichdonothavedirectaccesstophysicalGPUhardware,youstillneedtoinstallCUDA—butthereisnoneedtoinstallanyNvidiadrivercomponents.
TheexamplebelowillustrateshowtotoinstallCUDA9.
1inafeweasysteps.
TheCUDAversionthatyouinstallontheClientshouldmatchtheCUDAversionwhichisrunningontheserver.
VMware,Inc.
3401HillviewAvenuePaloAltoCA94304USATel877-486-9273Fax650-427-5001vmware.
comCopyright2019VMware,Inc.
Allrightsreserved.
ThisproductisprotectedbyU.
S.
andinternationalcopyrightandintellectualpropertylaws.
VMwareproductsarecoveredbyoneormorepatentslistedatvmware.
com/go/patents.
VMwareisaregisteredtrademarkortrademarkofVMware,Inc.
anditssubsidiariesintheUnitedStatesandotherjurisdictions.
Allothermarksandnamesmentionedhereinmaybetrademarksoftheirrespectivecompanies.
ItemNo:VMW-0518-1843_VMW_CPBUTechnicalWhitePapers_BitfusionDocs_05SystemRequirements_1.
4_YC8/19
Hostodo又发布了几款针对7月4日美国独立日的优惠套餐(Independence Day Super Sale),均为年付,基于KVM架构,采用NVMe硬盘,最低13.99美元起,可选拉斯维加斯或者迈阿密机房。这是一家成立于2014年的国外VPS主机商,主打低价VPS套餐且年付为主,基于OpenVZ和KVM架构,产品性能一般,支持使用PayPal或者支付宝等付款方式。商家客服响应也比较一般,推...
Hosteons,一家海外主机商成立于2018年,在之前还没有介绍和接触这个主机商,今天是有在LEB上看到有官方发送的活动主要是针对LEB的用户提供的洛杉矶、达拉斯和纽约三个机房的方案,最低年付21美元,其特点主要在于可以从1G带宽升级至10G,而且是免费的,是不是很吸引人?本来这次活动是仅仅在LEB留言提交账单ID才可以,这个感觉有点麻烦。不过看到老龚同学有拿到识别优惠码,于是就一并来分享给有需...
Webhosting24宣布自7月1日起开始对日本机房的VPS进行NVMe和流量大升级,几乎是翻倍了硬盘和流量,价格依旧不变。目前来看,日本VPS国内过去走的是NTT直连,服务器托管机房应该是CDN77*(也就是datapacket.com),加上高性能平台(AMD Ryzen 9 3900X+NVMe),还是有相当大的性价比的。此外在6月30日,又新增了洛杉矶机房,CPU为AMD Ryzen 9...
WWW YC8 COM为你推荐
湖南商标注册湖南商标注册怎么办理vista系统重装vista怎样重装系统?照片转手绘照片弄成手绘一样的那个软件到底叫什么,能不能告诉啊?数码资源网有什么网站弄相片效果比较好的?商标注册查询官网怎么查商标有没有注册怎么上传音乐如何上传音乐服务器连接异常服务器连接异常是怎么回事啊,怎么解决php购物车PHP中用json实现购物车功能,怎么实现王炳坤nike男子跑步鞋42码的对应同款女子跑步鞋是多少码?网页打不开的原因网页老打不开是什么原因啊
免费网站域名注册 大庆服务器租用 国外免费域名网站 免费申请网页 主机点评 godaddy域名优惠码 美国php空间 512m内存 北京双线机房 韩国名字大全 网站卫士 中国电信测速网 web服务器安全 安徽双线服务器 免费邮件服务器 ebay注册 智能dns解析 谷歌台湾 申请免费空间 广东主机托管 更多