SystemRequirementsBitfusionGuideWHITEPAPER–OCTOBER2019WHITEPAPER|2VMwareBitfusion:SystemRequirementsTableofContentsSystemRequirements3HardwareRequirements3Networking4VerifyingSystemHealth5SoftwareDependencies6Ubuntu6CentOSandRedHat7ListCUDALibraries.
7VerifytheCUDAInstallation7CUDAInstallationonCPUNodes/VMs/Containers7WHITEPAPER|3SystemRequirementsFlexDirectcanbeinstalledfordifferentmodesofoperation.
Somemodesinclude,inwholeorinpart,theabilitiesofothermodes.
Thisonedocumentliststherequirementsforeachmode.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGERUbuntuLTS16.
04CentOS7orRHEL7.
4+NVIDIAToolkitCUDA7.
5CUDA8CUDA9CUDA10UbuntuLTS16.
04CentOS7orRHEL7.
4+UbuntuLTS16.
04CentOS7orRHEL7.
4+UbuntuLTS16.
04CentOS7orRHEL7.
4+NVIDIAGPUdriverversion372orhigherNVIDIAGPUdriverversion372orhigherNVIDIAGPUdriverversion372orhigherNote:CanalsorunApplicationsunderFlexDirectClient,inwhichcaseNVIDIAToolkitprerequisitesareneeded.
Note:CanalsorunApplicationsunderFlexDirectClient,inwhichcaseNVIDIAToolkitprerequisitesareneeded.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGERN/AAnygenerationCUDA-enabledNVIDIAGPU(s)AnygenerationCUDA-enabledNVIDIAGPU(s)AnygenerationCUDA-enabledNVIDIAGPU(s)10GbsormoreEthernet(TCP/IP)RoCEorInfinibandNetworkConnectivity10GbsormoreEthernet(TCP/IP)RoCEorInfiniband10GbsormoreEthernet(TCP/IP)RoCEorInfinibandSystemmemoryontheclientCPUmachinesshouldbe1.
5xthetotalaggregateGPUmemoryacrossalltheGPUsinthelargestGPUmachine.
N/AN/AN/AHardwareRequirementsWHITEPAPER|4NetworkingFlexDirectmakesuseofseveralportsorrangesofportsforprocesstoprocesscommunication.
Pleaseensurethatyourfirewallsdonotblocktheportsbelow.
Pleaseensuretherearenoportconflictswithotherapplications.
Someoftherangescanbemodifiedwithcommand-linearguments.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGER56001ServerCommunicationOutboundto:–Servernodes–Managernodes56001ServerCommunicationInboundfrom:–Analyticsnodes–ManagernodesOutboundto:–Analyticsnodes–Managernodes56001ServerCommunicationInboundfrom:–Clientnodes56001ServerCommunicationInboundfrom:–Clientnodes–Analyticsnodes–ManagernodesOutboundto:–Analyticsnodes–ManagernodesNote:Overridewith--srs_port56008,forexample.
Note:Overridewith--srs_port56008,forexample.
Note:Overridewith--srs_port56008,forexample.
55001-55100DispatcherCommunicationOutboundto:–Servernodes–Managernodes55001-55100DispatcherCommunicationN/A55001-55100DispatcherCommunicationInboundfrom:–Clientnodes55001-55100DispatcherCommunicationInboundfrom:–Clientnodes45201-46225CUDACommunicationOutboundto:–Servernodes–Managernodes45201-46225CUDACommunicationN/A45201-46225CUDACommunicationInboundfrom:–Clientnodes45201-46225CUDACommunicationInboundfrom:–Clientnodes54000WebServerN/A54000WebServerInboundfrom:–Anyonewithabrowser54000WebServerN/A54000WebServerInboundfrom:–AnyonewithabrowserNote:Overridewith--web_port12345,forexample.
Note:Overridewith--web_port12345,forexample.
WHITEPAPER|5FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGER54001StatisticsCollectionN/A54001StatisticsCollectionInbound/Outbound–localhostonly54001StatisticsCollectionN/A54001StatisticsCollectionInbound/Outbound–localhostonlyNote:Overridewith--collection_port54321,forexample.
Note:Overridewith--collection_port54321,forexample.
Ensureyourmachineshaveingress/egressinternetaccessforaccesstodownloads,licensing,etc.
Checkforothernetworkingpolicies,suchasoutboundproxyorinternalDNS,staticversusdynamicIPs.
ConsultwiththeBitfusionteamifyouareusingdynamicIPsorhaveanyproxiessetup.
VerifyingSystemHealthOnceyouhaveinstalledFlexDirect,youcanvalidateyourenvironmentforthebestresultsbyrunningtheFlexDirecthealthcheck.
Shell#Assumesflexdirectisinstalled;seeinstallationguide.
flexdirecthealthThehealthcheckwillrunchecksonthenodesofyourclusterappropriatetotheirhardware(e.
g.
,GPUs)ortotheirconfiguration(e.
g.
RoCE).
Soyouroutputmaydifferfromthatshownbelow.
Checkstrytofindsettingsorproblemsthatwilllimitorpreventthehigh-bandwidth,low-latencycommunicationneededforthebestperformance.
Theresultsshouldbeselfexplanatory.
CheckswillbeperformedonthelocalhostandonallconfiguredGPUservers.
Shownbelowistheoutputfromthecheckonthelocalnode.
WHITEPAPER|6Output$flexdirecthealthHealthreportforserver:192.
168.
10.
41:56001SoftwareVersionschecks:[PASS]Checklibrarydependency[PASS]CheckOFEDVersion:4.
3[PASS]CheckCUDAversion>=7050:Currentversion:9020[PASS]CheckGPUdriverversion>=367.
0:Currentversion:396.
37SystemResourceschecks:[PASS]Checkflexdirectinstall[PASS]CheckexternalconnectivitytoBitfusionlicenseserver.
Ignoreforon-premiseinstallations.
[PASS]Checkshadowmemory192076MBandtotalgpumem64640MBPerformancechecks:[PASS]CheckMTUSize:hi-speedinterfacesMTU>=4K[PASS]Checkmemops[PASS]Checkulimit-n>=4096:4096[MARGINAL]Checkmultinodesupport:GPUdirectnotsupported[MARGINAL]Checknv_peer_mem:nv_peer_memmodulenotloaded[SKIPPED]CannotperformPCIediagnosiswithoutrootaccess.
RunthebinarywithsudotogetdiagnosisStabilitychecks:[PASS]CheckNetworkErrors/Drops:foundnoerrorsorpacketdrops[PASS]CheckIBphysicallyup[PASS]CheckIBSMup[PASS]CheckMADagentregistrationerrors[PASS]CheckGPUAPImismatch[PASS]CheckGPUXiderrors[PASS]Checktemperaturecom/compute/cuda/repos/ubuntu1604/x86_64/cuda-repo-ubuntu1604_9.
1.
85-1_amd64.
deb$sudodpkg-icuda-repo-ubuntu1604_9.
1.
85-1_amd64.
deb$sudoapt-keyadv--fetch-keyshttp://developer.
download.
nvidia.
com/compute/cuda/repos/ubuntu1604/x86_64/7fa2af80.
pub$sudoapt-getupdate$sudoapt-getinstall-ycuda-toolkit-9-1$rmcuda-repo-ubuntu1604_9.
1.
85-1_amd64.
debListCUDALibrariesCUDAlibsmightbemissingifyourinstallationisaminimalone.
Copy/pastethefilesfrom/usr/local/cuda/lib64:VerifytheCUDAinstallationTheeasiestwaytodothiswillbetoinstalltheCUDAsamples.
HereistheRPMforCUDAsamples.
ItistheRPMforRHEL7andCUDA9.
1.
Afterinstallingthesamples,makesuredeviceQuerycanberuneitherdirectly(ifitisaGPUmachine)orwithFlexDirect(ifitisaClientCPUmachine)ScheduleatestjobusingCUDAdeviceQuery:CUDAInstallationonCPUNodes/VMs/ContainersInordertorunGPUapplicationswithFlexDirectonCPUnodes,VMs,orincontainerswhichdonothavedirectaccesstophysicalGPUhardware,youstillneedtoinstallCUDA—butthereisnoneedtoinstallanyNvidiadrivercomponents.
TheexamplebelowillustrateshowtotoinstallCUDA9.
1inafeweasysteps.
TheCUDAversionthatyouinstallontheClientshouldmatchtheCUDAversionwhichisrunningontheserver.
VMware,Inc.
3401HillviewAvenuePaloAltoCA94304USATel877-486-9273Fax650-427-5001vmware.
comCopyright2019VMware,Inc.
Allrightsreserved.
ThisproductisprotectedbyU.
S.
andinternationalcopyrightandintellectualpropertylaws.
VMwareproductsarecoveredbyoneormorepatentslistedatvmware.
com/go/patents.
VMwareisaregisteredtrademarkortrademarkofVMware,Inc.
anditssubsidiariesintheUnitedStatesandotherjurisdictions.
Allothermarksandnamesmentionedhereinmaybetrademarksoftheirrespectivecompanies.
ItemNo:VMW-0518-1843_VMW_CPBUTechnicalWhitePapers_BitfusionDocs_05SystemRequirements_1.
4_YC8/19
racknerd当前对美国犹他州数据中心的大硬盘服务器(存储服务器)进行低价促销,价格跌破眼镜啊。提供AMD和Intel两个选择,默认32G内存,120G SSD系统盘,12个16T HDD做数据盘,接入1Gbps带宽,每个月默认给100T流量,5个IPv4... 官方网站:https://www.racknerd.com 加密数字货币、信用卡、PayPal、支付宝、银联(卡),可以付款! ...
Chia矿机,Spinservers怎么样?Spinservers好不好,Spinservers大硬盘服务器。Spinservers刚刚在美国圣何塞机房补货120台独立服务器,CPU都是双E5系列,64-512GB DDR4内存,超大SSD或NVMe存储,数量有限,机器都是预部署好的,下单即可上架,无需人工干预,有需要的朋友抓紧下单哦。Spinservers是Majestic Hosting So...
WHloud Date(鲸云数据),原做大数据和软件开发的团队,现在转变成云计算服务,面对海内外用户提供中国大陆,韩国,日本,香港等多个地方节点服务。24*7小时的在线支持,较为全面的虚拟化构架以及全方面的技术支持!官方网站:https://www.whloud.com/WHloud Date 韩国BGP云主机少量补货随时可以开通,随时可以用,两小时内提交退款,可在工作日期间全额原路返回!支持pa...
WWW YC8 COM为你推荐
中国联通话费查询中国联通话费查询拨打什么号可以发外链的论坛可以发外链的论坛有那些?暴风影音怎么截图如何在暴风影音中截图?邮箱打不开怎么办163邮箱突然打不开了怎么办自助建站自助建站可信吗?1433端口如何打开SQL1433端口安卓应用平台安卓系统支持的软件并不是那么多,为什么这么多人推崇?ios7固件下载ios7发布当天是否有固件下载怎么点亮qq空间图标QQ空间的图标怎么点亮创维云电视功能创维新出的4K超高清健康云电视有谁用过,功能效果怎么样?
免费国外空间 论坛虚拟主机 绍兴服务器租用 美国和欧洲vps vps交流 awardspace win8.1企业版升级win10 php空间推荐 ntfs格式分区 刀片式服务器 lol台服官网 英国伦敦 华为云建站 免费个人主页 ipower 带宽测速 阿里云宕机故障 shuangshiyi ddos攻击教程 云主机 更多