OracleDatabase:BenchmarkingaTrillionEdgesRDFGraphOracleWhitePaperJune,2020Copyright2020,Oracleand/oritsaffiliatesPublic2WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicPURPOSESTATEMENTThisdocumentprovidesanoverviewofhowOracleconductedaonetrillionedgesLUBMbenchmark(LUBM4400k)inSeptember2014withRDFGraphinOracleDatabaseonanOracleExadataDatabaseMachine.
DISCLAIMERThisdocumentinanyform,softwareorprintedmatter,containsproprietaryinformationthatistheexclusivepropertyofOracle.
YouraccesstoanduseofthisconfidentialmaterialissubjecttothetermsandconditionsofyourOraclesoftwarelicenseandserviceagreement,whichhasbeenexecutedandwithwhichyouagreetocomply.
Thisdocumentandinformationcontainedhereinmaynotbedisclosed,copied,reproducedordistributedtoanyoneoutsideOraclewithoutpriorwrittenconsentofOracle.
ThisdocumentisnotpartofyourlicenseagreementnorcanitbeincorporatedintoanycontractualagreementwithOracleoritssubsidiariesoraffiliates.
Thisdocumentisforinformationalpurposesonlyandisintendedsolelytoassistyouinplanningfortheimplementationandupgradeoftheproductfeaturesdescribed.
Itisnotacommitmenttodeliveranymaterial,code,orfunctionality,andshouldnotberelieduponinmakingpurchasingdecisions.
Thedevelopment,release,andtimingofanyfeaturesorfunctionalitydescribedinthisdocumentremainsatthesolediscretionofOracle.
Duetothenatureoftheproductarchitecture,itmaynotbepossibletosafelyincludeallfeaturesdescribedinthisdocumentwithoutriskingsignificantdestabilizationofthecode.
3WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTABLEOFCONTENTSPurposeStatement2Disclaimer2Introduction4ATrillionEdgesRDFGraphBenchmarkonOracleDatabase5TheResults5TheConfiguration5BestPracticesUsed6Databasesettings:6Tuning:6Conclusion64WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicINTRODUCTIONOnetrillionisareallybignumber.
Whatcouldyoustorewithonetrillionfacts1000tweetsforeveryoneofthe1BillionTwitterusers.
770factsabouteveryoneofthe1.
3BillionFacebookusers.
10factsfrom107Billionsensors,locatedsomewhereontheplanet.
400metabolicreadingsforeachofthe2.
5Billionheartbeatsoveranaveragehumanlifetime.
12factsabouteveryoneofthe86Billionneuronsinthehumanbrain.
5factsabouteveryoneofthe200BillionstarsintheMilkyWayGalaxy.
7factsabouteveryoneofthe150Billiongalaxiesintheuniverse.
6,350factsabouteachofthe158MillionbooksintheLibraryofCongress,thelargestintheworld.
10factsabouteachofthe107BillionpeoplewhoeverlivedResourceDescriptionFramework(RDF)graphsandtheanalyticstheypermitarebecomingcentraltobigdataapplicationsforsocialnetworksandlinkeddata.
Theseapplicationsareoftenfoundinpublicsector,healthcareandlifesciences,finance,media,andintelligencecommunities.
TheWorldWideWebConsortium(W3C)1definesRDFandtheWebOntologyLanguage(OWL)graphstandardsforrepresentinganddefiningsemanticdataandrules,andSPARQL,apatternmatchingquerylanguagedesignedspecificallyforgraphanalysis.
ThebasicnatureofanRDFgraphfacilitatesidentification,integration,anddiscovery:RDFdataelementsaregloballyunique.
TheyaredefinedusingUniformResourceIdentifiers(URIs)thatenableaconsistentmetadatalayerforintegrationofdisparatedatasources.
RDFdataelementsarelinkedtoformagraph.
Elementsareusedtomakestatementsintheformofsubject-predicate-objecttriples.
Predicates(edges)linkthesubjectandobject(nodes)andcandescribeanyrelationshiporproperty.
Theobjectcanbeanothersubjecttolinktriplestogethertoformagraphoraliteralthatisanattributeofthesubject.
Thetriplescanbefurtherqualifiedwithafourthnamedgraphcomponent,whicharereferredtoasRDFquads.
TheRDFmodelallowseasy,dynamicschemaevolution.
Addinganewschemaelementisaseasyasinsertingatriplewithanewpredicate.
RDFandSPARQLsupportadhocqueries.
Queriesmaynotbeknownwhentheschemaisdesigned.
TheRDFmodelmakesanOpenWorldAssumptionthatcanfacilitatediscovery.
Itassumesthatwhatisunknownisundefined,ratherthanfalse,asisthecasewithrelationaltechnology.
Italsohastechnologiesthathelpdiscovermissingresults.
RDFembedssemantics(meaning)directlyinthedata.
Entitiesarecategorizedwithclasses,predicatesarepropertiesorrelationships,andtheyareallpartofthedata,unlikecolumnheaders,foreignkeys,orconstraintsinrelationaldata.
RDFsupportsmachine-driveninferencingfordiscovery.
TheOWLsemanticlanguageandrulesusedtodefinethepredicatesintriplesarebasedonformalDescriptionLogicsthatenableautomaticdiscovery,suchasidentifying"same-as"relationshipsbetweendifferenttermswiththesamemeaningintwoapplications.
Thesetofinferredtriples(conclusionsthatcanbedrawn)isreferredtoasanentailment.
TheOWLlanguagecanunifyanenterprise'sdictionaries,vocabularies,andtaxonomies.
Allofthetermsusedbytheapplicationsinanenterprisecanberelatedtoeachotherandformconcepts.
Conceptsaremanagedasoneormoredomain-specificontologiesandstoredinRDFgraphs.
Ontologiesarelinkedtotheassertedinstancedataingraphsandusedforinferencingandquerying.
Thisisanothercapabilitythatfacilitatescreatingaconsistentmetadatalayerfordataintegration.
1http://www.
w3.
org/RDF/5WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTheLehighUniversityBenchmark2(LUBM)isadefactoindustrystandardbenchmarkforevaluatingRDFgraphstoreproductperformance.
ItisusedbyRDFgraphstorevendorstocharacterizetheload,inference,andqueryperformanceoftheirproduct.
VendorspostresultsontheW3CLargeTripleStorespage3.
EndusersuseLUBMbenchmarkresultsaspartoftheirevaluationofanRDFGraphstoreproduct.
ThebenchmarkincludesaW3COWL-baseduniversityontology,adatageneratortocreateagraphofanysize,andfourteentestqueries.
OracleconductedaonetrillionedgesLUBMbenchmark(LUBM4400k)inSeptember2014withOracleDatabase12.
1.
0.
1standardinstallationonExadataDatabaseMachineandachievedtworecord-settingaccomplishments:OraclebelievesitsbenchmarkisthelargestcompleteLUBMbenchmarkintheindustrytodate.
Thecombinedload,inference,andqueryresultsarethefastestRDFgraphperformancenumbersreported;thisisespeciallysignificantforabenchmarkofthisscaleandcomplexity.
Thedetailsforthisbenchmark,includingresults,configuration,andbestpracticesarediscussedinthenextsectionofthispaper.
ATRILLIONEDGESRDFGRAPHBENCHMARKONORACLEDATABASEAsbigdatagraphsgrowfrombillionstotrillionsofrelationshipsitbecomesincreasinglyimportanttocharacterizeproductperformance.
OracleconductedanRDFgraphLUBM4400kbenchmark.
Itinvolvedloading,inferencing,andqueryingoveronetrillionedgeswithRDFGraphinOracleDatabaseonanOracleExadataDatabaseMachine.
TheLUBMenvironmentwasusedtogeneratedataaboutuniversitiesandtheirdepartments.
Thedatawascreatedandorderedinto4.
4millionnamedgraphsbyexpandingthetriplesintoquads.
Therewasonenamedgraphperuniversity.
Theoverallgraphincluded605.
4billionuniqueassertedquadsandanentailmentofanother475.
6+billionquads.
TheResultsTheRDFGraphLUBM4400kbenchmarkonOracleDatabaseachievedthefollowingresults:DataLoadingPerformance:1.
420millionQuadsLoadedandIndexedperSecond.
605.
4BillionQuadswereloadedandtwoindexeswerecreatedin115.
2hours.
Note:GraphloadinginOracleDatabaseisuniqueintheindustryforcheckingthatquadsarewellformedandforremovingduplicates.
InferencePerformance:1.
527millionTriplesInferredandIndexedperSecond.
475.
6BillionTriplesandtwoindexeswerecreatedin86.
5hours.
SPARQLQueryPerformance:1.
130MillionQueryResultsperSecond.
92.
5BillionAnswersweregeneratedin22.
5hours.
ATrilliontriplesgraphAssertedInferredTotalAnswers605.
4BillionQuads475.
6BillionTriples1.
081TrillionQuads92.
5BillionTheConfigurationThemarket-leadingperformanceofthisbenchmarkwasduetothecombinationofthenativeRDFgraphstorecapabilitiesofRDFGraphinOracleDatabaseonthebalancedconfigurationofanOracleExadataDatabaseMachineX4-2.
TheuniquecapabilitiesoftheExadataDatabaseMachinethatassistedbenchmarkqueryperformanceinclude:SmartScanthatreducesdatamovementbetweenstorageservers(cells)anddatabaseserverbypushingqueriesdowntothestoragecell,storageindexesusedbythestoragecelltoreadonlyregionsofstoragethathaverelevantdata,andInfiniBandfabricthatprovidesfasttransfer(40Gb/second)ofrelevantbytesbacktothedatabaseservertocompletetheexecutionofaquery.
2http://swat.
cse.
lehigh.
edu/projects/lubm/3https://www.
w3.
org/wiki/LargeTripleStores6WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTheOracleExadataDatabaseMachineX4-2Highcapacityfullrackwasconfiguredasfollows:8databasenodesand14storagenodesforatotalof168CPUcores2TBtotalRAMand44.
8TBFlashCacheZS3-2storagewith2controllersand8traysofdisksSoftware:OracleDatabase12.
1.
0.
1standardinstallationonExadataDatabaseMachine.
BestPracticesUsedThebestpracticesfallintotwocategories,databasesettingsandtuning.
Databasesettings:SGA_TARGET=132GBPGA_AGGREGATE_TARGET=100GOpencursors=1000Processes=100032KblocksizegiventoallgraphtablespacesaTEMPgroupcreatedwith3bigfiletablespacesUseoftheauto-allocateoptionforallocationoftablespaceextentscoupledwithalarge,8millionbytesextentsize.
ThisreducedthenumberofwaitscausedbyHVenqueuecontention;thatis,waitsonalockthatisusedtoalterthehigh-watermarkinatablespace.
Asaresult,contentionamongmultipleprocessesrequestingtablespaceexpansioncouldbeavoided.
DOPsettings(296,256,192)forautomaticdegreesofparallelismusedinloading,inferencing,andquerying.
UseofadditionalcompressionbeyondbasictablecompressionduringinferencingprovidedbytheHybridColumnarCompressionfeatureofOracleExadataDatabaseMachine.
Tuning:OracleEnterpriseManagerprovidedspecificperformanceinsightsintooperationsfortuning.
ThemethodologyusedisdocumentedintheOracleDatabasePerformanceTuningGuide.
4CONCLUSIONRDFgraphsprovideunique,standards-based,bigdatacapabilitiesformetadataintegration,anddiscoverytosupportsocialnetworksandlinkeddataapplicationsinavarietyofindustries.
RDFGraphdemonstratedindustry-leadingscalabilityandperformanceforloading,inference,andqueryingaonetrillionedgesRDFgraphmanagedinOracleDatabase.
TheLUBM4400kRDFgraphbenchmarkbenefitedfromthebalancedhardwareconfigurationofanOracleExadataDatabaseMachineX4-2.
Thebestpracticessettingsusedtoachievethesebenchmarkresultsarealsogenerallyapplicabletoreal-worldapplicationsonOracleExadataDatabaseMachineandotherbalancedhardwareconfigurations.
4http://docs.
oracle.
com/database/121/TGDBA/toc.
htm7WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicCONNECTWITHUSCall+1.
800.
ORACLE1orvisitoracle.
com.
OutsideNorthAmerica,findyourlocalofficeatoracle.
com/contact.
blogs.
oracle.
comfacebook.
com/oracletwitter.
com/oracleCopyright2020,Oracleand/oritsaffiliates.
Allrightsreserved.
Thisdocumentisprovidedforinformationpurposesonly,andthecontentshereofaresubjecttochangewithoutnotice.
Thisdocumentisnotwarrantedtobeerror-free,norsubjecttoanyotherwarrantiesorconditions,whetherexpressedorallyorimpliedinlaw,includingimpliedwarrantiesandconditionsofmerchantabilityorfitnessforaparticularpurpose.
Wespecificallydisclaimanyliabilitywithrespecttothisdocument,andnocontractualobligationsareformedeitherdirectlyorindirectlybythisdocument.
Thisdocumentmaynotbereproducedortransmittedinanyformorbyanymeans,electronicormechanical,foranypurpose,withoutourpriorwrittenpermission.
ThisdevicehasnotbeenauthorizedasrequiredbytherulesoftheFederalCommunicationsCommission.
Thisdeviceisnot,andmaynotbe,offeredforsaleorlease,orsoldorleased,untilauthorizationisobtained.
OracleandJavaareregisteredtrademarksofOracleand/oritsaffiliates.
Othernamesmaybetrademarksoftheirrespectiveowners.
IntelandIntelXeonaretrademarksorregisteredtrademarksofIntelCorporation.
AllSPARCtrademarksareusedunderlicenseandaretrademarksorregisteredtrademarksofSPARCInternational,Inc.
AMD,Opteron,theAMDlogo,andtheAMDOpteronlogoaretrademarksorregisteredtrademarksofAdvancedMicroDevices.
UNIXisaregisteredtrademarkofTheOpenGroup.
0120OracleDatabase:BenchmarkingaTrillionEdgesRDFGraphJune2020
前些天赵容分享过DogYun(狗云)香港BGP线路AMD 5950X经典低价云服务器的信息(点击查看),刚好账户还有点余额够开个最低配,所以手贱尝试下,这些贴上简单测试信息,方便大家参考。官方网站:www.dogyun.com主机配置我搞的是最低款优惠后14.4元/月的,配置单核,512MB内存,10GB硬盘,300GB/50Mbps月流量。基本信息DogYun的VPS主机管理集成在会员中心,包括...
今天看到群里的老秦同学在布局自己的网站项目,这个同学还是比较奇怪的,他就喜欢用这些奇怪的域名。比如前几天看到有用.in域名,个人网站他用的.me域名不奇怪,这个还是常见的。今天看到他在做的一个范文网站的域名,居然用的是 .asia 后缀。问到其理由,是有不错好记的前缀。这里简单的搜索到.ASIA域名的新注册价格是有促销的,大约35元首年左右,续费大约是80元左右,这个成本算的话,比COM域名还贵。...
香港ceranetworks提速啦是成立于2012年的十分老牌的一个商家这次给大家评测的是 香港ceranetworks 8核16G 100M 这款产品 提速啦老板真的是豪气每次都给高配我测试 不像别的商家每次就给1核1G,废话不多说开始跑脚本。香港ceranetworks 2核2G 50G硬盘20M 69元/月30M 99元/月50M 219元/月100M 519元/月香港ceranetwork...
oracle数据库视频为你推荐
站酷zcool站酷zcool字体下载后怎么安装到PS中敬汉卿姓名被抢注身份证号码被别人抢注了12306帐号怎么办12306崩溃iphone 12306网络错误ip在线查询通过对方的IP地址怎么样找到他的详细地址?www.mywife.ccmywife哪部最经典partnersonline电脑内一切浏览器无法打开www.15job.com南方人才市场有官方网站是什么?机器蜘蛛求一个美国的科幻电影名!里面有大型的机械蜘蛛。汴京清谈汴京还被称为什么?龚如敏请问这张图片出自哪里?
fc2新域名 securitycenter l5520 ev证书 搜狗12306抢票助手 七夕促销 怎么测试下载速度 asp免费空间申请 老左来了 腾讯实名认证中心 免费测手机号 绍兴电信 服务器监测 网站在线扫描 重庆电信服务器托管 路由跟踪 服务器论坛 摩尔庄园注册 电信宽带测速软件 阿里云个人邮箱 更多