recommendations37
yw372:Com 时间:2021-02-13 阅读:(
)
DISCOVERYANDANALYSISOFWEBUSAGEMININGMARATHEDAGADUMITHARAMR.
C.
PatelA.
C.
S.
College,Shirpur,Maharashtra,IndiaABSTRACTInthispaperwedescribesomeofthemostcommontypesofpatterndiscoveryandanalysistechniquesemployedintheWebusagemining.
InthispapermentionAssociationandClusterAnalysis.
AssociationRuleisafundamentalofDataminingtask.
Itsobjectivetofindallco-occurrencerelationshipcalled,Associationamongdataitem.
LetI={i1,i2,…,im}beasetofitems.
LetT=(t1,t2,…,tn)beasetoftransactions.
ClusteranalysisandvisitorssegmentationClusteringisadataminingtechniquethatgroupstogetherasetofitemshavingsimilarcharacteristics.
Intheusagedomain,therearetwokindsofinterestingclustersthatcanbediscovered:userclustersandpageclusters.
GoalDiscoveryandanalysisofwebusagepatternsusingAssociationanalysis.
DiscoveryandanalysisofwebusagepatternsusingClusterAnalysisandVisitorssegmentation.
KEYWORDS:AssociationAnalysis,ClusterAnalysisandVisitorsSegmentationINTRODUCTIONAssociationrulediscoveryandstatisticalcorrelationanalysiscanfindgroupsofitemsorpagesthatarecommonlyaccessedorpurchasedtogether.
AssociationbasedonApriorialgorithm.
Thisalgorithmfindsgroupsofitemusingsupportandconfidence.
Satisfyingauserspecifiedminimumsupportthreshold.
Suchgroupsofitemsarereferredtoasfrequentitemsets&frequentitemsetsgraph.
Logfilesgeneratedbywebserverscontainenormousamountsofwebusagedatathatispotentiallyvaluableforunderstandingthebehaviorofwebsitevisitors.
Clusteringofuserrecords(sessionsortransactions)isoneofthemostcommonlyusedanalysistasksinWebusageminingandWebanalytics.
Clusteringofuserstendstoestablishgroupsofusersexhibitingsimilarbrowsingpatterns.
Suchknowledgeisespeciallyusefulforinferringuserdemographicsinordertoperformmarketsegmentationine-commerceapplicationsorprovidepersonalizedWebcontenttotheuserswithsimilarinterests.
Furtheranalysisofusergroupsbasedontheirdemographicattributes(e.
g.
,age,gender,incomelevel,etc.
)mayleadtothediscoveryofvaluablebusinessintelligence.
Usage-basedclusteringhasalsobeenusedtocreateWeb-based"usercommunities"reflectingsimilarinterestsofgroupsofusers,andtolearnusermodelsthatcanbeusedtoprovidedynamicrecommendationsinWebpersonalizationapplications.
ASSOCIATIONRULESupport&ConfidenceTheSupportofrule,XYthepercentageoftransactioninTthatcontainsXUY.
nisthenumberoftransactioninT.
Supportisusefulmeasurementofitemsetoritems.
IfXistruethenchecksforY,ifXisfalsethennothingtobesayY.
InthefollowingexampleXunionYthencount.
InternationalJournalofComputerScienceEngineeringandInformationTechnologyResearch(IJCSEITR)ISSN2249-6831Vol.
3,Issue1,Mar2013,313-320TJPRCPvt.
Ltd.
314MaratheDagaduMitharame.
g.
(XUY).
CountSupportN(XUY).
CountConfidenceX.
CountUsingaboveexampleswecanaccepttheminsubandminconf.
Tocalculateminsubandminconfasfollows.
T1C++,JAVA,RUBYT2C++,ASPT3ASP,VBT4C++,JAVA,ASPT5C++,JAVA,PHP,ASP,RUBYT6JAVA,PHP,RUBYT7JAVA,RUBY,PHPJAVA,PHPRUBY[sup=3/7,conf=3/3]Inabove7transactionsJAVA,PHP&RUBYshow3/7times.
EveryitemchecksitemsettoeveryusingJoiningandPruningsteps.
Inwebusageminingsuchrulecanbeusetooptimizestructureofwebsite.
e.
g.
Language,/product/softwareRCPACSCOLLEGEWebsiteEXPERIMENT-FINDINGWEBUSAGEASSOCIATIONRULESInstances:14Attributes:5outlooktemperatureDiscoveryandAnalysisofWebUsageMining315humiditywindyplayIfchecksunny,falseyes[sub1/14conf1/1]Thepurposeofthisexperimentwastogivesomeinsightintotheusefulnessofassociationruleswhentheyareappliedtotheweblogdatasetofaneducationinstitutionandothers.
Weexpectedtofindrulesthatcorrelatetowebpagesthatcontaininformationaboutsunny,rainyortemperatureetc.
SupposethisistransactiontableandfindoutFrequentItemsetthen,T1C++,JAVA,RUBYT2C++,ASPT3ASP,VBT4C++,JAVA,ASPT5C++,JAVA,PHP,ASP,RUBYT6JAVA,PHP,RUBYT7JAVA,RUBY,PHPSize1Size2Size3Size4ItemSetSupp.
ItemSetSupp.
ItemSetSupp.
ItemSetSupp.
C++4C++,JAVA3C++,JAVA,RUBY2C++,JAVA,RUBY,ASP1JAVA5C++,RUBY2C++,JAVA,ASP2C++,JAVA,RUBY,PHP1RUBY4C++,ASP3JAVA,RUBY,ASP1ASP4C++,PHP1JAVA,RUBY,PHP3VB1JAVA,RUBY4RUBY,ASP,PHP1PHP3JAVA,ASP2JAVA,PHP3RUBY,ASP1RUBY,PHP3ASP,PHP1Figure1:WebTransactionsandResultingFrequentItemsets(Minsup=1)FindoutFrequentItemsetbyUsingJoiningandPruningMethodsofAssociationRuleFREQUENTITEMSETGRAPHFig.
2,findsitemsC++andRUBYascandidaterecommendations.
TherecommendationscoresofitemAandCare1,correspondingtotheconfidencesoftherules,JAVA,ASP->C++andJAVA,ASP->RUBY,respectively.
Aproblemwithusingasingleglobalminimumsupportthresholdinassociationruleminingisthatthediscoveredpatternswillnotinclude"rare"butimportantitemswhichmaynotoccurfrequentlyinthetransactiondata.
316MaratheDagaduMitharamC=C++J=JAVAA=ASPR=RUBYP=PHPFigure2:FrequentItemsetsCLUSTERANALYSISANDVISITORSSEGMENTATIONConceptandExampleClusteringofuserrecords(sessionsortransactions)isoneofthemostcommonlyusedanalysistasksinWebusageminingandWebanalytics.
Clusteringofuserstendstoestablishgroupsofusersexhibitingsimilarbrowsingpatterns.
Suchknowledgeisespeciallyusefulforinferringuserdemographicsinordertoperformmarketsegmentationine-commerceapplicationsorprovidepersonalizedWebcontenttotheuserswithsimilarinterests.
DiscoveryandAnalysisofWebUsageMining317HereweUsetheformulaof"WebDataMining"-Bingliubook.
Asanexample,considerthetransactiondatadepictedinsimplicityweassumethatfeature(pageview)weightsineachtransactionvectorarebinary(incontrasttoweightsbasedonafunctionofpageviewduration).
Weassumethatthedatahasalreadybeenclusteredusingastandardclusteringalgorithmsuchask-means,resultinginthreeclustersofusertransactions.
Itshowstheaggregateprofilecorrespondingtocluster1.
Asindicatedbythepageviewweights,pageviewsBandFarethemostsignificantpagescharacterizingthecommoninterestsofusersinthissegment.
PageviewC,however,onlyappearsinonetransactionandmightberemovedgivenafilteringthresholdgreaterthan0.
25.
Suchpatternsareusefulforcharacterizinguserorcustomersegments.
Thisexample,forinstance,indicatesthattheresultingusersegmentisclearlyinterestedinitemsBandFandtoalesserdegreeinitemA.
GivenanewuserwhoshowsinterestinitemsAandB,thispatternmaybeusedtoinferthattheusermightbelongtothissegmentand,therefore,wemightrecommenditemFtothatuser.
ExperimentandResultsInthisexperimentwedefinetable"weather"anddefinefields.
318MaratheDagaduMitharamOutputUsingClusterinWeka===Runinformation===Scheme:weka.
clusterers.
HierarchicalClusterer-N2-LSINGLE-P-A"weka.
core.
EuclideanDistance-Rfirst-last"Relation:weatherInstances:13Attributes:5outlooktemperaturehumiditywindyIgnoredplayTestmode:Classestoclustersevaluationontrainingdata===Modelandevaluationontrainingset===Cluster0((((((1.
0:0.
18505,1.
0:0.
18505):0.
05959,1.
0:0.
24464):0.
7557,(1.
0:0.
16832,(1.
0:0.
08235,1.
0:0.
08235):0.
08597):0.
83201):0.
00109,((0.
0:0.
22986,0.
0:0.
22986):0.
77157,0.
0:1.
00142):0):0.
00106,(0.
0:0.
21648,0.
0:0.
21648):0.
78601):0.
00135,1.
0:1.
00384)ClusteredInstances012(92%)11(8%)Classattribute:playClassestoClusters:01<--assignedtocluster71|yes50|noCluster0<--yesCluster1<--NoclassIncorrectlyclusteredinstances:6.
046.
1538%DiscoveryandAnalysisofWebUsageMining319VisualizationsofPatternsCONCLUSIONSUsagepatternsdiscoveredthroughWebusageminingareeffectiveincapturingitem-to-itemanduser-to-userrelationshipsandsimilaritiesatthelevelofusersessions.
Thispaperhasattemptedtoforthepurposeofwebusagemining.
TheproposedmethodsweresuccessfullytestedonthedatasetordatabasesusingassociationruleandclusteranalysismethodusingWekaTool.
Ourexperimentsconfirmedthatoneofthemajorissuesinassociationruleandclusterfindingistheexistenceoftoomanyrulesandgroups,allofwhichsatisfydefinedconstraints.
REFERENCES1.
Webdatamining–BingLiu320MaratheDagaduMitharam2.
PPTforWebusagemining-BingLiu3.
Srivastava,J.
,Cooley,R.
,Deshpande,M.
,Tan,P.
N.
(2000).
WebUsageMining:DiscoveryandApplicationsofUsagePatternsfromWebData.
ACMSIGKDD,Jan2000.
4.
JaideepSrivastavaPaper5.
WCA.
Webcharacterizationterminology&definitions.
6.
http://www.
w3.
org/1999/05/WCA-terms/.
Vigenteal19/11/2005
sharktech怎么样?sharktech鲨鱼机房(Sharktech)我们也叫它SK机房,是一家成立于2003年的老牌国外主机商,提供的产品包括独立服务器租用、VPS主机等,自营机房在美国洛杉矶、丹佛、芝加哥和荷兰阿姆斯特丹等,主打高防产品,独立服务器免费提供60Gbps/48Mpps攻击防御。机房提供1-10Gbps带宽不限流量服务器,最低丹佛/荷兰机房每月49美元起,洛杉矶机房最低59美元...
说明一下:gcorelabs的俄罗斯远东机房“伯力”既有“Virtual servers”也有“CLOUD SERVICES”,前者是VPS,后者是云服务器,不是一回事;由于平日大家习惯把VPS和云服务器当做一回事儿,所以这里要特别说明一下。本次测评的是gcorelabs的cloud,也就是云服务器。 官方网站:https://gcorelabs.com 支持:数字加密货币、信用卡、PayPal...
hostkvm怎么样?hostkvm是一家国内老牌主机商家,商家主要销售KVM架构的VPS,目前有美国、日本、韩国、中国香港等地的服务,站长目前还持有他家香港CN2线路的套餐,已经用了一年多了,除了前段时间香港被整段攻击以外,一直非常稳定,是做站的不二选择,目前商家针对香港云地和韩国机房的套餐进行7折优惠,其他套餐为8折,商家支持paypal和支付宝付款。点击进入:hostkvm官方网站地址hos...
yw372:Com为你推荐
企业推广品牌推广的目的是什么?centos6.5如何安装linux centos6.5googlepr百度权重和googlePR都是些什么东西??波音737起飞爆胎一般的客机的起飞速度是多少?internetexplorer无法打开为什么Internet Explorer浏览器无法打开filezilla_serverFileZilla无法连接服务器怎么解决密码cuteftp三友网网测是什么意思?闪拍网关于闪拍网骗人的情况?网站方案设计网站文案策划怎么写
主机域名 电信服务器租赁 浙江vps 申请免费域名 荣耀欧洲 冰山互联 美国主机评论 60g硬盘 php免费空间 亚洲小于500m 流媒体加速 512mb 国内域名 百度云空间 摩尔庄园注册 大化网 睿云 新网dns 北京汽车摇号申请网站 英国伦敦天气 更多