DISCOVERY AND ANALYSIS OF WEB USAGE MINING

MARATHE DAGADU MITHARAM
R. C. Patel A. C. S. College, Shirpur, Maharashtra, India

ABSTRACT
In this paper we describe some of the most common types of pattern discovery and analysis techniques employed in Web usage mining.
The paper covers Association Analysis and Cluster Analysis.
Association rule mining is a fundamental data mining task.
Its objective is to find all co-occurrence relationships, called associations, among data items.
Let I = {i1, i2, …, im} be a set of items.
Let T = (t1, t2, …, tn) be a set of transactions.
Cluster analysis and visitors segmentation: Clustering is a data mining technique that groups together a set of items having similar characteristics.
In the usage domain, there are two kinds of interesting clusters that can be discovered: user clusters and page clusters.
Goal: Discovery and analysis of Web usage patterns using Association Analysis.
Discovery and analysis of Web usage patterns using Cluster Analysis and Visitors Segmentation.
KEYWORDS: Association Analysis, Cluster Analysis and Visitors Segmentation

INTRODUCTION
Association rule discovery and statistical correlation analysis can find groups of items or pages that are commonly accessed or purchased together.
Association rule discovery here is based on the Apriori algorithm.
This algorithm uses support and confidence to find groups of items that satisfy a user-specified minimum support threshold.
Such groups of items are referred to as frequent itemsets and can be arranged in a frequent itemset graph.
Log files generated by web servers contain enormous amounts of Web usage data that is potentially valuable for understanding the behavior of website visitors.
Clustering of user records (sessions or transactions) is one of the most commonly used analysis tasks in Web usage mining and Web analytics.
Clustering of users tends to establish groups of users exhibiting similar browsing patterns.
Such knowledge is especially useful for inferring user demographics in order to perform market segmentation in e-commerce applications or provide personalized Web content to the users with similar interests.
Further analysis of user groups based on their demographic attributes (e.g., age, gender, income level, etc.) may lead to the discovery of valuable business intelligence.
Usage-based clustering has also been used to create Web-based "user communities" reflecting similar interests of groups of users, and to learn user models that can be used to provide dynamic recommendations in Web personalization applications.
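
ASSOCIATION RULE
Support & Confidence
The support of a rule X → Y is the percentage of transactions in T that contain X ∪ Y, where n is the number of transactions in T.
Support is a useful measure of how frequently an itemset occurs.
Confidence measures the predictability of the rule: if X holds in a transaction, the rule predicts Y; if X does not hold, the rule says nothing about Y.
Both measures are based on (X ∪ Y).count, the number of transactions in T that contain X ∪ Y.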
International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR), ISSN 2249-6831, Vol. 3, Issue 1, Mar 2013, 313-320, TJPRC Pvt. Ltd.
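e.g.

$$\text{support} = \frac{(X \cup Y).\text{count}}{n}$$

$$\text{confidence} = \frac{(X \cup Y).\text{count}}{X.\text{count}}$$

With these definitions, a rule is accepted only if its support and confidence reach the user-specified minimum support (minsup) and minimum confidence (minconf).
Support and confidence are calculated as in the following example.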
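T1  C++, JAVA, RUBY
T2  C++, ASP
T3  ASP, VB
T4  C++, JAVA, ASP
T5  C++, JAVA, PHP, ASP, RUBY
T6  JAVA, PHP, RUBY
T7  JAVA, RUBY, PHP

JAVA, PHP → RUBY [sup = 3/7, conf = 3/3]

In the above 7 transactions, JAVA, PHP and RUBY appear together 3 times (T5, T6 and T7), so the support is 3/7. All 3 transactions that contain JAVA and PHP also contain RUBY, so the confidence is 3/3.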
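The support and confidence of the example rule can be checked with a few lines of Python. This is a minimal sketch, not part of the original experiment; the helper names `count`, `support` and `confidence` are illustrative assumptions, and exact fractions are used so the result can be compared directly with the values quoted above.

```python
from fractions import Fraction

# The seven transactions T1..T7 from the example.
transactions = [
    {"C++", "JAVA", "RUBY"},
    {"C++", "ASP"},
    {"ASP", "VB"},
    {"C++", "JAVA", "ASP"},
    {"C++", "JAVA", "PHP", "ASP", "RUBY"},
    {"JAVA", "PHP", "RUBY"},
    {"JAVA", "RUBY", "PHP"},
]

def count(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)

def support(x, y, transactions):
    """(X u Y).count / n for the rule X -> Y."""
    return Fraction(count(x | y, transactions), len(transactions))

def confidence(x, y, transactions):
    """(X u Y).count / X.count for the rule X -> Y."""
    return Fraction(count(x | y, transactions), count(x, transactions))

x, y = {"JAVA", "PHP"}, {"RUBY"}
sup = support(x, y, transactions)
conf = confidence(x, y, transactions)
print(f"sup = {sup}, conf = {conf}")  # sup = 3/7, conf = 1
```

A rule would then be kept only if `sup` and `conf` are at least the chosen minsup and minconf.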
Candidate itemsets of every size are generated and checked using the joining and pruning steps.
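The joining and pruning steps can be sketched as follows. This is an illustrative, unoptimized version of level-wise Apriori search run on the seven example transactions; the function name `apriori` and the threshold `min_count=3` (an absolute transaction count) are assumptions chosen for the illustration, not values taken from the paper's experiment.

```python
from itertools import combinations

transactions = [
    {"C++", "JAVA", "RUBY"},
    {"C++", "ASP"},
    {"ASP", "VB"},
    {"C++", "JAVA", "ASP"},
    {"C++", "JAVA", "PHP", "ASP", "RUBY"},
    {"JAVA", "PHP", "RUBY"},
    {"JAVA", "RUBY", "PHP"},
]

def apriori(transactions, min_count):
    """Return {itemset: count} for all itemsets with count >= min_count."""
    # Level 1: count the single items.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_count}
    result = dict(frequent)
    k = 2
    while frequent:
        prev = set(frequent)
        # Join step: merge frequent (k-1)-itemsets into candidate k-itemsets.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: a candidate survives only if all of its
        # (k-1)-subsets are themselves frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
        # Count the surviving candidates against the transactions.
        frequent = {}
        for c in candidates:
            n = sum(1 for t in transactions if c <= t)
            if n >= min_count:
                frequent[c] = n
        result.update(frequent)
        k += 1
    return result

frequent_itemsets = apriori(transactions, min_count=3)
for itemset, n in sorted(frequent_itemsets.items(),
                         key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), n)
```

Lowering `min_count` makes the search keep more itemsets and grow more levels.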
In Web usage mining, such rules can be used to optimize the structure of a website,
e.g. Language, /product/software
RCPACS College Website

EXPERIMENT - FINDING WEB USAGE ASSOCIATION RULES
Instances: 14
Attributes: 5
  outlook
  temperature
  humidity
  windy
  play
For example, checking sunny, false → yes gives [sup = 1/14, conf = 1/1].
The purpose of this experiment was to give some insight into the usefulness of association rules when they are applied to the web log dataset of an educational institution and others.
We expected to find rules that correlate to web pages that contain information about sunny, rainy or temperature etc.
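Suppose the following is the transaction table and we want to find the frequent itemsets:

T1  C++, JAVA, RUBY
T2  C++, ASP
T3  ASP, VB
T4  C++, JAVA, ASP
T5  C++, JAVA, PHP, ASP, RUBY
T6  JAVA, PHP, RUBY
T7  JAVA, RUBY, PHP

Size 1          Size 2              Size 3                   Size 4
Itemset  Supp.  Itemset      Supp.  Itemset           Supp.  Itemset                Supp.
C++      4      C++, JAVA    3      C++, JAVA, RUBY   2      C++, JAVA, RUBY, ASP   1
JAVA     5      C++, RUBY    2      C++, JAVA, ASP    2      C++, JAVA, RUBY, PHP   1
RUBY     4      C++, ASP     3      JAVA, RUBY, ASP   1
ASP      4      C++, PHP     1      JAVA, RUBY, PHP   3
VB       1      JAVA, RUBY   4      RUBY, ASP, PHP    1
PHP      3      JAVA, ASP    2
                JAVA, PHP    3
                RUBY, ASP    1
                RUBY, PHP    3
                ASP, PHP     1

Figure 1: Web Transactions and Resulting Frequent Itemsets (Minsup = 1)

The frequent itemsets are found using the joining and pruning methods of association rule mining.

FREQUENT ITEMSET GRAPH
Given an active session containing JAVA and ASP, the frequent itemset graph in Fig. 2 yields the items C++ and RUBY as candidate recommendations.
The recommendation scores of C++ and RUBY are 1 and 1/2, corresponding to the confidences of the rules JAVA, ASP → C++ and JAVA, ASP → RUBY, respectively (from Figure 1, the support counts are 2 for JAVA, ASP; 2 for C++, JAVA, ASP; and 1 for JAVA, RUBY, ASP).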
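Recommendation scores of this kind can be cross-checked directly against the transactions. The sketch below scores every item outside an active session by the confidence of the rule "session → item"; it scans the raw transactions instead of walking the frequent itemset graph, and the function name `recommend` and the parameter `min_count` are assumptions made for the illustration.

```python
from fractions import Fraction

transactions = [
    {"C++", "JAVA", "RUBY"},
    {"C++", "ASP"},
    {"ASP", "VB"},
    {"C++", "JAVA", "ASP"},
    {"C++", "JAVA", "PHP", "ASP", "RUBY"},
    {"JAVA", "PHP", "RUBY"},
    {"JAVA", "RUBY", "PHP"},
]

def count(itemset):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)

def recommend(active_session, min_count=1):
    """Map each candidate item to conf(active_session -> item)."""
    session = frozenset(active_session)
    base = count(session)
    scores = {}
    if base == 0:
        return scores  # the session never occurs, so no rule applies
    candidates = set().union(*transactions) - session
    for item in candidates:
        joint = count(session | {item})
        # Keep only extensions that meet the minimum support count.
        if joint >= min_count:
            scores[item] = Fraction(joint, base)
    return scores

scores = recommend({"JAVA", "ASP"})
for item, score in sorted(scores.items(), key=lambda kv: (-kv[1], kv[0])):
    print(item, score)
```

Because this version considers every item in the data set, it can return candidates that a graph built from a partial itemset table would not list.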
A problem with using a single global minimum support threshold in association rule mining is that the discovered patterns will not include "rare" but important items which may not occur frequently in the transaction data.
C = C++, J = JAVA, A = ASP, R = RUBY, P = PHP
Figure 2: Frequent Itemsets

CLUSTER ANALYSIS AND VISITORS SEGMENTATION
Concept and Example
Clustering of user records (sessions or transactions) is one of the most commonly used analysis tasks in Web usage mining and Web analytics.
Clustering of users tends to establish groups of users exhibiting similar browsing patterns.
Such knowledge is especially useful for inferring user demographics in order to perform market segmentation in e-commerce applications or provide personalized Web content to the users with similar interests.
Here we use the formulation from Bing Liu's book "Web Data Mining".
As an example, consider the transaction data depicted in the figure. For simplicity, we assume that feature (pageview) weights in each transaction vector are binary (in contrast to weights based on a function of pageview duration).
We assume that the data has already been clustered using a standard clustering algorithm such as k-means, resulting in three clusters of user transactions.
The figure also shows the aggregate profile corresponding to cluster 1.
As indicated by the pageview weights, pageviews B and F are the most significant pages characterizing the common interests of users in this segment.
Pageview C, however, only appears in one transaction and might be removed given a filtering threshold greater than 0.25.
Such patterns are useful for characterizing user or customer segments.
This example, for instance, indicates that the resulting user segment is clearly interested in items B and F and to a lesser degree in item A.
Given a new user who shows interest in items A and B, this pattern may be used to infer that the user might belong to this segment and, therefore, we might recommend item F to that user.
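The aggregate profile idea can be sketched in a few lines. The four binary transactions below are hypothetical, made up only so that B and F appear in every transaction, A in three of four and C in one; they are not the data from the figure. The function names `aggregate_profile` and `recommend` and the threshold 0.5 are likewise assumptions.

```python
# Hypothetical cluster of four binary transaction vectors, written as
# the set of pageviews visited in each transaction.
cluster = [
    {"A", "B", "F"},
    {"A", "B", "F"},
    {"A", "B", "C", "F"},
    {"B", "F"},
]

def aggregate_profile(cluster, threshold=0.0):
    """Mean weight of each pageview across the cluster's transactions,
    keeping only pageviews whose weight is at least `threshold`."""
    pages = sorted(set().union(*cluster))
    profile = {p: sum(1 for t in cluster if p in t) / len(cluster)
               for p in pages}
    return {p: w for p, w in profile.items() if w >= threshold}

def recommend(profile, session):
    """Profile pageviews the user has not visited yet, heaviest first."""
    unseen = [(p, w) for p, w in profile.items() if p not in session]
    return sorted(unseen, key=lambda pw: -pw[1])

full_profile = aggregate_profile(cluster)
filtered = aggregate_profile(cluster, threshold=0.5)  # drops pageview C
suggestions = recommend(filtered, session={"A", "B"})
print(full_profile)
print(suggestions)
```

With these made-up transactions, a user who has visited A and B is offered F, mirroring the reasoning in the text; a real system would first decide which segment the user is closest to, for example with a similarity measure between the session and each profile.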
Experiment and Results
In this experiment we define a table named "weather" and define its fields.
Output Using Cluster in Weka

=== Run information ===
Scheme: weka.clusterers.HierarchicalClusterer -N 2 -L SINGLE -P -A "weka.core.EuclideanDistance -R first-last"
Relation: weather
Instances: 13
Attributes: 5
  outlook
  temperature
  humidity
  windy
Ignored:
  play
Test mode: Classes to clusters evaluation on training data

=== Model and evaluation on training set ===
Cluster 0
((((((1.0:0.18505,1.0:0.18505):0.05959,1.0:0.24464):0.7557,(1.0:0.16832,(1.0:0.08235,1.0:0.08235):0.08597):0.83201):0.00109,((0.0:0.22986,0.0:0.22986):0.77157,0.0:1.00142):0):0.00106,(0.0:0.21648,0.0:0.21648):0.78601):0.00135,1.0:1.00384)

Clustered Instances
0  12 (92%)
1   1 ( 8%)

Class attribute: play
Classes to Clusters:
 0 1 <-- assigned to cluster
 7 1 | yes
 5 0 | no

Cluster 0 <-- yes
Cluster 1 <-- No class

Incorrectly clustered instances: 6.0 46.1538%

Visualizations of Patterns

CONCLUSIONS
Usage patterns discovered through Web usage mining are effective in capturing item-to-item and user-to-user relationships and similarities at the level of user sessions.
This paper has attempted to describe association rule and cluster analysis techniques for the purpose of Web usage mining.
The proposed methods were successfully tested on the data set using the association rule and cluster analysis methods of the Weka tool.
Our experiments confirmed that one of the major issues in association rule and cluster finding is the existence of too many rules and groups, all of which satisfy the defined constraints.

REFERENCES
1. Bing Liu. Web Data Mining.
2. Bing Liu. PPT for Web usage mining.
3. Srivastava, J., Cooley, R., Deshpande, M., Tan, P. N. (2000). Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data. ACM SIGKDD, Jan 2000.
4. Jaideep Srivastava Paper.
5. WCA. Web characterization terminology & definitions.
6. http://www.w3.org/1999/05/WCA-terms/. Current as of 19/11/2005.
