ThePlanteomeProjectLaurelCooper,AustinMeier,JustinL.
Elser,JustinPreece,XuXu,RyanS.
Kitchen,BotongQu,EugeneZhang,SinisaTodorovic,PankajJaiswalOregonStateUniversity,Corvallis,OR,USAMarie-AngéliqueLaporte,ElizabethArnaudBioversityInternational,Montpellier,FranceSethCarbon,ChrisMungallLawrenceBerkeleyNationalLaboratory,Berkeley,CA,USABarrySmithUniversityatBuffalo,Buffalo,NY,USAGeorgiosGkoutosUniversityofBirmingham,UKandUniversityofAberystwyth,UKJohnDoonanUniversityofAberystwyth,UKAbstract—ThePlanteomeprojectisacentralizedonlineplantinformaticsportalwhichprovidessemanticintegrationofwidelydiversedatasetswiththegoalofplantimprovement.
Traditionalplantbreedingmethodsforcropimprovementmaybecombinedwithnext-generationanalysismethodsandautomatedscoringoftraitsandphenotypestodevelopimprovedvarieties.
ThePlanteomeproject(www.
planteome.
org)developsandhostsasuiteofreferenceontologiesforplantsassociatedwithagrowingcorpusofgenomicsdata.
Dataannotationslinkingphenotypesandgermplasmtogenomicsresourcesareachievedbydatatransformationandmappingspecies-specificcontrolledvocabulariestothereferenceontologies.
Analysisandannotationtoolsarebeingdevelopedtofacilitatestudiesofplanttraits,phenotypes,diseases,genefunctionandexpressionandgeneticdiversitydataacrossawiderangeofplantspecies.
TheprojectdatabaseandtheonlineresourcesprovideresearcherstoolstosearchandbrowseandaccessremotelyviaAPIsforsemanticintegrationinannotationtoolsanddatarepositoriesprovidingresourcesforplantbiology,breeding,genomicsandgenetics.
Keywords—ontology;traitsphenotype;semantic;dataintegration,plantsI.
INTRODUCTIONA.
RationaleItisestimatedthattheworldpopulationisprojectedtoreach9.
6billionpeopleinnextfewdecades(http://www.
wri.
org/blog/2013/12/global-food-challenge-explained-18-graphics).
Therefore,thechallengeishowtofeedthisgrowingpopulation,whileprotectingtheearth'senvironment.
Traditionalplantbreedingmethodsforplantimprovementmaybecombinedwithnext-generationanalysismethods,includingthehigh-throughputandautomatedscoringoftraitsandphenotypestodevelopimprovedvarieties.
Datafromhigh-throughputsequencing,transcriptomic,proteomic,phenomicandgenomeannotationprojectscanbelinkedtogermplasmresourcesthroughtheuseofinteroperable,referencevocabularies(ontologies).
Inthisway,theknowledgegainedfromthenext-generationdatacanbeutilizedforcropimprovement.
B.
WhatisthePlanteomeThePlanteomeProject(www.
planteome.
org)isacentralizedonlineinformaticsportalanddatabase,consistingofasuiteofreferenceontologiesforplants,anassociatedcorpusofplantgenomicsandphenomicsdata,andtoolsfordataanalysisandannotation.
Analysesofthesedatasetsfromgeneticandgenomicstudieshavethepotentialtoimproveourunderstandingofthemolecularbasisofeconomicallyrelevanttraits.
Inordertoutilizethisdata,researchersmustbeabletoconnecttherelevantplanttraitsofinteresttothespatialandtemporalexpressionpatternsofgenes,andelucidatetheirrolesinbiologicalprocessesinplants.
C.
GoalsofthePlanteomeProject:1.
Asuiteofinterrelatedreferenceontologiestodescribemajorknowledgedomainsofplantbiology,comprisingplantphenotypeandtraits,environments,andbioticandabioticstresses.
2.
Standards,workflowsandtoolsforannotationofplantgenomicsdata,andmetadataforcurationandimprovedannotationofgenes,genomes,phenotypeandgermplasm.
3.
ThePlanteomebrowseranddatabase,acentralized,onlineinformaticsportalandrepositorywherereferenceontologiesforplantsareusedtoaccessdataresourcesforplanttraits,phenotypes,diseases,geneexpressionandgeneticdiversitydataacrossawiderangeofplantspecies.
4.
OutreachinvolvingtheplantresearchcommunityandK-12andundergraduatestudents.
II.
THESCOPEOFTHEPLANTEOMEThescopeoftheontologiesinthePlanteomeprojectrangesfromabroadoverviewofplantenvironmentsandtaxonomy,tothecellularandmolecularlevelofexpressedgenesandtheirbiologicalfunctions.
ThePlanteomeontologies,describedinmoredetailbelow,consistofthePlantOntology(PO)[1-6],PlantTraitOntology(TO)[7,8],thePlantEnvironmentOntology(EO)[7]andthePlantStressOntology(PSO).
ThePlanteomeprojectimportsandintegrateswithrelevantreferenceontologiesdevelopedbycollaboratinggroups;theGeneOntology(GO)[9,10],thePhenotypicQualitiesOntology(PATO)[11],theEnvironmentOntology(ENVO)[12],andtheChemicalEntitiesofBiologicalInterest(ChEBI)[13].
Inaddition,thePlanteomeintegratesandmapsspecies-orclade-specificapplicationontologiesdevelopedbytheCropOntology(CO)project[14].
Togetherthissuiteofreferenceontologiescanbeusedtofullyannotateandlinktogetherthevitalplantknowledgedomain.
Thecentralreferenceontologyforplantanatomyandplantdevelopmentalstages,thePlantOntology(PO)[1-6]grewoutoftheneedtocreateassociationsbetweenstandardizedterminologyforplantsandgenomicsdata,andwasbasedtheworkdonetodeveloptheGeneOntologyinthelate1990s[9,10].
ThePOisrecognizedworldwideasthereferenceontologyforplantstructuresanddevelopmentalstages,andislinkedtodatafromawidevarietyofplants,fromtraditionalmodelspeciestothecropplantsthatfeedtheworld'sgrowingpopulation.
Plantimprovementreliesonanalysesofplanttraitsandphenotypes.
Forthesepurposes,thePlantTraitOntology(TO)[9,10]describesawiderangeofprecomposedplanttraitsconsistentwithEntity(E)-Quality(Q)statementsandleadstoanunderstandingofthemolecularprocessesthatunderliethem.
Eachtraitisameasurableorobservablecharacteristicofaplantstructure(PO:000901),aplantcellularcomponent(GO:0005575),oraplantstructuredevelopmentstage(PO:0009012),aswellasplantbiologicalprocesses(GO:0008150)andmolecularfunctions(GO:0003674).
TheTOencompassesninebroad,upper-levelcategoriesofplanttraits:biochemicaltrait(TO:0000277),biologicalprocesstrait(TO:0000283),plantgrowthanddevelopmenttrait(TO:0000357),plantmorphologytrait(TO:0000017),qualitytrait(TO:0000597),statureorvigortrait(TO:0000133),sterilityorfertilitytrait(TO:0000392),stresstrait(TO:0000164)andyieldtrait(TO:0000371).
ThePlantEnvironmentOntology(EO)isusedtodescribetheplantgrowthconditionsandstudytypesandcanbecombinedwiththetermsfromtheotherreferenceontologiestofullyannotateaplantphenotypedescription.
Inadditiontothereferenceontologies,thePlanteomeworkscloselywithdevelopersofthespecies-specificvocabulariessuchastheCropOntology[14]tointegratetheirterms,createmappingstothereferenceontologiesandlinkphenotypesandgermplasmtogenomicsresources.
III.
DEVELOPMENTOFTHEPLANTEOMEONTOLOGYNETWORKThedevelopmentofthePlanteomeProjectontologynetworkisafundamentalchangeinthewayofthinkingaboutontologiesforplants.
Inthepreviousproject,thePlantOntology(http://www.
plantontology.
org/),asinglereferenceontologywasdevelopedandusedtoannotateplantgenomicdatatoontologytermsdescribingplantstructuresandplantdevelopmentalstages.
Theadditionoftheotherreferenceandspecies-specifcontologiesforplantsenrichestheannotationenvironmentsoamorecompletepictureofthemetadataofplantpheotypescanbeexpressed.
Inordertocreatethenetwork,ontologytermsintheTOandthespecies-specifccroptraitontologieshavebeen'decomposed'intothecorrespondingEntity(E)-Quality(Q)statementswhichutilizetermsfromtheotherreferenceontologies,suchasPOandGOfortheentitiesandPATOforthequalities.
Inthisway,anetworkisformedwhichlinksallthevariousontologiestogether.
Oneofthelessonslearnedindevelopingthisnetworkisthatsomeofthereferenceontologiesandvocabulariesdevelopedbyourcollaborators(suchasChEBI,andtheNCBITaxonomy)aresolargethattheyarecumbersometodisplayonourbrowser.
Forthese,wehavedevelopedscripttoextractarelevant"slim"versionwhichcontainstheneededterms.
IV.
PLANTEOMEANNOTATIONDATABASEThePlanteomedatabaseprovidesontologytermsanddefinitionsalongwiththeassociated'annotations'[15],betweentheontologytermsanddatasourcedfromnumerousplantgenomicsdatasets.
ThePlanteome1.
0BetaRelease(Nov.
2015)containsabout47millionannotationslinkingreferenceontologytermstodataobjectsrepresentinggenes,genemodels,proteins,RNAs,germplasmandquantitativetraitloci(QTLs)from87differentplantspecies.
Thesedataarecurrentlycontributedby29differentdatasources.
Planteomecuratorsandresearchersatvariouscollaboratingdatabasegroupsworkcloselytodeveloptheannotationfilesinthestandardizeddataformatdatabase.
Thedatabaseisaccessibleonline(http://planteome.
org/)andalsoavailableforbulkdownload(http://palea.
cgrb.
oregonstate.
edu/viewsvn/associations/).
TheannotationdatabaseincludesfunctionalGeneOntologyannotationsfor60species.
Thesepredictionsweredoneusingtwomethods.
ThefirstmethodutilizedanInterProScan[16]toidentifyproteindomains.
TheresultinganalysisfileswerethenparsedtoassociatetheproteindomainstoGOterms.
ThesecondmethodwastoprojectontologyannotationsbasedonFig.
1.
AnnotationofRicebrd1mutantwithreferenceontologytermstocapturethephenotype.
Thericeplantimageisadaptedwithpermissionfrom[19]JohnWileyandSons.
orthologytoArabidopsisthalianagenes.
OrthologywaspredictedwithInParanoid[17],aprogramthattakesreciprocalBLASToutputandusespairwisesimilarityscorestodetermineorthologousclustersofgenes.
Thisisfollowedbycreatinggenesuperclustersbypoolingspecies-pairclusterswithcommongenes.
Theorthologoussuperclustersofthe60specieswerecomparedwiththeknownannotationfilesforArabidopsisthalianaforGO,andnewannotationfilesweregenerated.
PlanteomeistheonlyonlinesourceprovidingGOfunctionalannotationofgenesidentifiedformanyofthesespecies.
V.
CASESTUDYEXAMPLE:PHENOTYPEANNOTATIONOFRICEBRASSINOSTEROID(BR)-DEFICIENTDWARFMUTANTBrassinosteroid(BR)-deficient(brd1)dwarfmutantsofricewerecharacterizedtodeterminetherolesthatBRsplayinnormalplantgrowthanddevelopmentinamonocotplant[19].
Fig.
1showsanexampleofhowthereferenceontologiescanbeusedtoannotatethephenotypeofa(BR)-Deficientdwarfmutantrice,brd1-1.
ThisimageisacompliationofontologytermsfromvariousPlanteomereferenceontologiesthathavebeenusedtoannotatetheexpressionofbrd1(Os03g0602300)inthePlanteomedatabase.
Theseannotationswerecontributedfromavarietyofsources,suchasGramene(http://www.
gramene.
org/),EnsemblPlants(http://plants.
ensembl.
org/index.
html),andTheRiceAnnotationProject(RAP)(http://rapdb.
dna.
affrc.
go.
jp/)andcanbeusedtodescribeallaspectsofthebrd1mutantphenotype.
GatheringtheannotationstogetherinaunifiedplatformsuchasthePlanteomeallowsthedatatobemadeaccessibleandfacilitatesgenediscoverythroughinter-andintra-speciescomparisons.
VI.
PLANTEOMETOOLSFORCOLLABORATIONANDONTOLOGYINTEGRATIONThePlanteomeprojectisdevelopinganumberoftoolstoincreaseaccesstotheontologytermsandtoincreasetheinteroperabilityoftheannotateddata.
AllthePlanteomeontologiesarepublicallyavailableandaremaintainedatthePlanteomeGitHubsite(https://github.
com/Planteome)forsharingandtrackingrevsions.
Thissitefacilitatescommunityfeedback;userscanmakecomments,requesttermsandsuggestchangestothePlanteomeontologies.
Inaddition,thePlanteomeGitHubsitealsofeaturesspecies-specificvocabulariessuchasthosefromCropOntology(http://www.
cropontology.
org/).
AnothernewtoolwhichisunderdevelopmentisaTraitOntology-specific(http://to.
termgenie.
org/)instanceoftheTermGenietool[20].
TermGenieusesapattern-basedapproachtorapidlygeneratenewtermsandplacethemappropriatelywithintheontologystructure.
AlltermsarereviewedbyaPlanteomecuratorbeforethefinalcommittotheontology.
TermGeniecanbeusedtoquicklyobtainaTOtermforannotation,ifanappriopriateonedoesnotalreadyexist.
Planteomeisdevelopinganapplicationprogramminginterface(API)thatwillallowcollaboratorstoaccessandusethehosteddataintheirwebsitesandapplications.
ThefirsttwoAPImethods–currentlyaccessiblefromthePlanteomedevelopmentenvironment–queryPlanteome-hostedontologiesforterms,termdefinitions,andotherattributes,returningtheminJSONformat.
The"search"methodisfastenoughtobeusedinanautocompletesearchbox.
AllthePlanteomereferenceandspecies-specificontologiesareavailablethroughtheAPIservice.
Currently,theAPIonlyservestheterminformation,butthePlanteomeprojectplanstoaddAPImethodstoaccessannotationdata,aswell.
ThePlanteomeprojectiscollaboratingwiththeBisqueImageAnalysisEnvironment(CenterforBio-ImageInformatics,UCSB;http://www.
cyverse.
org/bisque)onintegratedimagesegmentationandontologyannotationfeatures.
ThePlanteomeprojectalreadyhostssuchatoolasadesktopapplication;AnnotationofImageSegmentswithOntologies(AISO;http://planteome.
org/node/3),butwewishtomoveitsfunctionalityonlineasamodulewithinBisque,takingadvantageofitssharedCyVerseauthentication,datastore,andcomputationinfrastructure.
Theontologydataitselfwillbeservedfromexternalservices,suchasthePlanteomeAPI.
VII.
CONCLUSIONSThePlanteomeprojectisacentralizedonlineplantinformaticsportalandwhichintegratesreferenceontologiesforplants,andspecies-specificcontrolledvocabularieswithalargeandgrowingcorpusofplantgenomicsdata.
Thisplatformprovidessemanticintegrationofwidelydiversedatasetswiththegoalofplantimprovement.
ACKNOWLEDGMENTFundingforthePlanteomeprojectisprovidedbytheNationalScienceFoundationawardIOS#1340112REFERENCES[1]Jaiswal,P,SAvraham,KIlic,EAKellogg,SMcCouch,APujar,etal.
,2005.
PlantOntology(PO):AControlledVocabularyofPlantStructuresandGrowthStages.
CompFunctGenomics,.
6(7--‐8):p.
388-97(references)[2]Pujar,A,PJaiswal,EAKellogg,KIlic,LVincent,SAvraham,etal.
2006.
Whole-‐plantgrowthstageontologyforangiospermsanditsapplicationinplantbiology.
PlantPhysiol,142(2):p.
414--‐28.
[3]Ilic,K,EAKellogg,PJaiswal,FZapata,PFStevens,LPVincent,etal.
,2007.
Theplantstructureontology,aunifiedvocabularyofanatomyandmorphologyofafloweringplant.
PlantPhysiol.
143(2):p.
587--‐599.
[4]Avraham,S,CWTung,KIlic,PJaiswal,EAKellogg,SMcCouch,etal.
,2008.
ThePlantOntologyDatabase:acommunityresourceforplantstructureanddevelopmentalstagescontrolledvocabularyandannotations.
NucleicAcidsRes.
,36(Databaseissue):p.
D449--‐54.
.
[5]CooperL,WallsRL,ElserJ,GandolfoMA,StevensonDW,SmithB,etal.
(2013)ThePlantOntologyasatoolforcomparativeplantanatomyandgenomicanalyses.
PlantandCellPhysiology54:e1–e1[6]CooperLandJaiswalP(2016)ThePlantOntology:AToolforPlantGenomics.
InDEdwards,ed,PlantBioinformatics.
SpringerNewYork,pp89–114[7]JaiswalP,WareD,NiJ,ChangK,ZhaoW,SchmidtS,etal.
(2002)Gramene:developmentandintegrationoftraitandgeneontologiesforrice.
ComparativeandFunctionalGenomics3:132–136.
[8]ArnaudE,CooperL,ShresthaR,MendaN,NelsonRT,MatteisL,etal.
(2012)TowardsareferencePlantTraitOntologyformodelingknowledgeofplanttraitsandphenotypes.
ProceedingsoftheInternationalConferenceonKnowledgeEngineeringandOntologyDevelopment.
Barcelona,Spain,pp220–225.
[9]AshburnerM,BallCA,BlakeJA,BotsteinD,ButlerH,CherryJM,etal.
(2000)GeneOntology:toolfortheunificationofbiology.
NatGenet25:25–29.
[10]TheGeneOntologyConsortium(2014)GeneOntologyConsortium:goingforward.
NucleicAcidsResearch.
doi:10.
1093/nar/gku1179.
[11]GkoutosG,GreenE,MallonA-M,HancockJ,DavidsonD(2004)Usingontologiestodescribemousephenotypes.
GenomeBiol6:R8[12]ButtigiegP,MorrisonN,SmithB,MungallC,LewisS(2013)Theenvironmentontology:contextualisingbiologicalandbiomedicalentities.
JournalofBiomedicalSemantics4:43[13]HastingsJ,OwenG,DekkerA,EnnisM,KaleN,MuthukrishnanV,etal.
(2016)ChEBIin2016:Improvedservicesandanexpandingcollectionofmetabolites.
NucleicAcidsResearch44:D1214–D1219[14]Shrestha,R,Davenport,GFBruskiewich,R,Arnaud,E.
(2011)Developmentofcropontologyforsharingcropphenotypicinformation.
Droughtphenotypingincrops:fromtheorytopractice.
pp171–179[15]HillDP,SmithB,McAndrews-HillMS,BlakeJ(2008)GeneOntologyannotations:whattheymeanandwheretheycomefrom.
BMCBioinformatics9:S2[16]QuevillonE,SilventoinenV,PillaiS,etal.
2005.
InterProScan:proteindomainsidentifier.
NucleicAcidsResearch.
33(WebServerissue):W116-W120.
doi:10.
1093/nar/gki442.
[17]RemmM,StormCEVandSonnhammerELL(2001).
AutomaticClusteringofOrthologsandIn-paralogsfromPairwiseSpeciesComparisons.
JMB,314:1041-1052.
[18]Altschul,SF,Madden,TL,Schffer,AA,Zhang,J,Zhang,Z,Miller,W,etal.
(1997).
GappedBLASTandPSI-BLAST:anewgenerationofproteindatabasesearchprograms.
NucleicAcidsRes.
25:3389-3402.
[19]Hong,Z,Ueguchi-Tanaka,M,Shimizu-Sato,S,Inukai,Y,Fujioka,S,Shimada,Y,etal(2002)Loss-of-functionofaricebrassinosteroidbiosyntheticenzyme,C-6oxidase,preventstheorganizedarrangementandpolarelongationofcellsintheleavesandstem.
ThePlantJournal32:495–508[20]Dietze,H,Berardini,T,Foulger,R,Hill,D,Lomax,J,OsumiSutherland,D,RoncagliaP,MungallC(2014)TermGenie-Awebapplicationforpattern-basedontologyclassgeneration.
JournalofBiomedicalSemantics5:48[21]LingutlaN,PreeceJ,TodorovicS,CooperL,MooreL,JaiswalP(2014)AISO:AnnotationofImageSegmentswithOntologies.
JournalofBiomedicalSemantics5:50
Pia云是一家2018的开办的国人商家,原名叫哔哔云,目前整合到了魔方云平台上,商家主要销售VPS服务,采用KVM虚拟架构 ,机房有美国洛杉矶、中国香港和深圳地区,洛杉矶为crea机房,三网回程CN2 GIA,带20G防御,常看我测评的朋友应该知道,一般带防御去程都是骨干线路,香港的线路也是CN2直连大陆,目前商家重新开业,价格非常美丽,性价比较非常高,有需要的朋友可以关注一下。活动方案...
Hostigger 主机商在前面的文章中也有介绍过几次,这个商家运营时间是有一些年份,只不过在我们圈内好像之前出现的次数不多。最近这段时间商家有提供不限流量的VPS主机,逐渐的慢慢被人认识到。在前面的介绍到他们提供的机房还是比较多的,比如土耳其、美国等。今天看到Hostigger 商家居然改动挺大的,原来蛮好的域名居然这次连带官方域名都更换掉去掉一个G(Hostiger )。估摸着这个域名也是之前...
提速啦简单介绍下提速啦 是成立于2012年的IDC老兵 长期以来是很多入门级IDC用户的必选商家 便宜 稳定 廉价 是你创业分销的不二之选,目前市场上很多的商家都是从提速啦拿货然后去分销的。提速啦最新物理机活动 爆炸便宜的香港CN2物理服务器 和 日本CN2物理服务器香港CTG E5 2650 16G内存 20M CN2带宽 1T硬盘 150元/月日本CN2 E5 2650 16G内存 20M C...
www.meansys为你推荐
硬盘工作原理硬盘的读写原理咏春大师被ko咏春高手散打冠军林文学近况5xoy.comhttp www.05eee.comwww.hhh258comwww.tx88d.com 有这个网站吗?lcoc.topoffsettop和scrolltop的区别partnersonline国内有哪些知名的ACCA培训机构www.03024.comwww.sohu.com是什么m.yushuwu.org花样滑冰名将YU NA KIM的资料谁有?javlibrary.com大家有没有在线图书馆WWW。QUESTIA。COM的免费帐号hao.rising.cn如何解除瑞星主页锁定(hao.rising.cn). 不想用瑞星安全助手
免费网站空间 windows虚拟主机 vps动态ip 已经备案域名 BWH 大容量存储 免费全能空间 灵动鬼影 腾讯实名认证中心 qq云端 免费测手机号 购买国外空间 台湾google net空间 万网主机 免费网络空间 免费稳定空间 上海联通 windowsserver2012r2 cdn加速 更多