ThePlanteomeProjectLaurelCooper,AustinMeier,JustinL.
Elser,JustinPreece,XuXu,RyanS.
Kitchen,BotongQu,EugeneZhang,SinisaTodorovic,PankajJaiswalOregonStateUniversity,Corvallis,OR,USAMarie-AngéliqueLaporte,ElizabethArnaudBioversityInternational,Montpellier,FranceSethCarbon,ChrisMungallLawrenceBerkeleyNationalLaboratory,Berkeley,CA,USABarrySmithUniversityatBuffalo,Buffalo,NY,USAGeorgiosGkoutosUniversityofBirmingham,UKandUniversityofAberystwyth,UKJohnDoonanUniversityofAberystwyth,UKAbstract—ThePlanteomeprojectisacentralizedonlineplantinformaticsportalwhichprovidessemanticintegrationofwidelydiversedatasetswiththegoalofplantimprovement.
Traditionalplantbreedingmethodsforcropimprovementmaybecombinedwithnext-generationanalysismethodsandautomatedscoringoftraitsandphenotypestodevelopimprovedvarieties.
ThePlanteomeproject(www.
planteome.
org)developsandhostsasuiteofreferenceontologiesforplantsassociatedwithagrowingcorpusofgenomicsdata.
Dataannotationslinkingphenotypesandgermplasmtogenomicsresourcesareachievedbydatatransformationandmappingspecies-specificcontrolledvocabulariestothereferenceontologies.
Analysisandannotationtoolsarebeingdevelopedtofacilitatestudiesofplanttraits,phenotypes,diseases,genefunctionandexpressionandgeneticdiversitydataacrossawiderangeofplantspecies.
TheprojectdatabaseandtheonlineresourcesprovideresearcherstoolstosearchandbrowseandaccessremotelyviaAPIsforsemanticintegrationinannotationtoolsanddatarepositoriesprovidingresourcesforplantbiology,breeding,genomicsandgenetics.
Keywords—ontology;traitsphenotype;semantic;dataintegration,plantsI.
INTRODUCTIONA.
RationaleItisestimatedthattheworldpopulationisprojectedtoreach9.
6billionpeopleinnextfewdecades(http://www.
wri.
org/blog/2013/12/global-food-challenge-explained-18-graphics).
Therefore,thechallengeishowtofeedthisgrowingpopulation,whileprotectingtheearth'senvironment.
Traditionalplantbreedingmethodsforplantimprovementmaybecombinedwithnext-generationanalysismethods,includingthehigh-throughputandautomatedscoringoftraitsandphenotypestodevelopimprovedvarieties.
Datafromhigh-throughputsequencing,transcriptomic,proteomic,phenomicandgenomeannotationprojectscanbelinkedtogermplasmresourcesthroughtheuseofinteroperable,referencevocabularies(ontologies).
Inthisway,theknowledgegainedfromthenext-generationdatacanbeutilizedforcropimprovement.
B.
WhatisthePlanteomeThePlanteomeProject(www.
planteome.
org)isacentralizedonlineinformaticsportalanddatabase,consistingofasuiteofreferenceontologiesforplants,anassociatedcorpusofplantgenomicsandphenomicsdata,andtoolsfordataanalysisandannotation.
Analysesofthesedatasetsfromgeneticandgenomicstudieshavethepotentialtoimproveourunderstandingofthemolecularbasisofeconomicallyrelevanttraits.
Inordertoutilizethisdata,researchersmustbeabletoconnecttherelevantplanttraitsofinteresttothespatialandtemporalexpressionpatternsofgenes,andelucidatetheirrolesinbiologicalprocessesinplants.
C.
GoalsofthePlanteomeProject:1.
Asuiteofinterrelatedreferenceontologiestodescribemajorknowledgedomainsofplantbiology,comprisingplantphenotypeandtraits,environments,andbioticandabioticstresses.
2.
Standards,workflowsandtoolsforannotationofplantgenomicsdata,andmetadataforcurationandimprovedannotationofgenes,genomes,phenotypeandgermplasm.
3.
ThePlanteomebrowseranddatabase,acentralized,onlineinformaticsportalandrepositorywherereferenceontologiesforplantsareusedtoaccessdataresourcesforplanttraits,phenotypes,diseases,geneexpressionandgeneticdiversitydataacrossawiderangeofplantspecies.
4.
OutreachinvolvingtheplantresearchcommunityandK-12andundergraduatestudents.
II.
THESCOPEOFTHEPLANTEOMEThescopeoftheontologiesinthePlanteomeprojectrangesfromabroadoverviewofplantenvironmentsandtaxonomy,tothecellularandmolecularlevelofexpressedgenesandtheirbiologicalfunctions.
ThePlanteomeontologies,describedinmoredetailbelow,consistofthePlantOntology(PO)[1-6],PlantTraitOntology(TO)[7,8],thePlantEnvironmentOntology(EO)[7]andthePlantStressOntology(PSO).
ThePlanteomeprojectimportsandintegrateswithrelevantreferenceontologiesdevelopedbycollaboratinggroups;theGeneOntology(GO)[9,10],thePhenotypicQualitiesOntology(PATO)[11],theEnvironmentOntology(ENVO)[12],andtheChemicalEntitiesofBiologicalInterest(ChEBI)[13].
Inaddition,thePlanteomeintegratesandmapsspecies-orclade-specificapplicationontologiesdevelopedbytheCropOntology(CO)project[14].
Togetherthissuiteofreferenceontologiescanbeusedtofullyannotateandlinktogetherthevitalplantknowledgedomain.
Thecentralreferenceontologyforplantanatomyandplantdevelopmentalstages,thePlantOntology(PO)[1-6]grewoutoftheneedtocreateassociationsbetweenstandardizedterminologyforplantsandgenomicsdata,andwasbasedtheworkdonetodeveloptheGeneOntologyinthelate1990s[9,10].
ThePOisrecognizedworldwideasthereferenceontologyforplantstructuresanddevelopmentalstages,andislinkedtodatafromawidevarietyofplants,fromtraditionalmodelspeciestothecropplantsthatfeedtheworld'sgrowingpopulation.
Plantimprovementreliesonanalysesofplanttraitsandphenotypes.
Forthesepurposes,thePlantTraitOntology(TO)[9,10]describesawiderangeofprecomposedplanttraitsconsistentwithEntity(E)-Quality(Q)statementsandleadstoanunderstandingofthemolecularprocessesthatunderliethem.
Eachtraitisameasurableorobservablecharacteristicofaplantstructure(PO:000901),aplantcellularcomponent(GO:0005575),oraplantstructuredevelopmentstage(PO:0009012),aswellasplantbiologicalprocesses(GO:0008150)andmolecularfunctions(GO:0003674).
TheTOencompassesninebroad,upper-levelcategoriesofplanttraits:biochemicaltrait(TO:0000277),biologicalprocesstrait(TO:0000283),plantgrowthanddevelopmenttrait(TO:0000357),plantmorphologytrait(TO:0000017),qualitytrait(TO:0000597),statureorvigortrait(TO:0000133),sterilityorfertilitytrait(TO:0000392),stresstrait(TO:0000164)andyieldtrait(TO:0000371).
ThePlantEnvironmentOntology(EO)isusedtodescribetheplantgrowthconditionsandstudytypesandcanbecombinedwiththetermsfromtheotherreferenceontologiestofullyannotateaplantphenotypedescription.
Inadditiontothereferenceontologies,thePlanteomeworkscloselywithdevelopersofthespecies-specificvocabulariessuchastheCropOntology[14]tointegratetheirterms,createmappingstothereferenceontologiesandlinkphenotypesandgermplasmtogenomicsresources.
III.
DEVELOPMENTOFTHEPLANTEOMEONTOLOGYNETWORKThedevelopmentofthePlanteomeProjectontologynetworkisafundamentalchangeinthewayofthinkingaboutontologiesforplants.
Inthepreviousproject,thePlantOntology(http://www.
plantontology.
org/),asinglereferenceontologywasdevelopedandusedtoannotateplantgenomicdatatoontologytermsdescribingplantstructuresandplantdevelopmentalstages.
Theadditionoftheotherreferenceandspecies-specifcontologiesforplantsenrichestheannotationenvironmentsoamorecompletepictureofthemetadataofplantpheotypescanbeexpressed.
Inordertocreatethenetwork,ontologytermsintheTOandthespecies-specifccroptraitontologieshavebeen'decomposed'intothecorrespondingEntity(E)-Quality(Q)statementswhichutilizetermsfromtheotherreferenceontologies,suchasPOandGOfortheentitiesandPATOforthequalities.
Inthisway,anetworkisformedwhichlinksallthevariousontologiestogether.
Oneofthelessonslearnedindevelopingthisnetworkisthatsomeofthereferenceontologiesandvocabulariesdevelopedbyourcollaborators(suchasChEBI,andtheNCBITaxonomy)aresolargethattheyarecumbersometodisplayonourbrowser.
Forthese,wehavedevelopedscripttoextractarelevant"slim"versionwhichcontainstheneededterms.
IV.
PLANTEOMEANNOTATIONDATABASEThePlanteomedatabaseprovidesontologytermsanddefinitionsalongwiththeassociated'annotations'[15],betweentheontologytermsanddatasourcedfromnumerousplantgenomicsdatasets.
ThePlanteome1.
0BetaRelease(Nov.
2015)containsabout47millionannotationslinkingreferenceontologytermstodataobjectsrepresentinggenes,genemodels,proteins,RNAs,germplasmandquantitativetraitloci(QTLs)from87differentplantspecies.
Thesedataarecurrentlycontributedby29differentdatasources.
Planteomecuratorsandresearchersatvariouscollaboratingdatabasegroupsworkcloselytodeveloptheannotationfilesinthestandardizeddataformatdatabase.
Thedatabaseisaccessibleonline(http://planteome.
org/)andalsoavailableforbulkdownload(http://palea.
cgrb.
oregonstate.
edu/viewsvn/associations/).
TheannotationdatabaseincludesfunctionalGeneOntologyannotationsfor60species.
Thesepredictionsweredoneusingtwomethods.
ThefirstmethodutilizedanInterProScan[16]toidentifyproteindomains.
TheresultinganalysisfileswerethenparsedtoassociatetheproteindomainstoGOterms.
ThesecondmethodwastoprojectontologyannotationsbasedonFig.
1.
AnnotationofRicebrd1mutantwithreferenceontologytermstocapturethephenotype.
Thericeplantimageisadaptedwithpermissionfrom[19]JohnWileyandSons.
orthologytoArabidopsisthalianagenes.
OrthologywaspredictedwithInParanoid[17],aprogramthattakesreciprocalBLASToutputandusespairwisesimilarityscorestodetermineorthologousclustersofgenes.
Thisisfollowedbycreatinggenesuperclustersbypoolingspecies-pairclusterswithcommongenes.
Theorthologoussuperclustersofthe60specieswerecomparedwiththeknownannotationfilesforArabidopsisthalianaforGO,andnewannotationfilesweregenerated.
PlanteomeistheonlyonlinesourceprovidingGOfunctionalannotationofgenesidentifiedformanyofthesespecies.
V.
CASESTUDYEXAMPLE:PHENOTYPEANNOTATIONOFRICEBRASSINOSTEROID(BR)-DEFICIENTDWARFMUTANTBrassinosteroid(BR)-deficient(brd1)dwarfmutantsofricewerecharacterizedtodeterminetherolesthatBRsplayinnormalplantgrowthanddevelopmentinamonocotplant[19].
Fig.
1showsanexampleofhowthereferenceontologiescanbeusedtoannotatethephenotypeofa(BR)-Deficientdwarfmutantrice,brd1-1.
ThisimageisacompliationofontologytermsfromvariousPlanteomereferenceontologiesthathavebeenusedtoannotatetheexpressionofbrd1(Os03g0602300)inthePlanteomedatabase.
Theseannotationswerecontributedfromavarietyofsources,suchasGramene(http://www.
gramene.
org/),EnsemblPlants(http://plants.
ensembl.
org/index.
html),andTheRiceAnnotationProject(RAP)(http://rapdb.
dna.
affrc.
go.
jp/)andcanbeusedtodescribeallaspectsofthebrd1mutantphenotype.
GatheringtheannotationstogetherinaunifiedplatformsuchasthePlanteomeallowsthedatatobemadeaccessibleandfacilitatesgenediscoverythroughinter-andintra-speciescomparisons.
VI.
PLANTEOMETOOLSFORCOLLABORATIONANDONTOLOGYINTEGRATIONThePlanteomeprojectisdevelopinganumberoftoolstoincreaseaccesstotheontologytermsandtoincreasetheinteroperabilityoftheannotateddata.
AllthePlanteomeontologiesarepublicallyavailableandaremaintainedatthePlanteomeGitHubsite(https://github.
com/Planteome)forsharingandtrackingrevsions.
Thissitefacilitatescommunityfeedback;userscanmakecomments,requesttermsandsuggestchangestothePlanteomeontologies.
Inaddition,thePlanteomeGitHubsitealsofeaturesspecies-specificvocabulariessuchasthosefromCropOntology(http://www.
cropontology.
org/).
AnothernewtoolwhichisunderdevelopmentisaTraitOntology-specific(http://to.
termgenie.
org/)instanceoftheTermGenietool[20].
TermGenieusesapattern-basedapproachtorapidlygeneratenewtermsandplacethemappropriatelywithintheontologystructure.
AlltermsarereviewedbyaPlanteomecuratorbeforethefinalcommittotheontology.
TermGeniecanbeusedtoquicklyobtainaTOtermforannotation,ifanappriopriateonedoesnotalreadyexist.
Planteomeisdevelopinganapplicationprogramminginterface(API)thatwillallowcollaboratorstoaccessandusethehosteddataintheirwebsitesandapplications.
ThefirsttwoAPImethods–currentlyaccessiblefromthePlanteomedevelopmentenvironment–queryPlanteome-hostedontologiesforterms,termdefinitions,andotherattributes,returningtheminJSONformat.
The"search"methodisfastenoughtobeusedinanautocompletesearchbox.
AllthePlanteomereferenceandspecies-specificontologiesareavailablethroughtheAPIservice.
Currently,theAPIonlyservestheterminformation,butthePlanteomeprojectplanstoaddAPImethodstoaccessannotationdata,aswell.
ThePlanteomeprojectiscollaboratingwiththeBisqueImageAnalysisEnvironment(CenterforBio-ImageInformatics,UCSB;http://www.
cyverse.
org/bisque)onintegratedimagesegmentationandontologyannotationfeatures.
ThePlanteomeprojectalreadyhostssuchatoolasadesktopapplication;AnnotationofImageSegmentswithOntologies(AISO;http://planteome.
org/node/3),butwewishtomoveitsfunctionalityonlineasamodulewithinBisque,takingadvantageofitssharedCyVerseauthentication,datastore,andcomputationinfrastructure.
Theontologydataitselfwillbeservedfromexternalservices,suchasthePlanteomeAPI.
VII.
CONCLUSIONSThePlanteomeprojectisacentralizedonlineplantinformaticsportalandwhichintegratesreferenceontologiesforplants,andspecies-specificcontrolledvocabularieswithalargeandgrowingcorpusofplantgenomicsdata.
Thisplatformprovidessemanticintegrationofwidelydiversedatasetswiththegoalofplantimprovement.
ACKNOWLEDGMENTFundingforthePlanteomeprojectisprovidedbytheNationalScienceFoundationawardIOS#1340112REFERENCES[1]Jaiswal,P,SAvraham,KIlic,EAKellogg,SMcCouch,APujar,etal.
,2005.
PlantOntology(PO):AControlledVocabularyofPlantStructuresandGrowthStages.
CompFunctGenomics,.
6(7--‐8):p.
388-97(references)[2]Pujar,A,PJaiswal,EAKellogg,KIlic,LVincent,SAvraham,etal.
2006.
Whole-‐plantgrowthstageontologyforangiospermsanditsapplicationinplantbiology.
PlantPhysiol,142(2):p.
414--‐28.
[3]Ilic,K,EAKellogg,PJaiswal,FZapata,PFStevens,LPVincent,etal.
,2007.
Theplantstructureontology,aunifiedvocabularyofanatomyandmorphologyofafloweringplant.
PlantPhysiol.
143(2):p.
587--‐599.
[4]Avraham,S,CWTung,KIlic,PJaiswal,EAKellogg,SMcCouch,etal.
,2008.
ThePlantOntologyDatabase:acommunityresourceforplantstructureanddevelopmentalstagescontrolledvocabularyandannotations.
NucleicAcidsRes.
,36(Databaseissue):p.
D449--‐54.
.
[5]CooperL,WallsRL,ElserJ,GandolfoMA,StevensonDW,SmithB,etal.
(2013)ThePlantOntologyasatoolforcomparativeplantanatomyandgenomicanalyses.
PlantandCellPhysiology54:e1–e1[6]CooperLandJaiswalP(2016)ThePlantOntology:AToolforPlantGenomics.
InDEdwards,ed,PlantBioinformatics.
SpringerNewYork,pp89–114[7]JaiswalP,WareD,NiJ,ChangK,ZhaoW,SchmidtS,etal.
(2002)Gramene:developmentandintegrationoftraitandgeneontologiesforrice.
ComparativeandFunctionalGenomics3:132–136.
[8]ArnaudE,CooperL,ShresthaR,MendaN,NelsonRT,MatteisL,etal.
(2012)TowardsareferencePlantTraitOntologyformodelingknowledgeofplanttraitsandphenotypes.
ProceedingsoftheInternationalConferenceonKnowledgeEngineeringandOntologyDevelopment.
Barcelona,Spain,pp220–225.
[9]AshburnerM,BallCA,BlakeJA,BotsteinD,ButlerH,CherryJM,etal.
(2000)GeneOntology:toolfortheunificationofbiology.
NatGenet25:25–29.
[10]TheGeneOntologyConsortium(2014)GeneOntologyConsortium:goingforward.
NucleicAcidsResearch.
doi:10.
1093/nar/gku1179.
[11]GkoutosG,GreenE,MallonA-M,HancockJ,DavidsonD(2004)Usingontologiestodescribemousephenotypes.
GenomeBiol6:R8[12]ButtigiegP,MorrisonN,SmithB,MungallC,LewisS(2013)Theenvironmentontology:contextualisingbiologicalandbiomedicalentities.
JournalofBiomedicalSemantics4:43[13]HastingsJ,OwenG,DekkerA,EnnisM,KaleN,MuthukrishnanV,etal.
(2016)ChEBIin2016:Improvedservicesandanexpandingcollectionofmetabolites.
NucleicAcidsResearch44:D1214–D1219[14]Shrestha,R,Davenport,GFBruskiewich,R,Arnaud,E.
(2011)Developmentofcropontologyforsharingcropphenotypicinformation.
Droughtphenotypingincrops:fromtheorytopractice.
pp171–179[15]HillDP,SmithB,McAndrews-HillMS,BlakeJ(2008)GeneOntologyannotations:whattheymeanandwheretheycomefrom.
BMCBioinformatics9:S2[16]QuevillonE,SilventoinenV,PillaiS,etal.
2005.
InterProScan:proteindomainsidentifier.
NucleicAcidsResearch.
33(WebServerissue):W116-W120.
doi:10.
1093/nar/gki442.
[17]RemmM,StormCEVandSonnhammerELL(2001).
AutomaticClusteringofOrthologsandIn-paralogsfromPairwiseSpeciesComparisons.
JMB,314:1041-1052.
[18]Altschul,SF,Madden,TL,Schffer,AA,Zhang,J,Zhang,Z,Miller,W,etal.
(1997).
GappedBLASTandPSI-BLAST:anewgenerationofproteindatabasesearchprograms.
NucleicAcidsRes.
25:3389-3402.
[19]Hong,Z,Ueguchi-Tanaka,M,Shimizu-Sato,S,Inukai,Y,Fujioka,S,Shimada,Y,etal(2002)Loss-of-functionofaricebrassinosteroidbiosyntheticenzyme,C-6oxidase,preventstheorganizedarrangementandpolarelongationofcellsintheleavesandstem.
ThePlantJournal32:495–508[20]Dietze,H,Berardini,T,Foulger,R,Hill,D,Lomax,J,OsumiSutherland,D,RoncagliaP,MungallC(2014)TermGenie-Awebapplicationforpattern-basedontologyclassgeneration.
JournalofBiomedicalSemantics5:48[21]LingutlaN,PreeceJ,TodorovicS,CooperL,MooreL,JaiswalP(2014)AISO:AnnotationofImageSegmentswithOntologies.
JournalofBiomedicalSemantics5:50
菠萝云国人商家,今天分享一下菠萝云的广州移动机房的套餐,广州移动机房分为NAT套餐和VDS套餐,NAT就是只给端口,共享IP,VDS有自己的独立IP,可做站,商家给的带宽起步为200M,最高给到800M,目前有一个8折的优惠,另外VDS有一个下单立减100元的活动,有需要的朋友可以看看。菠萝云优惠套餐:广州移动NAT套餐,开放100个TCP+UDP固定端口,共享IP,8折优惠码:gzydnat-8...
昔日数据怎么样?昔日数据是一个来自国内服务器销售商,成立于2020年底,主要销售国内海外云服务器,目前有国内湖北十堰云服务器和香港hkbn云服务器 采用KVM虚拟化技术构架,湖北十堰机房10M带宽月付19元起;香港HKBN,月付12元起; 此次夏日活动全部首月5折促销,有需要的可以关注一下。点击进入:昔日数据官方网站地址昔日数据优惠码:优惠码: XR2021 全场通用(活动持续半个月 2021/7...
炭云怎么样?炭云(之前的碳云),国人商家,正规公司(哈尔滨桓林信息技术有限公司),主机之家测评介绍过多次。现在上海CN2共享IP的VPS有一款特价,上海cn2 vps,2核/384MB内存/8GB空间/800GB流量/77Mbps端口/共享IP/Hyper-v,188元/年,特别适合电信网络。有需要的可以关注一下。点击进入:炭云官方网站地址炭云vps套餐:套餐cpu内存硬盘流量/带宽ip价格购买上...
www.meansys为你推荐
编程小学生惊库克大家觉得VIPCODE少儿编程怎么样?permissiondenied求问permission denied是什么意思啊?特朗普取消访问丹麦特朗普当选总统后对准备出国留学的学生有什么影响微信回应封杀钉钉为什么微信被封以后然后解封了过了一会又被封了地图应用什么地图导航最好用最准确李子柒年入1.6亿宋朝鼎盛时期 政府财政收入有将近1亿贯铜钱,那么GDP是多少呢?同ip网站同IP的两个网站,做单向链接,会不会被K掉??网站检测如何进行网站全面诊断lcoc.toptop weenie 是什么?www.99vv1.comwww.in9.com是什么网站啊?
仿牌空间 a2hosting 10t等于多少g ixwebhosting 360抢票助手 evssl 免费网站申请 微信收钱 域名转向 服务器维护方案 网络空间租赁 vip购优惠 昆明蜗牛家 Updog 宏讯 海外空间 太原联通测速 新疆服务器 免费获得q币 美国代理服务器 更多