Monarch geneset OGS2.0

DPOGS207452
TranscriptDPOGS207452-TA5607 bp
ProteinDPOGS207452-PA1868 aa
Genomic positionDPSCF300051 - 304322-324497
RNAseq coverage131x (Rank: top 56%)
Annotation
HeliconiusHMEL0148480.088.46% 
BombyxBGIBMGA001194-TA0.075.87% 
Drosophilarab3-GEF-PD0.069.47% 
EBI UniRef50UniRef50_D2A2D70.058.60%Putative uncharacterized protein GLEAN_07834 n=1 Tax=Tribolium castaneum RepID=D2A2D7_TRICA
NCBI RefSeqXP_975086.20.057.59%PREDICTED: similar to rab3-GEF CG5627-PB [Tribolium castaneum]
NCBI nr blastpgi|2700057300.058.60%hypothetical protein TcasGA2_TC007834 [Tribolium castaneum]
NCBI nr blastxgi|2700057300.056.20%hypothetical protein TcasGA2_TC007834 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[82-323] IPR0011941.3e-47DENN
[9-106] IPR0051136.5e-24uDENN
[442-512] IPR0051125.5e-15dDENN
Orthology groupMCL16050 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207452-TA
ATGGACGTCCAAAAACAGCAACTTTGTCCGCGACTTGTGGACTATTTAACAATAGTTGGTGCAAAACCGTACACTACTGGGAAAGGACTAGCCCCGGTGCAGGCGCCAGAACTACTTCGTCGTTATCCCCTAACCAACCATGATGACTTCCCTCTACCGCTGGATATGGTGTATTTCTGCCAGCCAGAAGGTTGCGTGTCTGTGGGGCCTCGGCGCCAACTTGCACATATCGCAACACGGGACACAACCTCCTTCGTATTTACACTTACAGATAAAGATTCGGGTAAGACCCGATATGGCATCTGCATCAATTTTTACCGCGCGATGGAGCGTGCACCGACACCTGGTCCCCGGGAGAGAAGTGTTTTGCGACGCGAGTCTTGGAGGAAGTCTATGGAGAGGAGTTCAGACTCCGCTTTCTCGAGAGATACAGTGTGGTCAGTGCTCACTGGCCAGGCGTACGATAATACGCCGACAATAGTAGTGCATGACGTTAAAGAGATTGAGACGTGGATACTACGATTGCTGTCAGCTCCCGTACCAGTTCCAGGGAAAACTCGACTCGAACTCGAAGTACTCTCACCAACAGCACACGCACCACTGGTATTTGCATTACCTGATCACACAAGATTCACTCTTGTTGACTTCCCTCTACATCTACCATTGGAATTGCTAGGTGTTGATACTTGCTTGAGAGTCTTAACCTTGATAATGCTGGAAAACAAAGTGGTAGTTCAATCGAGGGACTATAACGCGCTCTCAATGTCGGTGATGGCGTTGGTAGCGATGCTGTATCCACTAGAGTACATGTTCCCAGCGATACCACTATTACCGAGTTGTATGAGTTGTGCGGAACAATTACTTCTCGCTCCAACTCCATTCCTAATAGGAATACCCGCTACATTCTTAACATACAAGAAGAATTTTAAATTACCAGATGACATTTGGTTAGTAGACCTGGACGCTACTAAGCTTAGTGGGCCTTACGGTAACGAACAGGACCTACCTCCTCTGCCAGAACCAGAGAGTTCAGTCCTTAAAAACCATCTAAAACAGGCACTTAGCAGCTTAACAAATAGTTCAGCAGAACAAGCTGCAGCGCCTCTTCTACCTTCGAGAAGAGATAGTGTTGGTGGGGCTACATTAAAGGTACAACCAGCGACCTTCCGTGAAGGGTCTCATAGTACCCCAGAGAGTCGACGTGTGTCTGTGGGCAGCGCACACACAAGACTGTCGCTCGCGTCGCCACACTCACCGGCACCTCAGAGCTCTCCACAGGCACAACCGTTCAACCCTTTGATATACGGCAACGACGTAGATTCCGTCGACATCGCTACAAGAGTCGCTATGGTTCGTTTCTTCAATTCTCAAAATATTTTGGCGAATTTTATGGAGCACACTCGAACACTGCGTTTGTATCCACGACCCGTCGTAGCTTTTCAGATCAATAGCTTCTTGCGATCAAGACCTCGATCTTCTTCTTTCTTGAATAAATTCGCTAGAACACAAGCTGTAGAGTTCTTAGCAGAGTGGTCTTTGACACCATGCAATGTTGCTTTCCTAAGAGTACAAACAGGTGTGTTTGATCCTCGACAAATTGGTGATAAACCAAAATGGTTTGCGGACCAACTGCAGCCCATACGATTCGCAGTTTGGGATGACGGCAGTTCACTAAACGGTGCCTTGAGACAGTTGCAAAGACAGGAGAATCAACCAACAGATGAGAGCGGATCTGATTCTGAAGCTGCTGAGAGCACTAGTTCATCATATTCATCTTTAAGCGACTTCGTTTCGGAGATGGCCTCATCAGATTTATCACCAGGTGGAAACGTTCAACAGCACGTTATTGGCGAAACTTATAGTGCTGTAGTACAGGTTCCTATGACACTCTCTTCATCGTTAGATCCGAAAACGGTATACAGTCCACCATCTTCGCTAATATTTGGAGAAGAAGGAGAAAGAGAAGGCCAGCGAGACGGACGGGAGTCCACTTCTCCTTCCCCATCAGCTTCTAGTTCAGATCACAGCGACTTGTCTGATGATGACATCCCTGGAACTATGAACCGTCCCGACATCACCGATCAACCAACACCAGTCAAAAAAAGTGACACAGATAGTGGTAGCTTGGGACGAGAGTCGGACTCATCAACGACTCCTGCGACAGTCGCGTCGCGTCGTCCACGTGATCCTGACCCTGTGCGAGCTTCTCCACATTCCCGCCCTAGGAGTTCTAACAGTGGTGTATCAAGACAAGCGTCGCAAACGTCACTATTAGAACAATTTGCTGCTCAGGCGAAGGAGCTTGTGCGAGAGACCACTCGTCAGAGCAGTCAGGAGGGAATATTAGCGCATATGGACAAAAACGGGAAAGCTGATCAAGGACAAGATAAGAACATATTTGCTCCTTTCGATAAGTTAACCCTTCATGCAAAAAAAGCCGCAGAAGAGGCATCGAAGAGTGTTCAAGAGGCGTCGAAGTCAGCATTAGAAGCGAGTAAAACGGCTACAGCAGTCAGCAAAAACACGTTCGAAGATCTTACTTATGTGGGAAAGTCTACTTTAGGAGATCTTACGAAAAGTGCCAAAGAAGCTGCAGCTAAGAAAGGCTTGTTGAAGGGTGAAAGTCAGGATGCTTCTACTAGTTCCAATGCAAGGAGGGATTCGACAGCGCTGCAGACGACTAACTTACTTGCTACCACACATCGTGACTTCTTTTCCAATATTAGTTCTGACTTAAACGGTCTTGCTGCTTCAACTACTAGTATGTTCAGTGACTTCTTTGGTTCCGCTAAAGGAAAGCAATCAAAACCTGAACCGTCCCCGAATACTCCAATGACAGCAACATTTGGTCCTTTCTCTCAAGGTGCCAAAGGTTTAGTACAGCGCTCGCCACTTATTCGCCATTCTTCTCCAGCACCCGTCGCACCACCAATAAATACGAGATCTACTAATAGCGAAAATCAAGCGTTCCTGAATGATCTTGTACAACACGTTCTTGAAGGAGAAGGCGTTGGATGGCTTAAACTAAATCGTTTAAAGAAGCTAATGGAAGACGAATCATACAGGAACATGGTTCTTAGTAAACTTAATAGAAACTTTAATAGAAAGACTTCACCGAATGATAAAGTGGATGACGTGTTTATAAGCAAACCCGTATGGAAAGGCATGCTAAAAGTACTTCAGGCTGTGGTTCATGGCTTAGAACATACGTATTCCAATTTCGGGCTTGGGGGAATGGCTTCCGTTTTCCAATTAGCCGAAATGGCACACACTCACTATTGGAGTAAAGAATTCGCGGGATTAGAACATGGAGGTATGGCTGGTTCTGCACTATCTGAACATTATGGAAGGCAAGATTACGAGACTCCATTGTCAACGCCGTCTTCCAGAAAGAGCTCGCAGTCCGATGCACCCGTTGTCAATTACCCAGAACAAGAACACGGTGACACTCAGAGTACAACAGAAATCTTCAAGGATATGTTAAATCAAAAACGAAACCTTTTATTTAGCAAGTTGACTTCTTTTGATTCCGATGCCGCGTCATCGGAGTGTTCAGACAGCGGGTCCATTACCACCAATCGCGCGCTCGCCGATCACCGCGCTTCCTTTAAATCAAATCTCTCTGACACTGACGTCATGTTCCTTAATGTTGGTCGACCAGGTGTCAAGGGACGCACCGGTAGTGTATTCTCGACCAAATCTTCTGTTAATGGTAGACCGATGGCCGGAGTACCTACCACCTCACCACTTACCTCTCCTGAAACCGTCCGCACTTATCTATTTCAAGGATTAATAGGGAAAGAGAGATCGAACTTATGGGACCAAATGCAGTTCTGGGAAGATTCATTCTTAGATGCGGTGAGTCAAGAAAGAGATATGATTGGAATGGACCAAGGAGCGAACGAAATGATGGAACGATATAAATGTCTCAGCGAAACGGAACGTAAACGCTTGGAGCACGAGGAAGACAGGCTGCTGTCTACTGCTCTATATAATTTAACTGCCGCGATGGTAATGCTTGGAGTTGAAGCGGACATTATTAGGAATAAAGTAAGACGATTGCTGGCAAAAAGTCATATCGGACTTGTTTATAGCCAAGAAGTCAACCACCTGTTAGATGTTGTCCATAATCTGCATGGAAATGATATAAGTCTTAAGGCTCTTGGATCTCGAGCCACACATCGCGCCACGTTCACAGTCCATGAGCGAGATGCAACTGGAGCTTTACGCTTCTTGGACGTAAGACATGATGGACTCGTTCTAAGAACAACTCAAGGGACCATTGTCGAGAGGTGGTGGTACGAACGTCTCGTGAATATGACGTACAGTCCGAAAATACGAGTATTGTGCCTTTGGAGAAAAAATGGTGGACAGACGCAACTTCATAAATATTATACCAAAAAGTGCAAAGCGCTATACTACTGCATCAAGGAGGCGATGGAGAAAAGCGGAAGGCGACAAGATGCAGCGGAACTGGGCGGGGAATTCCCTGTACAGGATTGTGCTACTGGCGAGGGTGGTCTTATACAGGTGTGCATGGAAGGCGTCGGACTCCTGTTCCACCATAGCAAGTTCTTCGTGCGGCTCGATCACATTCGGAAGTGCTTCACGCAGAATGGGGGTATCTTTGTTTTAGAAGAATTTAATCCTAAAACCAGGCAGATAATTCAAAGAAAATATAAATCTATAATGGTAAGCAGACAAAGTAGTGCTTGCCCTTCATCGTTTCTTCTCAATATGGCTTACTCGGTACTATCTCGTTGTCGAAGTCAATGGAATCCATCCGAATTGCGCACCTTACGTTTCATAATTCACTCTAATGCGGATCAAATATGCTATGCGGTGTTGTGCGTCTTCTCGTACTTCGCGGCTGGGCAGGAACAGAAAAAAGCTATATTGGAGCAAGCGGCACAGATCCATGCACCTGAAGCAACCACGAAAGCTCCGTTGTCCCCAAGAAACGACGATGAAGTGTTCCAGAGCAAGCCTAGCCCACAGAATACAAGAAAGGCATCGGAGGTCCGTGTGAGCGAAACCGAAAGACAGAGGGTGCCACCGGACAAGCCGCGTATTCTACCCGAGCGGCCGAAGCAGCTGATCGAACCAGAACGAGAACCTGCGAATGATGGTTCAGATAGTAAAACAAAAGAGACAGGTAGAAGGGATAGTGAGGGAAGCTCGGAGAGAGAAAGAGACAGCACACGTCCAGCTGCCCAAAGAACGGACAGCTTACCACCACGACGGCCGCCACCACCAGTGTTACCACCGCAGAGACTGGTGCGAGCTTACTCGCAGGCCTCACCCAGACATCACGAACCGCCCTCGATCCCTCCCCGAGTCGGAGTCACCCCACGTGCCGGTCCGCCACCCGCACTACCTCCGAGACAGATGTCGGCAGCTGATGCTAGTGCTAGCCCCAGACATTCAGCGACATCTAGCCCAGTGCGTCGTGAAGGCCTGGCCCGTCAGAGCTCGATTAGTGCATCACCCGCCAGCACAGTTGCCTCCGCCCCATTTTCATCCACTAACCCCTTCACCGCAACTCGACACACAGAATTCGTTATACCGCAACGAGGTTCCCGTCGCCCATCTACCGACCGCAATTAA

Protein sequence:

>DPOGS207452-PA
MDVQKQQLCPRLVDYLTIVGAKPYTTGKGLAPVQAPELLRRYPLTNHDDFPLPLDMVYFCQPEGCVSVGPRRQLAHIATRDTTSFVFTLTDKDSGKTRYGICINFYRAMERAPTPGPRERSVLRRESWRKSMERSSDSAFSRDTVWSVLTGQAYDNTPTIVVHDVKEIETWILRLLSAPVPVPGKTRLELEVLSPTAHAPLVFALPDHTRFTLVDFPLHLPLELLGVDTCLRVLTLIMLENKVVVQSRDYNALSMSVMALVAMLYPLEYMFPAIPLLPSCMSCAEQLLLAPTPFLIGIPATFLTYKKNFKLPDDIWLVDLDATKLSGPYGNEQDLPPLPEPESSVLKNHLKQALSSLTNSSAEQAAAPLLPSRRDSVGGATLKVQPATFREGSHSTPESRRVSVGSAHTRLSLASPHSPAPQSSPQAQPFNPLIYGNDVDSVDIATRVAMVRFFNSQNILANFMEHTRTLRLYPRPVVAFQINSFLRSRPRSSSFLNKFARTQAVEFLAEWSLTPCNVAFLRVQTGVFDPRQIGDKPKWFADQLQPIRFAVWDDGSSLNGALRQLQRQENQPTDESGSDSEAAESTSSSYSSLSDFVSEMASSDLSPGGNVQQHVIGETYSAVVQVPMTLSSSLDPKTVYSPPSSLIFGEEGEREGQRDGRESTSPSPSASSSDHSDLSDDDIPGTMNRPDITDQPTPVKKSDTDSGSLGRESDSSTTPATVASRRPRDPDPVRASPHSRPRSSNSGVSRQASQTSLLEQFAAQAKELVRETTRQSSQEGILAHMDKNGKADQGQDKNIFAPFDKLTLHAKKAAEEASKSVQEASKSALEASKTATAVSKNTFEDLTYVGKSTLGDLTKSAKEAAAKKGLLKGESQDASTSSNARRDSTALQTTNLLATTHRDFFSNISSDLNGLAASTTSMFSDFFGSAKGKQSKPEPSPNTPMTATFGPFSQGAKGLVQRSPLIRHSSPAPVAPPINTRSTNSENQAFLNDLVQHVLEGEGVGWLKLNRLKKLMEDESYRNMVLSKLNRNFNRKTSPNDKVDDVFISKPVWKGMLKVLQAVVHGLEHTYSNFGLGGMASVFQLAEMAHTHYWSKEFAGLEHGGMAGSALSEHYGRQDYETPLSTPSSRKSSQSDAPVVNYPEQEHGDTQSTTEIFKDMLNQKRNLLFSKLTSFDSDAASSECSDSGSITTNRALADHRASFKSNLSDTDVMFLNVGRPGVKGRTGSVFSTKSSVNGRPMAGVPTTSPLTSPETVRTYLFQGLIGKERSNLWDQMQFWEDSFLDAVSQERDMIGMDQGANEMMERYKCLSETERKRLEHEEDRLLSTALYNLTAAMVMLGVEADIIRNKVRRLLAKSHIGLVYSQEVNHLLDVVHNLHGNDISLKALGSRATHRATFTVHERDATGALRFLDVRHDGLVLRTTQGTIVERWWYERLVNMTYSPKIRVLCLWRKNGGQTQLHKYYTKKCKALYYCIKEAMEKSGRRQDAAELGGEFPVQDCATGEGGLIQVCMEGVGLLFHHSKFFVRLDHIRKCFTQNGGIFVLEEFNPKTRQIIQRKYKSIMVSRQSSACPSSFLLNMAYSVLSRCRSQWNPSELRTLRFIIHSNADQICYAVLCVFSYFAAGQEQKKAILEQAAQIHAPEATTKAPLSPRNDDEVFQSKPSPQNTRKASEVRVSETERQRVPPDKPRILPERPKQLIEPEREPANDGSDSKTKETGRRDSEGSSERERDSTRPAAQRTDSLPPRRPPPPVLPPQRLVRAYSQASPRHHEPPSIPPRVGVTPRAGPPPALPPRQMSAADASASPRHSATSSPVRREGLARQSSISASPASTVASAPFSSTNPFTATRHTEFVIPQRGSRRPSTDRN-