Monarch geneset OGS2.0

DPOGS211124
TranscriptDPOGS211124-TA3216 bp
ProteinDPOGS211124-PA1071 aa
Genomic positionDPSCF300007 - 402573-407447
RNAseq coverage189x (Rank: top 48%)
Annotation
HeliconiusHMEL0124150.073.87% 
BombyxBGIBMGA003000-TA0.072.67% 
DrosophilaCG8155-PA1e-17744.50% 
EBI UniRef50UniRef50_D6WJS00.053.82%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJS0_TRICA
NCBI RefSeqXP_001811946.10.053.82%PREDICTED: similar to CG8155 CG8155-PA [Tribolium castaneum]
NCBI nr blastpgi|1892379680.053.82%PREDICTED: similar to CG8155 CG8155-PA [Tribolium castaneum]
NCBI nr blastxgi|1700373190.043.29%TBC1 domain family [Culex quinquefasciatus]
Group
Gene OntologyGO:00050973.3e-56Rab GTPase activator activity
GO:00056223.3e-56intracellular
GO:00323133.3e-56regulation of Rab GTPase activity
KEGG pathway 
InterPro domain[207-440] IPR0001953.3e-56Rab-GAP/TBC domain
Orthology groupMCL15297 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211124-TA
ATGTTTGGGTATAGTAAAGAAGCTGTTCGTGTAAAAGTCAAGAAATGTGAGGGAAAACTACAACCAGAACTTAGAAAATTTTCTGTCGATCCACAAATCACGTCTTTGGAAGTCTTGCAAAGTATTTTAGTAAAAGCATTTGATATAAAATCAGATTTCACATTATCATACAGGACAGTTGATGATTATGGACAGGAAATATACCTGCCACTACTCTCAGACTGGGATCTGGACGCAGCTTTTCTTAAAGCTCATAATATAGCACTAACAATGAGGTTGGAGCCGTGTGTACAGTTAAAGGTTGATATGAAACCATTTGCTGAAGCTTCCGAGGACTGGGAGCCCCCAACCATCCCAGAAGCGCCGGTTGGTCAAGTCAGTAAAGATCAACCGGCAGCTCCACAATTAACTCAAGTCGAGAAACAAGAATCACAGACTGGATTCCAGGGTCTATTTATGAATCAGGTTGAGAAGACGTTTAACATGGTTTCGAGGGCGCTTAACCTTTATGAGGACCCCAACACCCCTCCACGGCCTCCGCTGAATGATATCGAGTTCCGAGCATTTCTAGACGCTGTGGGACAAATAACGAATACAATAAAACTACGAGAGGTTATATACTGCGGAGGAATAGAGCCGAGTCTAAGGAAAGTTGTATGGAAACACATATTGAACGTTTACCCGGACGGTATGACTGGCAAAGAGAGAATGGACTATATGAAAAGAAAAGCCAACGAGTATTACACATTACGATCTCAATGGAAAGACTGTATACAGAGAGGAAAGGTGAATGCTGACTTAGCGTATGTCACAAGCATGGTTCGTAAGGACGTGCTCCGAACGGACAGACATCACAACTTCTACGCCGGAAGTGATGACAATCAAAACATCGCATCGTTATTTAATATACTGACGACGTACGCATTGTATCACCCTACGGTAAGCTACTGCCAAGGTATGTCAGATCTGGCTTCACCACTCCTGGTGACTATGGGTGATGAAGCCCATGCGTACATCTGTCTCTGTGCTCTTATGACTCGGTTGTATCCAAACTTCCTTCTGGACGGAGAGGCAATGACTCTTAAATTCACACACTTAACGGAGTCTTTGCAAGTCTATGATCCAGATTTCTATAATTATCTCAAATCCCAACAGGCCGATGACTTACTTTTCTGCTATAGATGGCTCCTTCTAGAAATGAAAAGGGAATTTGCTTTCGAAGACGCTCTCAGAATGTTGGAAGTATTGTGGGCGTCTCTACCGCAAAAAGCACCCACAGCCGAGTTGCCTTTGAAAGAAAGAGATTTTGATCCAACATTGGAGGTCATTATAGATCCACCTCCATCGAGTCCCTTAGTAAAGACTCCAAGAGAAAACGCTTACACGAAAGTATGCGCAATTAGACGTCAAAGCTCAAGTTTTAGTCTGGCTAACATTAAAGCACCAAAGCTTTCGACTATCCGGCAGTCTAACCACAGCTTGGACGAAAATGCTACTAGAAAAATTTTAAATAACTCCTCATTAATACACGCGAAGGAATTCCAGAGCTTAGATGATGCTGCTTTGCAAAAACAGAAAACTATTAAAGATAATGTTGAAAACGAGGAAAAATTGAAAACTGATAAAGACGGAAAAGATTCTCAAGTTAAAGTTAAGAGGTGTATGAAAAGCGTGCCTGAACACGGTAGAGTGGACACTCAGGAACCCAATACGGTTTGGTCGAGTACAGGCAATCTTCCTAACGGAGTTACGCTAAGTGATACAAAAGACACGCCCCTCGGCAGTCAGATTAGATTACTTAGAGATAAAATATCAGTACATAATAATAGGTTCTTTTCGTCTCTCGACGCTTTGGAAGGACCTAATTCTTCGAACGCAAGTAAACCGACCAAACAAGTGAAAATGATAAAAAATCTCAATGAATTTTTAAACTTTGCCACCGGTAATAAAGACGCAGTAAAGCCAGAAAAACTTCAAAGAACACATTCCGTTATCGAGAGAAAGCATTCCAAAGAGTGCCCTAAAATTATTTTAACCAAGACATCCTACGATGAAAGTGATATTTCTTCGGCCAACCAAAGGAGTAATCGTTTGAGCAAGCTAACCAAAAGTAGTTTTGATACGAACGACGGCAGTAGTCCTGACGATTCACAAGAGTATCATCCGATGACGACTTCCATGACGAGAGAATTAAGACTGGAACTAGAGCACCTGGATCGCCAGGTTTTTGGTAATTCATATGTTAATCGGTGCAGCATGATTTGTGATTCCCCATCCGATTCCACGAGCACCGAGGACAAACCCGCTGAATACGCAGATGCGAAAGGAGACACCGTGGCGACTGACGCTGTGGAATTGAGAGCCGAAGTTGTTGACCCGAATGGGACGGTAAAAAAACCGCTCGACACAATCAAATGTAACGCAAGAAGCGCTGAAGATATATATTTGTGGGAAAATCCACTACATCGTAATACTCCAACAACAAGAACTACAGCGAGCTGCCCTCAAACTCCTGACGAGCAAGCAGAGCTGGATTTCGACGGAGATACGGGCGAGATATTTGAAGAACATTCCGGAAAAAAATCAATTACACCAATAAGGCTATTGAGAAAAAATCACTCCCTAGAACCACAGGAGCCATCGAGGAACAGGAGACATTCCATGACACATTCCAGTGAATCGGATTATTCAGAAAAAGATCCAGTGGACTCTACGGAAACCGTTACAGCCAAGAACATTAATCAAGAGCCTCTCCACTTGAAGTCGCACAAGTTTTTCTCTAACATGTCGCACGAATTAGAAAATGCAAAGAAAAACTGCGAAAGCCTATTGAGCGTGAAACAGACAAACTTACAAAGCGTTGCTTCGACAATAAATAACTTTCAGAAGAAATGCGAGGTTTTCAAGCCGCCGGTACAGAAGAACACGACAGTGTCCACCATCAGCGACAATTCGGTTAAGAATAGTATTAGTAGCTTGCCGTCGCCGGCAGTCTTTGGTGGTGGCAATCCGTTCTTGATGTATTTATGTCTGACAGTATTACTACAGCACAGAGATTACATAATGAGAAACCGGATGGACTACAACGAACTAGCGATGCATTTCGACAAGATGGTGCGTAAACATAACGTAAACAGGGTATTGAATCAGGCGCGGCAGATGTACGCGATGTATTTGAAACAGCAAGCGAACAAGACCGGTGACGTCACCACCTGA

Protein sequence:

>DPOGS211124-PA
MFGYSKEAVRVKVKKCEGKLQPELRKFSVDPQITSLEVLQSILVKAFDIKSDFTLSYRTVDDYGQEIYLPLLSDWDLDAAFLKAHNIALTMRLEPCVQLKVDMKPFAEASEDWEPPTIPEAPVGQVSKDQPAAPQLTQVEKQESQTGFQGLFMNQVEKTFNMVSRALNLYEDPNTPPRPPLNDIEFRAFLDAVGQITNTIKLREVIYCGGIEPSLRKVVWKHILNVYPDGMTGKERMDYMKRKANEYYTLRSQWKDCIQRGKVNADLAYVTSMVRKDVLRTDRHHNFYAGSDDNQNIASLFNILTTYALYHPTVSYCQGMSDLASPLLVTMGDEAHAYICLCALMTRLYPNFLLDGEAMTLKFTHLTESLQVYDPDFYNYLKSQQADDLLFCYRWLLLEMKREFAFEDALRMLEVLWASLPQKAPTAELPLKERDFDPTLEVIIDPPPSSPLVKTPRENAYTKVCAIRRQSSSFSLANIKAPKLSTIRQSNHSLDENATRKILNNSSLIHAKEFQSLDDAALQKQKTIKDNVENEEKLKTDKDGKDSQVKVKRCMKSVPEHGRVDTQEPNTVWSSTGNLPNGVTLSDTKDTPLGSQIRLLRDKISVHNNRFFSSLDALEGPNSSNASKPTKQVKMIKNLNEFLNFATGNKDAVKPEKLQRTHSVIERKHSKECPKIILTKTSYDESDISSANQRSNRLSKLTKSSFDTNDGSSPDDSQEYHPMTTSMTRELRLELEHLDRQVFGNSYVNRCSMICDSPSDSTSTEDKPAEYADAKGDTVATDAVELRAEVVDPNGTVKKPLDTIKCNARSAEDIYLWENPLHRNTPTTRTTASCPQTPDEQAELDFDGDTGEIFEEHSGKKSITPIRLLRKNHSLEPQEPSRNRRHSMTHSSESDYSEKDPVDSTETVTAKNINQEPLHLKSHKFFSNMSHELENAKKNCESLLSVKQTNLQSVASTINNFQKKCEVFKPPVQKNTTVSTISDNSVKNSISSLPSPAVFGGGNPFLMYLCLTVLLQHRDYIMRNRMDYNELAMHFDKMVRKHNVNRVLNQARQMYAMYLKQQANKTGDVTT-