Monarch geneset OGS2.0

DPOGS207029
TranscriptDPOGS207029-TA3174 bp
ProteinDPOGS207029-PA1057 aa
Genomic positionDPSCF300001 + 1554786-1566672
RNAseq coverage99x (Rank: top 61%)
Annotation
HeliconiusHMEL0094270.084.74% 
BombyxBGIBMGA012971-TA0.084.05% 
DrosophilaCG1695-PB0.049.69% 
EBI UniRef50UniRef50_B0X2Y10.053.86%Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0X2Y1_CULQU
NCBI RefSeqXP_001864003.10.053.86%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700563800.053.86%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|3479694460.053.67%AGAP003198-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00050971.5e-42Rab GTPase activator activity
GO:00056221.5e-42intracellular
GO:00323131.5e-42regulation of Rab GTPase activity
KEGG pathway 
InterPro domain[495-950] IPR0001951.5e-42Rab-GAP/TBC domain
[55-129] IPR0040123.4e-21RUN
Orthology groupMCL12685 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207029-TA
ATGAAAACGTTCACAAAAAATTGCCCGGAAGCGGCTTTAGTATCAGCTCGTGTTCTAGCAGCGGAAGGTGCCCCTGCAACTCGATCAGCTTCAGGAGTCGAGAGACGACCTTCAGCCCCTCGACCACCTCTAAATAAGAGAGGCTCTGGGTCCCTGCTTTCACAATCACCACCACAGTCCAACAGTATTCTCCTCCTCCTCCTCCACAGGTACCTCTGGATACGGATAGCTCTCTTCGAACGTCAATTGTCGAAGATAATAGAGCATCTGGTGAACAATGCAAGCAGATATTACGAAAGAGATGCACTGGTAGCGGATCCCGATTACGGCAGTATCTTGAGCTCTCTGCTGGTTGGACCTTGTGCTCTCGAATATTCAAAGGCCAAAGCACCGGCATGTTTCTGGACAGACCCACCAGCAGACGAACTTGTTCAGAGGCATAGAATGTCAGCAGGAACAACAACTCCACCATCAGTACGAAGACCCATACTAAACTTCAGAAGGAGTCTGAACGCATCTTCGGAGGACAGTGGAGCTGTTAGCTCAGCCTCTGGCAATACTTCAAGTTCTACTGCCAAAGATTATGTGGAGAGCTTACACCAAAATTCGAGAGCTACGCTTTTATATGGCAAAAACAATGTACGGGTTCAACCGAAAGATGTAGAAGAGCCCATGCCAGGTTATTTGAGCCTGCATCAGACAGCAGCTGGTCTCGTCATAAAATGGACGCCTAATCAGTTGATGAATGGATACGCTGAGAGCGAAGGTGTCGACAAAAGTGTATACTGGGCGATGTCCCTCCAAGTCTGTGTGTGGGAGGTCGTGTACGTTCATGTACATCGGTCACAGGCGAGTGATGCCCTCATACTTGTTGGACAAGACGGCGTCCAACGTCCACCGATACAATTCCCCAAAGGAGGACATCTTCTATCATTTCTAAGCAATTTAGAAACTGGCCTTCTACCACACGGACAATTGGATCCGCCGTTGTGGTCACAAAGGGGAAGCGGAAAAGTGTTCGGACGTGGTAAAATCCGTAGACGACCGATGCCAAGTCTTTGCGAGTCCGGTGAATCCGAAGAACCGGATTGGGACGAGGCCGCGGGAGATTACGTGTTTCGTATTGTTAATAACGCACTCACTGACAGAGAAGCAATACGTCATTCTTTATTGGAGCGTGTGATACAATCTCCTCCAAGGACGCCTCGTAGACCTTTGGCCTCTACGTCAACAACTTCATCAGATTCGTCCACCATGTCCGTAGATTGTCCACCTCCTTCTGTGAATGGGATCGTAGCTACGCCGGCTAATAATTTCGTAGAACAGGCTGCATCTATAGAACTCGTCTGCAGTACTATGCGTCGTCAAATCATATCCAGGGCGTTCTACGGATGGCTAGCCTACTGTCGCCATTTATCAACAGTACGGACACATCTCAGCGGTCTTGTTCATCCCAATATAATTTGCAGAGAAGGCACAGAGAACGGCCTTACTGCGGAACTTTGGAATTCCATGATGGATAATAAAGGTGCGGTGACTGATAAGGACGAAGTTTACCGTGTAGTATACTATGGAGGGGTACAACACGACATACGACGAGAGGTTTGGCCGTATCTCCTCGGTTACTATGAGTTTGGTTCAACAGCAGAAGAGCGAACCGAACAAGACGCAGCTTACCGCCGTCAGTATGAAACAACAATGTCAGAGTGGTTATGTGTTGAGGCGATTGTTCGACAGCGTGACAGGGAAGCCACAGCCGCGTCCATAGCGCGACTTAGTGAAGCCTCGGGAAAACCACGACCTGAAGCCAGTGAGGCTAATGAGGTATTTGAAGACGATTGCAGTGTGATTTCTGATAATCCTCCAATAGTTTCCCCTGAACCAAGCGAAAATGAACAAAAGAAGCCTGAATCTAAACCGGTGCTATCACGAGCGCCTAGCATAGACGAAGTGGAAAACATAGAAATGGACTCAGAGGAGAAAGGAAAAGACGATGTAACTTTAAACGGAGAGACTGAGACAAGTGATAGAGATATTGCTGATGATAGTTTCACTGAGCTAGAAGAAGACGACGAAGATGTCTACGCTAAGAGTGTAATCGAAAATCCAGAGATGAGGAGGGAGAGCATAGCGGAAATCAAGAGATTAGCTGAAGGTATCGTACAAAAGAGCGAAGGGAGAGGTCTGCTTTGCAGTGTTGATAGTGCCAATATAAATATTAATGGTAATGATAAAGTGGATGAAAACCAAACAACAAGACAGGAACAGCAACATACTAGCGTGATTATAACAAATCCCTCCGTTGACTTGGTGCCATCTGGTTCACCTGCGTCCGGTGGTACTACAGTTAACGTAAGTCCAGCTCGTAGTCCATTGGGTGTGGTCCGTGAGGAGTCTCAGTCCGCGGGAGCATCCTTCGACACTTTAGAGCCAAGTGACGCTCGTCCTGACAACAGATCCGACTGCGTATCACCAGCTAGCTCTAATGGAGGGGTCTATTCGAATGAATTGGTTGAGAGTTTCGCGTTAAATCTTCACCGCATCGAAAAGGATGTTCAGAGATGTGACCGAAATTATCCTTTCTTTAATGACGAAAATTTAGATAAACTGAGGAACATTATGTGCACGTATGTGTGGGAGCATCTGGAGACAGGTTACATGCAGGGCATGTGTGACCTAGCGGCTCCTCTGCTGGTTGTAGTCCGCGAGGAGGCGGCCGCCCACGCCCTGTTCACACAGCTGATGACACGCGCTAGAGACAACTTCCCATCGGGACAGGCCATGGATGCACATTTTGCAGATATGAGGTCTTTGATACAAATTCTCGACTGCGAGCTGTACGAGTTAATGCACGCCCACGGCGATTACACTCACTTCTACTTCTGCTACCGCTGGTTCTTGCTGGACTTTAAGAGAGAACTTTTGTACCAGGATGTGTTCTCCGCGTGGGAATTGATCTGGTCAGCTCGTTACGTGTCCTCTGAGCACATGGTGCTGTTCTTAGCGCTGGCGCTGCTGGAGACCTATCGTGACGTCATCCTCGCTAATGCCATGGACTTCACGGACATCATCAAGTTCTTCAACGAGATGGCCGAGCGTCACGACGCTGCCGCCGTTCTATCACTAGCAAGGGATCTCGTCCTTCAAGTTCAAACTCTAATAGAGAACAAATAA

Protein sequence:

>DPOGS207029-PA
MKTFTKNCPEAALVSARVLAAEGAPATRSASGVERRPSAPRPPLNKRGSGSLLSQSPPQSNSILLLLLHRYLWIRIALFERQLSKIIEHLVNNASRYYERDALVADPDYGSILSSLLVGPCALEYSKAKAPACFWTDPPADELVQRHRMSAGTTTPPSVRRPILNFRRSLNASSEDSGAVSSASGNTSSSTAKDYVESLHQNSRATLLYGKNNVRVQPKDVEEPMPGYLSLHQTAAGLVIKWTPNQLMNGYAESEGVDKSVYWAMSLQVCVWEVVYVHVHRSQASDALILVGQDGVQRPPIQFPKGGHLLSFLSNLETGLLPHGQLDPPLWSQRGSGKVFGRGKIRRRPMPSLCESGESEEPDWDEAAGDYVFRIVNNALTDREAIRHSLLERVIQSPPRTPRRPLASTSTTSSDSSTMSVDCPPPSVNGIVATPANNFVEQAASIELVCSTMRRQIISRAFYGWLAYCRHLSTVRTHLSGLVHPNIICREGTENGLTAELWNSMMDNKGAVTDKDEVYRVVYYGGVQHDIRREVWPYLLGYYEFGSTAEERTEQDAAYRRQYETTMSEWLCVEAIVRQRDREATAASIARLSEASGKPRPEASEANEVFEDDCSVISDNPPIVSPEPSENEQKKPESKPVLSRAPSIDEVENIEMDSEEKGKDDVTLNGETETSDRDIADDSFTELEEDDEDVYAKSVIENPEMRRESIAEIKRLAEGIVQKSEGRGLLCSVDSANININGNDKVDENQTTRQEQQHTSVIITNPSVDLVPSGSPASGGTTVNVSPARSPLGVVREESQSAGASFDTLEPSDARPDNRSDCVSPASSNGGVYSNELVESFALNLHRIEKDVQRCDRNYPFFNDENLDKLRNIMCTYVWEHLETGYMQGMCDLAAPLLVVVREEAAAHALFTQLMTRARDNFPSGQAMDAHFADMRSLIQILDCELYELMHAHGDYTHFYFCYRWFLLDFKRELLYQDVFSAWELIWSARYVSSEHMVLFLALALLETYRDVILANAMDFTDIIKFFNEMAERHDAAAVLSLARDLVLQVQTLIENK-