Monarch geneset OGS2.0

DPOGS210932
TranscriptDPOGS210932-TA1329 bp
ProteinDPOGS210932-PA442 aa
Genomic positionDPSCF300045 + 793587-797434
RNAseq coverage659x (Rank: top 19%)
Annotation
HeliconiusHMEL0132941e-12283.95% 
BombyxBGIBMGA003782-TA1e-15380.97% 
DrosophilaCG17883-PA9e-8650.47% 
EBI UniRef50UniRef50_D6WM567e-9748.81%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WM56_TRICA
NCBI RefSeqXP_001653390.12e-10153.74%hypothetical protein AaeL_AAEL001505 [Aedes aegypti]
NCBI nr blastpgi|1571194553e-10053.74%hypothetical protein AaeL_AAEL001505 [Aedes aegypti]
NCBI nr blastxgi|1582968137e-9856.36%AGAP008312-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00050971.5e-23Rab GTPase activator activity
GO:00056221.5e-23intracellular
GO:00323131.5e-23regulation of Rab GTPase activity
KEGG pathway 
InterPro domain[95-287] IPR0001951.5e-23Rab-GAP/TBC domain
Orthology groupMCL13434 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210932-TA
ATGGAATCCATTAATACTGATTCAGAAAGAAGTGATAAATGTAATATAAATGGAGACTGCTCTCCGATAACAGATAATGAAAGCAATAACTTTAATCACGCGAAAAATATTGATGATATCTCTACACCTTCCGAGTTAAAATTTGACGAAGACTTGGAATTCGAGAGCTTCGACATAATTGAAAAACGAAAGGATATAGAAAAGTGTCTTGCTAATCCGGACCTTGTAAATTTGGAGCAATGGCAGAACTTTGCTAAAAGTAAAGGGGGTTTAATTTGTGATAAATATAGAAGAAAGATATGGCCACTGCTAGTTGGAGTAACTCAAGAGGAATTGACTGATCCTCCGTCACTGGATGAGCTCTCTACACACCCTGAATACAATCAAGTTGTGCTGGATGTGAACAGATCCCTCAAACGGTTTCCACCTGGAATTCCGTACGAGCAAAGAGTAGCCCTACAGGACCAGCTTACTGTTCTTATACTTCGGGTGATCATTAAATATCCACACCTCAAATATTATCAGGGATACCACGACGTGGCCATAACTCTCCTTCTAGTATGTGGAGACCGAGCCTCGTTCCCACTGCTGTGTCGCCTGTCGTACGGCTCGGGGGCTCCGCTAGCGCCGTTCATGCAGACCACGATGCAGCCCACGCAGCACCTCCTCAACTACATGCTGCCCATACTGCGACGAGCTGATCATGGCCTGGCTGACTGCCTGGATAAGGCTGGTGTCGGCACGATGTTCGCTCTGCCGTGGTACTTGACGTGGTTCGGTCACAGTCTGAACCGGTACTCGGACGTGGTGCGTCTCTACGACTACTTCCTGTGCGCTCCGCCCCTGTTTCCGGTGTACGTGACGGCCTCCATCGTGCTGCAGAGGGCCGCTGATGTTTATCAGTGTGACTGCGACATGGCCATGATGCACTGCCTTCTGTCCAGGTTACCAGATGACTTACCATTCGAAGACATCCTGGTGACAGCTGAGAAGATATACAACGAGAACGATCCCACAGACCTCGAGGATGAGGTCGCCGCGCTGGAGAAACGAGAAGAGGAGCAGCGCAAGTTGGACGAGGAGCGCATGCGGCGGCGGCAGGAGGAGCGGCGGCGGGGCGCGGGTCGGGGTGCGGCCCGGGGCTGGTGGGTGCTCCGCCGCCCGGCCCACCGCCGCGCCGCCCTGGCGCTCACGGCGGCCCTACTCGGGCGGCCCTGCTCGCCATCTACGTCTACTACCGGCCGGAGCTGTTCAGGTAGCGCCACGACTATCAGTTTGTTCTGTTCCGCTTCTAGCTCCTGGTCACTGTCGGGGGACCGCGCGTCGTGA

Protein sequence:

>DPOGS210932-PA
MESINTDSERSDKCNINGDCSPITDNESNNFNHAKNIDDISTPSELKFDEDLEFESFDIIEKRKDIEKCLANPDLVNLEQWQNFAKSKGGLICDKYRRKIWPLLVGVTQEELTDPPSLDELSTHPEYNQVVLDVNRSLKRFPPGIPYEQRVALQDQLTVLILRVIIKYPHLKYYQGYHDVAITLLLVCGDRASFPLLCRLSYGSGAPLAPFMQTTMQPTQHLLNYMLPILRRADHGLADCLDKAGVGTMFALPWYLTWFGHSLNRYSDVVRLYDYFLCAPPLFPVYVTASIVLQRAADVYQCDCDMAMMHCLLSRLPDDLPFEDILVTAEKIYNENDPTDLEDEVAALEKREEEQRKLDEERMRRRQEERRRGAGRGAARGWWVLRRPAHRRAALALTAALLGRPCSPSTSTTGRSCSGSATTISLFCSASSSWSLSGDRAS-