Monarch geneset OGS2.0

DPOGS205702
Transcript        DPOGS205702-TA  1752 bp
Protein           DPOGS205702-PA  583 aa
Genomic position  DPSCF300250 + 335-5191
RNAseq coverage   409x (Rank: top 30%)
Annotation
Heliconius       HMEL014797        4e-154  65.14%
Bombyx           BGIBMGA009915-TA  1e-162  62.89%
Drosophila       CG4030-PA         2e-59   30.56%
EBI UniRef50     UniRef50_D6WYD4   5e-81   36.88%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WYD4_TRICA
NCBI RefSeq      XP_001122167.1    4e-82   37.23%  PREDICTED: similar to CG4030-PA [Apis mellifera]
NCBI nr blastp   gi|380015844      2e-85   37.21%  PREDICTED: rab GTPase-binding effector protein 1-like [Apis florea]
NCBI nr blastx   gi|380015844      1e-96   37.04%  PREDICTED: rab GTPase-binding effector protein 1-like [Apis florea]
Group
Gene Ontology    GO:0046872  4.3e-18  metal ion binding
KEGG pathway     tca:660676  3e-81
                 K12480 (RABEP1)  maps to Endocytosis
InterPro domain  [504-568]  IPR011011  2.5e-18  Zinc finger, FYVE/PHD-type
                 [501-568]  IPR000306  4.3e-18  Zinc finger, FYVE-type
                 [502-567]  IPR013083  1e-11    Zinc finger, RING/FYVE/PHD-type
Orthology group  MCL13908  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205702-TA
ATGGCTAGCGGTTCCGCAGTTTTGAAAGAAAGCGGAGAAGAAGGTGCTAAAGGTCCTCAGAACGCTAAGCGAGATCTGGAAGAGGAGTTTAACGTGCAAAGAGCTAAAATGAAAGAGTTATTCTTACAGAAAGAAGAGGATCTCCGTCAACTGGTGCTAGAGAAGCAGCAACTGGACTCTGAGGTGTTGGGGCTACGGGAGGAGCTGCAACAACTGCAAACACTCAGTGAGAACCAGAAGTCAGAGATACAGAGCTTACAGATGTTAGTCAGTGAGACAGTGGAGGCATCTTCATCGGGCTCCGAGGAGGTGAGGAGGCTCCGAGCTCGGAACGTCGACTTGGAACAACAGCTCGCACAACTTAGACAACAACAGGAGCTTCCCCTGGCGCCCGCCACCTTCGTGCGGTCGTTGGCGCGCAAACTGGGAGCGGAGCCGGAGGAGCCACCGCCGCCCAGGAAGGCCGAGGACGAGCTGCTGCAGAGCATCATACAGCCGCTCGAGCTGGAGATCGGCGCGCTCAAGAACAAGCTGCGGGAGAACGACGCTCTGCTACAGGATGCTCTGAAATCCAAAGTAGCAGCTCCCAACACCTCGGGGGCCGTGGCCAGCGGCGGCGACTCTAAACCTGAAACGGAGAGCCGCGGCTGTGACATGTGCGCCAACTACGAGAGACAACTGGTCGCCGAGCAGACACGCGCAGACCACGCACGAGATAAAGCGAGGAAATTTGAACTTTCGCTCAAATTGGCCACAGAGGAGTTAGAGGGCGTCCGCAGCGTCCATGACGAGACGACGCGCGCCTGGCAGGCCGAGAGGACAGAGGGAGGGGCGCGACTCGCGGACCTGCAGCGGGCCCTCGACCACGCCAAGGAACAGATAGCACAAAGGAGCGAACAGGCCGACCGGGCCTCGCGACAGGCTCTCCACAACGTGACGGCGCTCACGGTCGCCAGGGAGACGCTACAGGGGAAGCTGGACGAACTAGAAAGGGAAAATGAGATGCTCGTAGGACGATATCTCCGGAAGGCCGCAGAGATGGAGAGCGAGGTCATCGACCTGCCGGACGACGTGCCCGCGCTGCAGGAGAAGGCGATCCAGCTGCACGAACAGCTGCTCGTCTGCCAGGTTGGGCGAGAGAGGGCGCTGGAGGACGAGGAGGAACTACGAGCGCAGCTACAGCAGCACGCCGCCATGTTGCACCGGAGAGAAGACGAGCTGGCCGAGCAGGGAGCCAGGCTCAAGGACGTCGGCAAAGAGCTGGATCGGCTGCAGACTGAGCACGAGCAGATGACGGAGCTGGCGGACAAGCTGAGGCAGTCCAACGACACCATCGAGAAGCTGCTCGAGGATAAAAAACGTCTCCAGAACGAGGTGAGCGAGGTGCGCACGAGGGTGTGCGTGTTACAGCAGGAGCTGGACAACAGTGAGAAGGTGCAGCAGGACTTCGTGCGACTGTCGCAGAGCCTGCAGGTCCAGTTGCAGAGGATACGGGAAGCCGACTCCGAGGTGCGCTGGCAGCACGACGAGGACGTGAGCGAGTGCCCCGCGTGCCGCACGCCGCTGCCCACCAACAAGAAGAAGATCCACTGCCGCCACTGCGGCCGCATCTTCTGCGGGCCTTGCGTGAGCCAGGTGGTGGCGAGCGGCCCGCGCGGTCTGCCAGCGCGCGTGTGCTCCGTGTGTCGCACGTTGCTGCAGCCGCACGCCGCGCCCTACTTCAGCACCCGCCCGCCCAACTCGCCCGACTAG

Protein sequence:

>DPOGS205702-PA
MASGSAVLKESGEEGAKGPQNAKRDLEEEFNVQRAKMKELFLQKEEDLRQLVLEKQQLDSEVLGLREELQQLQTLSENQKSEIQSLQMLVSETVEASSSGSEEVRRLRARNVDLEQQLAQLRQQQELPLAPATFVRSLARKLGAEPEEPPPPRKAEDELLQSIIQPLELEIGALKNKLRENDALLQDALKSKVAAPNTSGAVASGGDSKPETESRGCDMCANYERQLVAEQTRADHARDKARKFELSLKLATEELEGVRSVHDETTRAWQAERTEGGARLADLQRALDHAKEQIAQRSEQADRASRQALHNVTALTVARETLQGKLDELERENEMLVGRYLRKAAEMESEVIDLPDDVPALQEKAIQLHEQLLVCQVGRERALEDEEELRAQLQQHAAMLHRREDELAEQGARLKDVGKELDRLQTEHEQMTELADKLRQSNDTIEKLLEDKKRLQNEVSEVRTRVCVLQQELDNSEKVQQDFVRLSQSLQVQLQRIREADSEVRWQHDEDVSECPACRTPLPTNKKKIHCRHCGRIFCGPCVSQVVASGPRGLPARVCSVCRTLLQPHAAPYFSTRPPNSPD-
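
The reported lengths are consistent with a complete open reading frame: 1752 nt is 584 codons, that is 583 residues plus one stop codon, which the protein record writes as a trailing "-". The Python sketch below checks that arithmetic and translates only the first and last 15 nucleotides of DPOGS205702-TA. The function names and the partial codon table are illustrative assumptions, not part of the record or of any MonarchBase tool, and the table covers only the codons that occur at the two ends of this transcript.

```python
# Partial codon table: only the codons found at the two ends of DPOGS205702-TA.
# This is an illustrative subset, not a full genetic code table.
CODON_TABLE = {
    "ATG": "M", "GCT": "A", "AGC": "S", "GGT": "G", "TCC": "S",
    "AAC": "N", "TCG": "S", "CCC": "P", "GAC": "D",
    "TAG": "-",  # stop codon, written as "-" in the protein record
}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string codon by codon."""
    if len(nt) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


def orf_lengths_agree(transcript_nt: int, protein_aa: int) -> bool:
    """True when transcript length equals 3 * (residues + one stop codon)."""
    return transcript_nt == 3 * (protein_aa + 1)


FIRST_15_NT = "ATGGCTAGCGGTTCC"  # first 15 nt of DPOGS205702-TA
LAST_15_NT = "AACTCGCCCGACTAG"   # last 15 nt of DPOGS205702-TA

print(orf_lengths_agree(1752, 583))
print(translate(FIRST_15_NT))
print(translate(LAST_15_NT))
```

The two translated fragments can be compared by eye with the start (MASGS) and end (NSPD-) of DPOGS205702-PA. Translating the full transcript would need a complete codon table, for example the one shipped with a sequence-analysis library.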