Monarch geneset OGS2.0

DPOGS210526
Transcript: DPOGS210526-TA (3678 bp)
Protein: DPOGS210526-PA (1225 aa)
Genomic position: DPSCF300186 + 270115-286962
RNAseq coverage: 1199x (Rank: top 11%)
Annotation
Heliconius       HMEL016345         6e-68    48.51%
Bombyx           BGIBMGA012628-TA   3e-87    57.43%
Drosophila       Eps-15-PA          5e-75    62.04%
EBI UniRef50     UniRef50_F4WAX4    2e-142   41.19%   Epidermal growth factor receptor substrate 15-like 1 (Fragment) n=7 Tax=Coelomata RepID=F4WAX4_ACREC
NCBI RefSeq      XP_967469.1        2e-146   43.77%   PREDICTED: similar to GA14224-PA [Tribolium castaneum]
NCBI nr blastp   gi|91094107        3e-145   43.77%   PREDICTED: similar to GA14224-PA [Tribolium castaneum]
NCBI nr blastx   gi|380018159       3e-168   37.94%   PREDICTED: LOW QUALITY PROTEIN: epidermal growth factor receptor substrate 15-like 1-like [Apis florea]
Group
Gene Ontology     GO:0005515   1e-39     protein binding
                  GO:0005509   4.1e-32   calcium ion binding
KEGG pathway      tgu:100226046   2e-79
                  K12472 (EPS15) maps-> Endocytosis
InterPro domain   [332-426]   IPR000261   1e-39     EPS15 homology (EH)
                  [330-422]   IPR011992   4.1e-32   EF-hand-like domain
                  [1-91]      IPR012336   3.1e-21   Thioredoxin-like fold
                  [88-198]    IPR010987   1.3e-14   Glutathione S-transferase, C-terminal-like
                  [131-192]   IPR004046   8.1e-07   Glutathione S-transferase, C-terminal
Orthology group   MCL10780   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210526-TA
ATGAATGAGAGCAAATTTATTCTTTATGGTGATGAGAAATCACCTCCCGTAAGATTCGTTTTAATGACAGCTTCAGTCTTAGGAGTTGATTTACACTTTCAAAAAGTAGACCTATTTAAAAATGAACATAGAAGCGAATCTTATAGAAAGATAAATCCACTTCAAAAAGTACCAGCCATGGTTACAGAGAGTTCTACAATTTGTGATAGTCATGCTATCTCACTGTATCTTTGCGAAGTGGCGGGTCTACACGGGTTGCAACTTTACCCTGCAGATATTCTTACACGATCTGTTATTAATGAGCTGCTGTTTTTTAATTCCAGTACTTTATTCAGACTTGATAGTGAAATAATGACTATGTTTTTTGCTGGAAATTGGCCTCCCACTGAGGTGAAAATGGAAGAATGGAATAAAGCTTTAGATTATCTTGAACATCGTTTAAAAAAAACGTCATGGCTGGCTGGTGAAAAAATGCTATTATGCGATATCTGTGCCGTGACTGTGGTGACATCTGTACTTCCACTTTTTCCACTAACAAAAAGACATTTTAAAGTTAATAAATGGATTAGTCAGTTTGAAAATGTTGCTTGTTATGGAATAAATAAACGCGAAACTCGTATCAAAATGTGGATTGAACCATTGCAGGTGGCAGGGGCCCACAGTTCAATATACGAAGCATATTATCATCAAGTTGACCCGAATGGCTCGGGAGCGATCCAAGCTCTGGATGCAGCACGTTTCCTTAAGAAGTCTCGTCTCAGTGATGTTGTGCTCAGTAAGATATGGGACCTCTCGGACCCCACCGGCAAGGGCTATCTCGATAAAGCGGGGCTGTTCGTGGCCCTGAAGCTGGTGTCCCTGGCCCAGGCGGGTAAGGAGATCAATATGAGCAACATACACTCGGAAGCGCCCCCTCCTAAAGTGGGCGAGTTACCCAAAGTGCCGCCCCCGTCGTTGCCGCCCGCGGCACCCCCGGCTCTCGCTCCACTTGGCGACTGGAGCGTCAAGCCAGCGGAGAGGGACAAGTACAGCGCTCTGTTTGACTCGCTTCAACCGAACAATGGCGTCATACCAGGGAATAAGGTGAAGGGAGTCCTCATGGAGTCGAAGCTTCCCCTGGAGACCCTCGGGAAGATCTGGGACCTCGCCGACCAGGACAAAGACGGCATGCTCGATAGACACGAGTTTATTGTGGCCATGCACCTAGTGTACAAAGCATTAGAGAAACATGCAGTGCCCACAACGTTGCCGCCGGAGCTGCGCGCTCGCCCCGCCAGGCCCCCCTCCAGGCCGCCGTCGAGGCCACAGACTCGCCCCCCACCACCCCGGCCGCAGCCCCCTCCCCAACAGTCGAACGCTACCTTACTGGAGGGACTCCTGGATCTGTCCAGCCCGCCGAGCGCACCCCCCGCGGCGGGCCAGGCCTCCGGGCCGTGGATGACTGCGGCGGAGCGCAGCCAGTACGACGCGCAGTTTGAGGCGGCCGACCTGGACCGCGACGGGTTCGTGTCCGGCGCAGAGATCCGCGGCGTGTTCCTGGACAGCGGACTGCCGCAGATGACGCTCGCCCAGATCTGGTCGCTGTGTGACCAGTCAGGGTCGGGCAAGCTGTCCGTGGTGCAGTTTCGCGCCGCCATGTGCCTGGTGCAGAGAGCCCTGCGCGGACATCCGCCGCCCGCCGCCCTCCCGCCGCACCTCATGGAAGAACACTCTCTACCACCAGCACAGTCGCGAGTCTGTCAGACCGTGCAGATAGTGGTCGCGGTGTTGTTGTTTTTATCACCGAGGGTTATCGCAGCGCGCGGGCCGCACGGCCTCGTTAATGAGAAGGCGAACTCGGTAGGAGCGGGTGATCGGTCGCCCGCCCTGTTCAAACCCGAGCCGGTCAGTCTGGGCCCTCAACCCACGCCCGAGATGGATGCCATAGCTCGTGAGGTGGACGCGCTGGCCAGGGAGAGACTCGCGCTGGAGGCGGAGCTCACCAACAAGCAGAGAGAGGT
CGCGGTGAAGACGGGCGAGGCGGACAGCTTACAGAGCGAGCTGGACACGCTCACGGCCACCTTGAAACAGTTAGAGAACCAAAAAGGAGAGGCGCAGAAGAGGCTGAATGACTTGAAATCCCAAGTGGATAAGCTTCGGTCTCAAGTGTCAGCTCAGGAGGCCGCGGCGGTGGAGACGGAAGCGGAAGTGTCCGCCCGGCGAGCGGCGCTGCTCGGGCGGCAGCAGCATGAGCAACGGCTCAAGGACGAGTTGGAGCACGAGAACCAACGCGTGGAGCAACTCACGGGGCAGCTGTCGGCCAGCGTGCTCGCCGTCAGCCAGGCCAGGATCAAGCTGGAGCATCTGGAGCAGCAGCACTCGGCGCTGGAGGCCGCGCTGCAGGCGCTGGACGCCGGCTCCGGGGACCTCGCGCTGCAGCCTCTGCATCGACACGAACACCTCGAGCGGCTGGTCAGGGGGTGCACGCCGAGCGACGCGCTGGTACGTGCAGCCTCGGCTGCTAGATCACCATCAACCCAACAGGAGGAGGATGGGGAGGGCCGCGGGGAGAGCGGGGGGAGCGGGGGCATGATGAACGGGTCATTCGCCAAGTTCGAGGACTCCTTCACCCACAACGGGGACCCCTTCGCGCCCTCGGACAGCAGGAACACTAAGACCGAACTGTTCTTCTCAGTATATATGACTTGTCGTATCTGCCCTCAGTCGTTCGCGTCGTTCTCCCCGGGCGCCGACCCGTTCTCCGGCGACTCCTTCGTCCAGTCCGAAGCCCCGGCCGCTAACGACAGCGCCTGGGAGTCCGATCCCTTCGCAGTGCTCCACGCGCCGACCCGCGCGACGGCCAACGCCAGCAGCACGCCCGCCGCGGCCAAGAGCCACAAGACCCCGCCCCCGCGGCCCGCCCCCCCTCGGCCCCTGCCGCCACACAAGTGCGTCCTCTCCCCGACGATCTCCCCCTCACCCGATCCCCTCCCTCACCCCCCCAACCCCCCCTCGCCCCCACACTCACAGTTTGTGTTCGCTCAGAACGACCGCAAACCCGCCCTCGACTTCACGGAGGACCCCTTCAAAGACTACAGATACGAAGATCCCTTTAACATCGACGACCCCTTCGCGGACATCGCGGACCCCAAGAAGGAGCCCCGAGCGCGGCCCGCCTCCGCCGCCGCTTTCCCAGCTGCCTTCCCCGCCGCGGTCAACGGCCGGGTGTCCGCGCCGCCGCTGCCCTACGACCCGTTTGCCTCCAGGGACAGGACGGACTCGTGGGCCGTCTGGCCCGAGGACGACTGGACCCCTCGCGCCGGGGATTCGCAGGGCAAGGACGACTGGGCGGCCGACTGGGATCACAACGCGAACCCCTCCACCACGCACAGACCTAACGACACCTGGCCCACCACCACGCTGCCGGCCAAGAAGGAGAAGTCTCCCAAGCCGGTGAAGTACGCGAGGTCGCTGGTGCACACGATCGGCGGCATCGGCAGGTCCAGGCACAAGGACAAGAAGGGGAAAGAGACCAAGGAGGTCAAGGAGGTCAAGGACACCGGCGATCTGTCGGAGGAGCAGCAGTGGGCCTGGGCCGAGGCCGAGTCGCGCCGTCTGCAGCGGGAGGCGGACGAGCGACGGCGGCGGGAAGAGCGTGAGCTGCAGCTGGCCCTGGCGCTGTCCCGGACCGAACAGTGA

Protein sequence:

>DPOGS210526-PA
MNESKFILYGDEKSPPVRFVLMTASVLGVDLHFQKVDLFKNEHRSESYRKINPLQKVPAMVTESSTICDSHAISLYLCEVAGLHGLQLYPADILTRSVINELLFFNSSTLFRLDSEIMTMFFAGNWPPTEVKMEEWNKALDYLEHRLKKTSWLAGEKMLLCDICAVTVVTSVLPLFPLTKRHFKVNKWISQFENVACYGINKRETRIKMWIEPLQVAGAHSSIYEAYYHQVDPNGSGAIQALDAARFLKKSRLSDVVLSKIWDLSDPTGKGYLDKAGLFVALKLVSLAQAGKEINMSNIHSEAPPPKVGELPKVPPPSLPPAAPPALAPLGDWSVKPAERDKYSALFDSLQPNNGVIPGNKVKGVLMESKLPLETLGKIWDLADQDKDGMLDRHEFIVAMHLVYKALEKHAVPTTLPPELRARPARPPSRPPSRPQTRPPPPRPQPPPQQSNATLLEGLLDLSSPPSAPPAAGQASGPWMTAAERSQYDAQFEAADLDRDGFVSGAEIRGVFLDSGLPQMTLAQIWSLCDQSGSGKLSVVQFRAAMCLVQRALRGHPPPAALPPHLMEEHSLPPAQSRVCQTVQIVVAVLLFLSPRVIAARGPHGLVNEKANSVGAGDRSPALFKPEPVSLGPQPTPEMDAIAREVDALARERLALEAELTNKQREVAVKTGEADSLQSELDTLTATLKQLENQKGEAQKRLNDLKSQVDKLRSQVSAQEAAAVETEAEVSARRAALLGRQQHEQRLKDELEHENQRVEQLTGQLSASVLAVSQARIKLEHLEQQHSALEAALQALDAGSGDLALQPLHRHEHLERLVRGCTPSDALVRAASAARSPSTQQEEDGEGRGESGGSGGMMNGSFAKFEDSFTHNGDPFAPSDSRNTKTELFFSVYMTCRICPQSFASFSPGADPFSGDSFVQSEAPAANDSAWESDPFAVLHAPTRATANASSTPAAAKSHKTPPPRPAPPRPLPPHKCVLSPTISPSPDPLPHPPNPPSPPHSQFVFAQNDRKPALDFTEDPFKDYRYEDPFNIDDPFADIADPKKEPRARPASAAAFPAAFPAAVNGRVSAPPLPYDPFASRDRTDSWAVWPEDDWTPRAGDSQGKDDWAADWDHNANPSTTHRPNDTWPTTTLPAKKEKSPKPVKYARSLVHTIGGIGRSRHKDKKGKETKEVKEVKDTGDLSEEQQWAWAEAESRRLQREADERRRREERELQLALALSRTEQ-
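
The lengths listed for this record can be cross-checked against each other. The sketch below is illustrative and not part of the database. It assumes the transcript is a complete coding sequence ending in a single stop codon (the nucleotide sequence starts with ATG and ends with TGA, and the protein ends with the stop marker "-"), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3678
protein_aa = 1225

# Assumption: DPOGS210526-TA is the full CDS, so 3678 / 3 - 1 should equal 1225.
consistent = expected_protein_length(transcript_bp) == protein_aa

# Span on scaffold DPSCF300186 covered by the gene model (exons plus introns).
span_bp = genomic_span(270115, 286962)
```

Under these assumptions the 3678 bp transcript corresponds to 1226 codons, that is 1225 residues plus the stop codon, which matches the listed protein length. The genomic span is much longer than the transcript, as expected for a gene model containing introns.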