Monarch geneset OGS2.0

DPOGS215461
TranscriptDPOGS215461-TA2085 bp
ProteinDPOGS215461-PA694 aa
Genomic positionDPSCF300098 - 485683-490697
RNAseq coverage471x (Rank: top 26%)
Annotation
HeliconiusHMEL0083501e-17274.54% 
BombyxBGIBMGA007491-TA0.073.99% 
DrosophilaReps-PA9e-6443.80% 
EBI UniRef50UniRef50_D6WXV24e-10840.41%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WXV2_TRICA
NCBI RefSeqXP_974466.18e-10940.41%PREDICTED: similar to AGAP008180-PA [Tribolium castaneum]
NCBI nr blastpgi|910891931e-10740.41%PREDICTED: similar to AGAP008180-PA [Tribolium castaneum]
NCBI nr blastxgi|1947612381e-10434.78%GF14230 [Drosophila ananassae]
Group
Gene OntologyGO:00055152.8e-27protein binding
GO:00055098.7e-27calcium ion binding
KEGG pathwaygga:4246333e-11 
 K12472 (EPS15)maps-> Endocytosis
InterPro domain[259-353] IPR0002612.8e-27EPS15 homology (EH)
[244-354] IPR0119928.7e-27EF-hand-like domain
Orthology groupMCL14798 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215461-TA
ATGGAGGATCTCAATTTGACAGAGACCGAAATGAGATATTTCGGTGATTTGTTTTTGTGCTGCGACGAGGAATCCAACGGGAAAATACCGATTTTAAAGGCGACTGAACTTTTTAGATCATCCAACGTATCCAATGATGTATTACGACAGATCATGGATATTAGTGTAGCGCCTAATACATGCACGTCATTAAATCATATGAATAGAAAGCAATTTTATTCAGCTCTCAAATTGATAGCCGCTCATCAAACAAATATGTCCCTAAGACCAGAACTTTTATCAACTCCCTTAGATTTGCCATTGCCAAGATTTACATGGGCACTAAACTGTGATGCCAATGCTGATCTCATACAGTTGTCTAATTCACCGAAGGAACAGCATATATCGAAACGAGACAGAAATTTTGGTAGTATCGCTGCATACGAGCCAATAAGAGTGCCATCAAACCTCTCTGATAGCGACGCTCCACAAACAATGTCTCATGATAATACAGAAGCTTTAAGTACTGACTCAGAGATGGAGTCGGAGACATTGTCACAAAGATCTGGTTCACGTGGGAGACGTGTGAAGGCTGGTTCGCCGTGGAGTACGGCCAGTGAGAGTCCGACGCCCACTAACAGTGTAGCAGAACGTGTTCATCCTGTGTGGGAACACAGCGCTACAGGACGAGGGGTTTGGCCCACTAACACCACTGAAGAGCACACTCGCTTATTAGGTACGGAGGAAGAATCTTCTGACCGTCACTCCTCGGAGGAGGAGGCTGATGATGCCGATGTTTGTTCTATGAGTGAAGCCCAAGCTAGACATTATGCTGCACAGTTTGCACAGCTGCGACCTGAAAGGGGTATGCTGTCTGGACAAACGGCCAGACTTTTTTTCGAAAAATCTCGCCTTTCGGTTTCCGATCTGAGGAAAATTTGGCAACTGTCTGACATAACTCAGGACGGGATGCTCAGTCTAGAAGAGTTCAGTATAGCGATGCATCTGATCGTTCTACGGAGAAACAATATACCAGTACCAGACGTGTTGCCGGCCTGCCTCGTGCCGACCGTCGACTCGCATTTCACACAACGCGCTGTCACTACGGACTTAGTGGATCTCGGCTCGGACATGTTCAATTCCGGGACCTCGGCTGACTTCAACTTCGCACCCAAACCGGAAATCAACGACCCGTACGAAACCAAACCGGCCAAAAAACCGGAAGACCGCCAGCCGTCATCTAGTCCACCGAAACCGGAACATCAGATCAACAACAAGGAGTGGACCAAATTTGTCGATTCTCCGACCTCTAGCGTATCCAGCCCCGGGCCGAAGCCCGTCAATTTTGACTTTCACAAGTCTGCTCTGGAGCGGGACCCGAAGATCTTCCATCCTGTAGCGTTGAGAGTAACTCCCGAGTCGACCGCCCTCCCGCACGACACTGACACGCGCTCCAGCACGTCCCCACGACGAGACGACTTCGTGCACGCCTTCGAACTCGCCAGCCCCAAACGATTCAACTCACAACCACCAGAACAACCCAACGGCAATTCCGAGATCAAGTCCATACAACGTCCACAGCCCAAAAAGCCGATGAAGAGCGCGGGCGTACTGCCCCCGCCGCCCGCCCGCGACTCCCTCCCGCACACCGAGGATCCGGGAACATCAGCCGGGCCGCAGTCTCTTCAATACGCACCACGAAAGGAACCCCCGCCCCCGCCCCCGCCGCGGCCCCTCAGGAACCACGCTCGCTCCAGTTCCTTGGACCTGAACCGCCTGAAGGGCCAGCCGTGCCAGTCCGCGGCTCCACCCCCTCAGCCTCCGCCGCGAATGTCTCCCTCCGCGGTCCGTGACTACGAGCCTGATGGTTTCGGTTACAATCGCGACGACGCTCCCAAAATGCACGGTGCCTTCGAAGTTTACCGCAAGCCGTCGAGGGAGCCGTCCCGCGAGGGCGACGACCCCGACGTCCGCGCGCTCCACGAACAGAACTCGGTGCTCCACCGAGTGTGCCGCGCGCTCGCACACGAGCTGGCCGACGCCCAGCGTGAGAAGGAGGCGCTGCGAGTGCGTCTGCAACCGCCTCAGACGACGCCGCAGACCTAG

Protein sequence:

>DPOGS215461-PA
MEDLNLTETEMRYFGDLFLCCDEESNGKIPILKATELFRSSNVSNDVLRQIMDISVAPNTCTSLNHMNRKQFYSALKLIAAHQTNMSLRPELLSTPLDLPLPRFTWALNCDANADLIQLSNSPKEQHISKRDRNFGSIAAYEPIRVPSNLSDSDAPQTMSHDNTEALSTDSEMESETLSQRSGSRGRRVKAGSPWSTASESPTPTNSVAERVHPVWEHSATGRGVWPTNTTEEHTRLLGTEEESSDRHSSEEEADDADVCSMSEAQARHYAAQFAQLRPERGMLSGQTARLFFEKSRLSVSDLRKIWQLSDITQDGMLSLEEFSIAMHLIVLRRNNIPVPDVLPACLVPTVDSHFTQRAVTTDLVDLGSDMFNSGTSADFNFAPKPEINDPYETKPAKKPEDRQPSSSPPKPEHQINNKEWTKFVDSPTSSVSSPGPKPVNFDFHKSALERDPKIFHPVALRVTPESTALPHDTDTRSSTSPRRDDFVHAFELASPKRFNSQPPEQPNGNSEIKSIQRPQPKKPMKSAGVLPPPPARDSLPHTEDPGTSAGPQSLQYAPRKEPPPPPPPRPLRNHARSSSLDLNRLKGQPCQSAAPPPQPPPRMSPSAVRDYEPDGFGYNRDDAPKMHGAFEVYRKPSREPSREGDDPDVRALHEQNSVLHRVCRALAHELADAQREKEALRVRLQPPQTTPQT-