Monarch geneset OGS2.0

DPOGS207663
TranscriptDPOGS207663-TA2178 bp
ProteinDPOGS207663-PA725 aa
Genomic positionDPSCF300133 + 122868-125045
RNAseq coverage293x (Rank: top 38%)
Annotation
HeliconiusHMEL0026740.092.97% 
BombyxBGIBMGA010524-TA0.083.83% 
DrosophilaCed-12-PA0.053.56% 
EBI UniRef50UniRef50_Q7PVN10.059.67%AGAP009236-PA n=26 Tax=Coelomata RepID=Q7PVN1_ANOGA
NCBI RefSeqXP_395913.20.068.00%PREDICTED: similar to Ced-12 CG5336-PA [Apis mellifera]
NCBI nr blastpgi|3504030280.067.72%PREDICTED: engulfment and cell motility protein 1-like [Bombus impatiens]
NCBI nr blastxgi|3071723640.068.01%Engulfment and cell motility protein 1 [Camponotus floridanus]
Group
Gene OntologyGO:00058563.4e-43cytoskeleton
GO:00069093.4e-43phagocytosis
GO:00055153e-17protein binding
GO:00054881.3e-11binding
KEGG pathwayame:4124560.0 
 K12366 (ELMO1, CED12)maps-> Shigellosis
    Chemokine signaling pathway
    Bacterial invasion of epithelial cells
InterPro domain[302-481] IPR0068163.4e-43Engulfment/cell motility, ELMO
[548-676] IPR0119933e-17Pleckstrin homology-type
[88-247] IPR0160241.3e-11Armadillo-type fold
[101-250] IPR0119892.6e-07Armadillo-like helical
Orthology groupMCL10750 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207663-TA
ATGCCAGCGGTGACGAAAGATTCAACTATTTTGAAAATTGCTGTGGAGATGCTAAATGCACCAGACAAGGTCCCACAACTCATAGAATTCAATCAGAGTCACCCATTGTCCGGAATTATCCAATTATTGTGCAAGGCTTGGGGTCTTCCTGATCATACAAACTACGCCCTACAGTTTTCCGAGAGTAATAACCAAAATTATATTACAGAGAAAAATCGTAACGAAATTAAAAATGGTTCTGTTTTGCGACTCGAGCAATCTCCAGCGAAGACAGTTCAAGATATTTTGGCTAAGATCAATACAGGGACGGAATCAGACCAAATTACTGCCCTCACAAAATTATCCACTGTTAGTAGTGATCTCACATTTGCACTAGAGTTTATCAACAAGAATGGTTTAACGCTTATCGTAAATCACATTGAAACGGGTAAATTTAAGGGCAACTGTTTGAAATATGCACTAGTTACATTTGTGGAATTGATGGATCATGGTATAATATCCTGGGATATACTGGAGAATCAGTTTATTAATAAAGTTGTCAGTTTCGTGAGCAACCAATCCACTACACAGGACCCAAAAATAATTCAGTCATGTTTGTCAATACTCGAAAATATTGTCCTTAATAGCTCAGGAAAGAATTCGCATGTTGAAAAAGAAATAACAATACCTAGTTTGATATCACATTTGGAAAATCAGGACAGGATAATACAGCAAAATGCAATTGCTTTGATTAACGCAATTTTCTCAAAGGCAGATTTGACTAAAAGGAAGACGATAGCTGCTACCCTAAGTTCTAAACAAGTCAGGAATAAGATTTATGAGAATCTGATTACCAAAAATGTTTCCAAAGAAGCTCTCGGCACTGAACTCGCTCATCAACTGTATGTGTTGCAAACACTGATGCTGGGACTACTTGAACCTAAAATGCAAACAAGAGCAGACTCCCAAGAACAAGAATCACAGGAAAAGATAAAGGAACTCAGAAAATATGCCTTCGATAATGAAAACAACATCAGTGCTGAAGTAACAACTAGACGGCAAACAGGAAGCTTATCCAAAGATTTTAAGAAACTTGGTTTCAAATGTGAAATAGATCCAATAAAAGATTTTAATGAGACCCCACCCGGAATATTAGCATTGGACTGTATGCTGTACTTTGCTCGGAATTACAGAGAGGATTATACCAAAATAGTTCTGCGAAATAGTTGCAGGGCTGATGAACAGGAATGTCCCTTTGGTAAGACTAGTGTTGAACTTGTGAAGCTTCTCTGTGACATTTTGCAAATCGGTGAGCCGCCCAGTGAACAAGGACAGACATATCATTCCCTCTTTTTCACCCATGATCATCCATTTGAAGAACTCTTCTGTATTTGCATTGTACTGCTAAATAAAACATGGAAGGAAATGAGAGCTACGACAGAGGATTTTGTTAAAGTTTTAAGTGTAGTAAGAGAGCAGATAAGCAGAGCATTAACGGCGTCTCCGAAGGGTTTTGATAAATTTCGTCAAAAAATTAAAGAATTGACATACAGTGAGATAACTCACTTGTGGCAACAGGAACGCACAAACAGAGAAGTATGGGAGTCACACGCTAGGCCGATCGTTGAATTAAAAGAAAAAATCACCCCCGAAATTATAGATCTCATTCAACAGCAAAGATTAGGAGTTCTGGTCGGTGGAACGAGATTTAAGAAGTACATGAAGATAAATAGGAAGGACAAGTTTTGGTTCGTCCGTTTATCGCCTAATCATAAAATATTACATTACGGCGAGTGTGACGAGAAAAGCACACCGAGTTTAGAGGAACTTGGCACGAAGTTAGCCGTGGCTGATATAAAATGTGTGGTTGTCGGCAAAGAATGTCCTCATATGAAAGATTTAAAAGGTAAGATAAGCAGTCCAAATTTGGCTTTCTCGTTAATACTGAAAGCTGCGGAAGTACCATCTCTAGATTTCGTCGCACCGGATGAGCAGATTTTTGATTACTGGACGGATGGTATCAATGCATTACTCAAAGAGAAAATGACTAGCAAATCCTTTGAAAATGATTTGGAGACGCTTCTCTCTATGGACATCAAGGTCAGATTGTTGGATGCAGAGGGTATAGACATACCCCAGGATCCACCTCAAATACCCCCAGAGCCAGAAGACTATGATTTTTATTATGAAAACAATTAA

Protein sequence:

>DPOGS207663-PA
MPAVTKDSTILKIAVEMLNAPDKVPQLIEFNQSHPLSGIIQLLCKAWGLPDHTNYALQFSESNNQNYITEKNRNEIKNGSVLRLEQSPAKTVQDILAKINTGTESDQITALTKLSTVSSDLTFALEFINKNGLTLIVNHIETGKFKGNCLKYALVTFVELMDHGIISWDILENQFINKVVSFVSNQSTTQDPKIIQSCLSILENIVLNSSGKNSHVEKEITIPSLISHLENQDRIIQQNAIALINAIFSKADLTKRKTIAATLSSKQVRNKIYENLITKNVSKEALGTELAHQLYVLQTLMLGLLEPKMQTRADSQEQESQEKIKELRKYAFDNENNISAEVTTRRQTGSLSKDFKKLGFKCEIDPIKDFNETPPGILALDCMLYFARNYREDYTKIVLRNSCRADEQECPFGKTSVELVKLLCDILQIGEPPSEQGQTYHSLFFTHDHPFEELFCICIVLLNKTWKEMRATTEDFVKVLSVVREQISRALTASPKGFDKFRQKIKELTYSEITHLWQQERTNREVWESHARPIVELKEKITPEIIDLIQQQRLGVLVGGTRFKKYMKINRKDKFWFVRLSPNHKILHYGECDEKSTPSLEELGTKLAVADIKCVVVGKECPHMKDLKGKISSPNLAFSLILKAAEVPSLDFVAPDEQIFDYWTDGINALLKEKMTSKSFENDLETLLSMDIKVRLLDAEGIDIPQDPPQIPPEPEDYDFYYENN-