Monarch geneset OGS2.0

DPOGS202978
TranscriptDPOGS202978-TA2103 bp
ProteinDPOGS202978-PA700 aa
Genomic positionDPSCF300068 - 648658-657152
RNAseq coverage369x (Rank: top 32%)
Annotation
HeliconiusHMEL0095410.075.30% 
BombyxBGIBMGA012322-TA0.081.04% 
DrosophilaCG42388-PG4e-14642.05% 
EBI UniRef50UniRef50_Q7QE343e-15043.60%AGAP010676-PA n=4 Tax=Culicidae RepID=Q7QE34_ANOGA
NCBI RefSeqXP_001650930.12e-15243.95%hypothetical protein AaeL_AAEL005476 [Aedes aegypti]
NCBI nr blastpgi|1571100453e-15143.95%hypothetical protein AaeL_AAEL005476 [Aedes aegypti]
NCBI nr blastxgi|1571100453e-15242.84%hypothetical protein AaeL_AAEL005476 [Aedes aegypti]
Group
Gene OntologyGO:00055152.1e-20protein binding
KEGG pathwayrno:3162587e-10 
 K13738 (CD2AP)maps-> Bacterial invasion of epithelial cells
InterPro domain[642-697] IPR0014522.1e-20Src homology-3 domain
[74-159] IPR0010601.9e-14Fps/Fes/Fer/CIP4 homology
Orthology groupMCL12553 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202978-TA
ATGAAACGATCGCCCGACCTCTATACTACGAGCTCCAGTGGAAGCGGCTCGAGCGGAGGCGCTGTCACCAAGAGTGTCGGGTGTGATGTACCCGGGGAGGGCGAGGGCGGCGTGAGTCGCGACGTGTCCACTAAGATAGCGGCCGGTATGGATCGCTGGCAGACTACCCTGGAGGATATGATGCATCTGAGATCCTCCAGCAACCACGTGTTCAAAGACAACTACTGGGCTCAGAATGGTTTCGACGAGCTGCGGCGTTACGTGAAGCAGGGCGGCGACTTCAGCAAGGAACTCGCCAACATCTTGCAGGAACGAGTCGAAGCAGAAAACTACTATTCGAAATGCTTGGCGAAATTAGGCTCGAAGCTCAGTAAAGCTTGCAAAGAGAGTGTAGGGTCTTGTGCGGAAGCGTGGAAACACGTCGCTCATGATATGGAGAAGAGAGCTGAGATACACAGGAGTTACTCCAGTGCGTTGACGGAAGAACTTGTGAAACCGATGAAACAAGTCATAGACAACCAACTCAAGTTAAGGAAAAAGATCGAGGGCAACGTGGACAAGACGACGCGAGCGTTGGCAGATTGGAGAACAGCGGAGGCCAAGTCAAAGAGACAATCACACGCGGCCGCCAGGGAAAATGAGAAGCTTCAAGACGCCTCACTAGACATTAGTCGTCTGTCACGCAGCTCAAGTATGGGACACATCCCTCACTCCATCCTGAGCGCCGCGCGAGCTGAGCGGCTCGCGCCCGCCGCCAGCGAGCGCGACGCCGCCAAGCTGCAAGTGAAGAAGAGGAAGACCGAGGACGCCGTCAAGAAGACAGAGGTCGACTATTACAACGTTTGTGTTCAAGCGGAAAGATCGAGGTTGGAATGGGAGACGAGTGTGGTGAAGGGCGCGGGGATGCTGGAGTCCCTCGAGGAGGAGAGACTTGAACAGCTGAAGAGCGCCGCCGACTGTTACCTCAGGCTGACGGCAGCAGTCCCGCCGCAGCTGGCGGAGGCGACCAATGCTCTCGTGGCTCCTATTAAGAAAGCCAACTCTAACGTGGACATGCGCGTGGTGCGAGGAGTGCGCGGCGCTCCGGCCGGCGCCAGTGACCAACTGCTACCAGACTTCTACTGCGAGCACACCACGCTAGCCATGAACAAGGAGAGACGTAAACAAGCCCTATTGAAGATTCTTCAGTTGGTAAAGCAAGATATAGACAGAGAGCGGAAGTCTAAACAAGGTCTCGAAAAGCTTTCGATGGCTATAAAACAGACACCCACTTTCGGTAGTGATGATTCTCAACAAAACGTAGCTGACAAACTGTACCACATGAGGTCCATGTTAACATATCTGGAGGCGGTGCGATACAAGATCACAACCAGCCTCACCGAGCTGGACAACCGACCCGCCGGCCAACATCCGCTCGCCACACACATACAGATCGTACGAGACAAGCAAGGCTTGCAGCAGAGCATACTCAAGGTTCCGCCCTGGCTGCAGGCGGACTATCAGAGCGAGGTGAACAAGAACCTTCAACCGAACTACACGGCCTTGTCGAAGAGTGTGACCGGCGCCGTCTGTGGAGCGGACGAACGAGAGAGACGCGACTCCGTACTCAGCCGCGCCTCCAGCAAACTGCGCATTGAGCATCTGTCTTTCCGGAAGGCCGTGGAGAGGAGTAGCGCTCCCGCCACGCCCACGGTGACCACGGCGCCCACCACACCCACAGTGACCACGGCGCCCACCACGCCCGTGTCCGCGGAGTCCCCCGCTCTCGACTGGACGTCCAGCGAGCGCGGCGCTGGAGACGGACTCTCCAACCAGCAGGACAGTGACTTCGACGAATTTTCATCTCAAAGTGACAGTTCAGCTGATCTATCAGACGTGAGGAAACATAACAACAACACCAGCGAGATGAGTCACAAATATATCGGCAAATGTAGAGCGCTGTACACGTACGAGGCCAGACTGGACGACGAACTGACGCTCACACCAGGAGACGTGATCGATATCTATGAGAAGCAGGACGAGGTGTGGTGGAGCGGAGATCTCAACGGCTGCTTCGGCATCTTCCCCTCCTCCTACGTCGAGGAGATCACGTCCTGCATCTAG

Protein sequence:

>DPOGS202978-PA
MKRSPDLYTTSSSGSGSSGGAVTKSVGCDVPGEGEGGVSRDVSTKIAAGMDRWQTTLEDMMHLRSSSNHVFKDNYWAQNGFDELRRYVKQGGDFSKELANILQERVEAENYYSKCLAKLGSKLSKACKESVGSCAEAWKHVAHDMEKRAEIHRSYSSALTEELVKPMKQVIDNQLKLRKKIEGNVDKTTRALADWRTAEAKSKRQSHAAARENEKLQDASLDISRLSRSSSMGHIPHSILSAARAERLAPAASERDAAKLQVKKRKTEDAVKKTEVDYYNVCVQAERSRLEWETSVVKGAGMLESLEEERLEQLKSAADCYLRLTAAVPPQLAEATNALVAPIKKANSNVDMRVVRGVRGAPAGASDQLLPDFYCEHTTLAMNKERRKQALLKILQLVKQDIDRERKSKQGLEKLSMAIKQTPTFGSDDSQQNVADKLYHMRSMLTYLEAVRYKITTSLTELDNRPAGQHPLATHIQIVRDKQGLQQSILKVPPWLQADYQSEVNKNLQPNYTALSKSVTGAVCGADERERRDSVLSRASSKLRIEHLSFRKAVERSSAPATPTVTTAPTTPTVTTAPTTPVSAESPALDWTSSERGAGDGLSNQQDSDFDEFSSQSDSSADLSDVRKHNNNTSEMSHKYIGKCRALYTYEARLDDELTLTPGDVIDIYEKQDEVWWSGDLNGCFGIFPSSYVEEITSCI-