Monarch geneset OGS2.0

DPOGS214861
TranscriptDPOGS214861-TA1530 bp
ProteinDPOGS214861-PA509 aa
Genomic positionDPSCF300091 + 74825-79131
RNAseq coverage758x (Rank: top 17%)
Annotation
HeliconiusHMEL0150132e-11285.22% 
BombyxBGIBMGA010072-TA5e-11651.18% 
DrosophilaCortactin-PB5e-7647.55% 
EBI UniRef50UniRef50_Q142473e-12246.38%Src substrate cortactin n=85 Tax=Metazoa RepID=SRC8_HUMAN
NCBI RefSeqXP_001642095.12e-9950.77%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|2240504884e-12749.91%PREDICTED: cortactin [Taeniopygia guttata]
NCBI nr blastxgi|1942185835e-13550.95%PREDICTED: src substrate cortactin isoform 1 [Equus caballus]
Group
Gene OntologyGO:00055153.6e-20protein binding
KEGG pathwaytgu:1002229816e-128 
 K06106 (CTTN, EMS1)maps-> Shigellosis
    Pathogenic Escherichia coli infection
    Bacterial invasion of epithelial cells
    Tight junction
InterPro domain[2-508] IPR0155032.1e-176Cortactin
[76-112] IPR0031342.9e-21Hs1/Cortactin
[453-508] IPR0014523.6e-20Src homology-3 domain
Orthology groupMCL17318 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214861-TA
ATGTGGAAAGCGGCCACTGATGTAGTGGCGCCCACACCGGCCGAGGCTGACGATTGGGAGACAGATCCCGACTTTGTGAATGATGTCACAGAACAGGAACAACGTTGGGGGCCAGGGGGAAGACATGTAGAAGCTATTGATATGGCTAAACTCAGAGAGGAAGTTCTGGAAGCAGACAAGCAAATTAAACAGAAGCAGTACGAGGAAGGGCCTAAACCCTCATATGGATATGGAGGGAAATTTGGTGTCCAACAAGACAGGATGGATAAATCAGCGGTCGGGCACGATTACGTCGGCAAAACAGAGAAGCATGTCTCGCAGAAAGATTACGCACAAGGTTTCGGCGGTAAGTTTGGCGTTCAAACTGACCGTATGGACGCCAGCGCGGTGGGTCACGACTATGTGGGCGTCGTGTCCAAGCACGCCTCGCAGACCGATCATAGTAGGGGCTTCGGGGGGAAGTACGGCGTGCAGACTGACAGAGTTGACAAGAGCGCGGCTGGTTGGGAACACAAGGAGCAGATAGAGAAGCATCCGTCGCAGAAAGACTACTCGGTCGGCTTCGGAGGCAAGTTCGGTGTACAGGTCGACCGGCAGGACGCCAGCGCCGCCGACTGGGGACACAAGGAACCCACTGCGGCACACGAGTCGCAGACTGATCACTCCCGCGGTTTCGGTGGTAAGTTCGGGGTGCAGACGGACAGACAGGACGCGTCCGCCGTCGGCTGGGATCACCAGGAGAAGACGGAGGCTCACGCTAGCCAAGTGGACCATAAGAAGGGCTTCGGTGGTAAATTCGGTGTCCAAACTGACAGAGTGGATAAATGCGCCCAAGGTTTCGACTCCGTGGAGAAGTCGGGCGGGTACAGTAGACCCAGGCCAGACATCGGCGGAGCCAAGCCCAGCTCCATACGAGCCAAGTTTGAGAACATGGCCAAGGAAAAAGAACAGATCCTTCGAGATCAATCCGTTCAGAAATTAAGACAGGAGAGGCAACAACTAGATCGTAGTTTGTCAGAAAAAGAAAAACAACGTCTGGAGAAAGAAAAGGAGCAAAATCAAGAAGAGACGGCCAGCACGAACGTGTTCAAGAAGACTGAAGGTGGTAACGCAGTGCCCGCGGCTGTGCAGGCTGTGCAGGACGCGAGACAAGAGGTGGAGCAGGACGTTAGACAGGACTCTGTGCACGAGAAACAGGAAGTGAAGCAGAGCAACCTGCCGGATGTGACTCTTGTGGGAGACGCCAAGGACGAAGACAAGGAAGAGCATCCGCGGCAGCCCACGATAGTGGTGTCTCCTGTGGGCTGGGAGGGGGAGGGCGAGGGCGAGGCGTGCGAGGCTGACGACGAGGACGGGTACACGGCCCGCGCGCTGTACGACTACCAGGCCGCGGCGCCCGACGAAATATCATTCGACCCCGACGACCTCATCACCAACATCGTCATGATCGACGAGGGCTGGTGGCAGGGTCTGTGTAAGGGCGCATACGGCCTGTTCCCGGCTAACTACGTACAGCTACAAGACAAATAA

Protein sequence:

>DPOGS214861-PA
MWKAATDVVAPTPAEADDWETDPDFVNDVTEQEQRWGPGGRHVEAIDMAKLREEVLEADKQIKQKQYEEGPKPSYGYGGKFGVQQDRMDKSAVGHDYVGKTEKHVSQKDYAQGFGGKFGVQTDRMDASAVGHDYVGVVSKHASQTDHSRGFGGKYGVQTDRVDKSAAGWEHKEQIEKHPSQKDYSVGFGGKFGVQVDRQDASAADWGHKEPTAAHESQTDHSRGFGGKFGVQTDRQDASAVGWDHQEKTEAHASQVDHKKGFGGKFGVQTDRVDKCAQGFDSVEKSGGYSRPRPDIGGAKPSSIRAKFENMAKEKEQILRDQSVQKLRQERQQLDRSLSEKEKQRLEKEKEQNQEETASTNVFKKTEGGNAVPAAVQAVQDARQEVEQDVRQDSVHEKQEVKQSNLPDVTLVGDAKDEDKEEHPRQPTIVVSPVGWEGEGEGEACEADDEDGYTARALYDYQAAAPDEISFDPDDLITNIVMIDEGWWQGLCKGAYGLFPANYVQLQDK-