Monarch geneset OGS2.0

DPOGS209265
Transcript: DPOGS209265-TA  3603 bp
Protein: DPOGS209265-PA  1200 aa
Genomic position: DPSCF300111 + 571419-577592
RNAseq coverage: 423x (Rank: top 29%)
Annotation
Heliconius: HMEL007895  0.0  86.87%
Bombyx: BGIBMGA007045-TA  0.0  79.22%
Drosophila: CG32082-PC  2e-134  41.97%
EBI UniRef50: UniRef50_D6WHG7  1e-170  44.29%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WHG7_TRICA
NCBI RefSeq: XP_975354.2  0.0  46.60%  PREDICTED: similar to AGAP001369-PA [Tribolium castaneum]
NCBI nr blastp: gi|189235335  0.0  46.60%  PREDICTED: similar to AGAP001369-PA [Tribolium castaneum]
NCBI nr blastx: gi|189235335  0.0  41.15%  PREDICTED: similar to AGAP001369-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0046847  1.7e-74  filopodium assembly
  GO:0007165  1.7e-74  signal transduction
  GO:0017124  1.7e-74  SH3 domain binding
  GO:0008093  1.7e-74  cytoskeletal adaptor activity
  GO:0005515  1.4e-15  protein binding
KEGG pathway: mmu:108100  2e-39
  K05627 (BAIAP2, IRSP53) maps to: Regulation of actin cytoskeleton; Adherens junction
InterPro domain: [25-257]  IPR013606  1.7e-74  IRSp53/MIM homology domain (IMD)
  [329-470]  IPR001452  1.4e-15  Src homology-3 domain
Orthology group: MCL16219  Insect specific

Nucleotide sequence:

>DPOGS209265-TA
ATGCTGCTAGCAGTCGACAGTTTGCCACACCTCAGTAACGAGGTTGCAGCATCCTTGAAAGGCCACATTCCTAAACGTCTATTTACGAATATCTTGGACAAGTTCAATCCTGGAGCTCGTCAGATGATAACAGCTGGTAAAGCTTACTTAAAGGCTTTACACGGTGCCGCTGCTGCGTCTCGTATGTACGTTGATGCTGTGGGTAAACTCGGAAGACAGGCTCAGCAGGGAACTTGGGGAGGATGCGCCGATATCGGTACAGCTCTTATGAAGGTTGTGGAAGTCTATCGAGAAATACAAGATCAACAAATGAACATTTTAAAAGCCTTTTACGTAGACTTGCTGGTCCCTCTAGAGACTAATCTGGAAAAAGATACGAAAGTCGTACAGTCTGAGCAAAAAAGATTTTTGCAACAACATAAACTACGCTCTGAGAGTTACAGCAAGGCCGCTGCAACCATCAAAAAACAAAGAAAGAAAAAAACGAATGTTACCAAAGTTGGTACAGCCATGGACAAAGAAATGAAGAGTATGCAAATATTGGAAGAAGAAAAAACAAAACTAGACGCTTTTTGCGAACAAAGTTTAAAAAATGCAATGACTCAAGAACGCAGGCGATACGGGTTTGTATTAGAGCGCCAGTGTTCTCTCGCTAAACATTGGCTTGCATACCATACAGCCGGTGCAACAGCTTATAACACTGGCCTAGATGAATGGCTAGAAGTATCGAGGACTAGAGAATTCCTGCCGTCCAATGTTGAAGCCATGTTTGTGAGCAGAATGAGGCAAGTATCTTTCTGGGCTGATGAAGATGTATACGCAAGTCCACGTAATGGCGATGACGATGATGGTGCTTCCGTTGGTTCCGCCTTACGCAAAACAAGATCTGTAGATGCTTCTTGTCTAGACGTTAGATCTATTGCAGATCTGGGATCACCCACCCATGGAATATCTAGAGCCAAATCTGATTTAAATCTACAAGCAAGTCTGCATACTATGGAACAAGATATAGAAACACGAAATAAAACACGTCCCTCTTCTCTTGCGCCACCAACATGCACATCACGAGATCCTCCTTTAGCCCGTGCCCTTTATGCTTATTCTGCTGCTGGTGACAACCAACTAAGTTTTCAACAAGGAGATATATTAGCTTTGCTAGGAGATCGAACTAAAGGTTGGCAATATGGTGAAAATTTGCGAACCCATGGCGCGGGCTGGTTTCCATTAGCTTATACAGAACCCATGATTAATGAGGAAACTGGGGGGAGTTCTCCTGCTCGTTGGAGCAGTGCACAGACACCCAGTGAGACGCCACCACCTGCTCCCATATCGCACACCCAAATGTCATCAGCAACTCCTACCCCTCATTCACATAATCATCCACCAGCACGCATGTTTGGAGACACTGTTACGGCTCATCATGTAGCAAAGGGTCGACTGGCAAACAGCTCTCCTGGTTTACCACCGACGCTGCCGGCACCTTCACCAAGTGCTCTAAGCACTTCAGCAAGTGCTACATTCCATCCAACCCCGCTAGCAGCAGTAGCATCAAGCGCTATAAAAAACTCTACTTCGGCGGCTAGTTTTACTACTCATGCGGTTATAGAAACTAGAAGTTTTGGGCCACAAACGTTACCAATGCGTTCTCAGATTGCCCAAAAAGTAGCTCCGGCTAAAACTGTTGTGTCGGCAGTAGGTGCAGTGGGTAATGTATCCCTGCATAGTTCCAACGATTCCGGGTTTTCCAATGAACCACCCCCACAACCTGATGTTGACTACAGCGACGAAGAGGCACAACGACTTCGCGTTCCTATCAGTAAATCCGCACAGCTTAACAATGAACATAAAGCTTATCTGTTAAAAACACAAAGTTTAAAGAGAGGTCCAGGTCCTAAGACCACAAATAATGGATATTTGACTGACGATCAACAATTACTACTGCGTAATATGTTGGATATGGATAAAGATTCTCAAAAAATTAAAAGAACAAAATCCTTTTG
GAAATTTGGGCGCACCACTTCCGAAGAGATAATGGAGGGTATGTGTCTTTGGCAGCATCGTGATATAGTAGATACTGTGCCCGAATATAAAAAGAAACTATTTGTAAAATTAAAAAAGGAGTTGCGACCAGCACCTGAAATACCTGAAACAAATAAAAGTAAACATAATATTCAGCCAGACAAACCCTCATCTTTAAAAGATGGTAGCTCTTCCAGACCTCAAGAAACAATAGATAGAAAAGATATAAAAGTTAACAATGGTATGCGACAGGCAAGAAGTTTAGGTTCTATTCAACATGAAGAACATACTGAAAGAGAAAAAAAGCAAAATCAAATGATAAACAAGAAAAGTTTCTCAGAAAAGTCTATTGTTGAAGAAAAACAAGATGACTTCGTACATCGTAACGAATCCCGGCACTCTGATTTTACAAACACAAGTACTATAAAAACCAACTTTGAAAATAGTTTTTACAATGATGAAAATGGAGATGGCTATGTTATGAAAACTGTGAAACGTAGGGAAATACTTCAAAGATATGATAACGAGAGCAATTCGGATAATAATTCTGTAGCATCTAGTACAGATCCGTACGATTGTATCATTGTTGATGATCATATGACTTCTAAGAAACAACAAGAACTGGATAGACGTCGTCAAGCACAAATTATGGAAAACGATATTAAAACGAATGAAGAGAATGCTTATATGCCCGAGTTTGAAGAAATCGAGGTAACACCTATAAAGCCCCATGTTCGTACAACTTTACTAAATGACAAAGCTAAAAAACCATCACGTGAGAGATCTCCACCGAAAGTAATGGAATTCAAAACGTTTAAGGACTCGTCAAAAGAAATTAAAACCTTTTCTGCTTCTTCTGAAACTTTAAAATATAAGGATAACGAACAACCAACAGATCGTTATGGCAATCCATTGGAGCGAAATAACAATGAGAGTCGAAAAGAAGAGATATATAACGACGATAATGGTACTTTGAAATATAAAAAACGATCTAATGAAGAAAAATCCAGAAAACATAACTTTGAAAATGGTAATACTAGAGATTATCCCCAAACAGAGACGAGACATCAAAAGAAAGACGCTCGTTTGTTTGAAAATACAGAACCAAGGATTGATTATTCTTTGGATCGACGTCAGTCAAAAAAAGAATTTATTGAGTCAAAAAATCGTGATAATAAAGCAAGAAATAGAGTAAACAGAAGTTATGATGATAGTTCATTACAGTATAATGGTAGAAAAGAAGAAAGATATTATGAGTCAAATATTTCGTACTTCAGTGACCAAGAAAGAAGATCTTCTCGATATGATTATGAAACTGATTCTAAAAAAGCAGAAAAGTCAAAGATGCGCCAAGAGGAAGCGAGATTGGAAGCGAGGCGTATAAATGAACGTAGGACTGATGTACCTTACAGCGAATCGGATGATCAGAAATTTAACGAAGCTTTAAAACAACAGCGACCTCAACTTTTGCCAAGAACGAAGCTTCTTAAACGAACCGCAGATGGTAACGAGTTGCCAGAAGACGGTCATACATTTGGACCATGGTACGATTTATGGGGGCGAGAAACTGCTATGTACAAATAA

Protein sequence:

>DPOGS209265-PA
MLLAVDSLPHLSNEVAASLKGHIPKRLFTNILDKFNPGARQMITAGKAYLKALHGAAAASRMYVDAVGKLGRQAQQGTWGGCADIGTALMKVVEVYREIQDQQMNILKAFYVDLLVPLETNLEKDTKVVQSEQKRFLQQHKLRSESYSKAAATIKKQRKKKTNVTKVGTAMDKEMKSMQILEEEKTKLDAFCEQSLKNAMTQERRRYGFVLERQCSLAKHWLAYHTAGATAYNTGLDEWLEVSRTREFLPSNVEAMFVSRMRQVSFWADEDVYASPRNGDDDDGASVGSALRKTRSVDASCLDVRSIADLGSPTHGISRAKSDLNLQASLHTMEQDIETRNKTRPSSLAPPTCTSRDPPLARALYAYSAAGDNQLSFQQGDILALLGDRTKGWQYGENLRTHGAGWFPLAYTEPMINEETGGSSPARWSSAQTPSETPPPAPISHTQMSSATPTPHSHNHPPARMFGDTVTAHHVAKGRLANSSPGLPPTLPAPSPSALSTSASATFHPTPLAAVASSAIKNSTSAASFTTHAVIETRSFGPQTLPMRSQIAQKVAPAKTVVSAVGAVGNVSLHSSNDSGFSNEPPPQPDVDYSDEEAQRLRVPISKSAQLNNEHKAYLLKTQSLKRGPGPKTTNNGYLTDDQQLLLRNMLDMDKDSQKIKRTKSFWKFGRTTSEEIMEGMCLWQHRDIVDTVPEYKKKLFVKLKKELRPAPEIPETNKSKHNIQPDKPSSLKDGSSSRPQETIDRKDIKVNNGMRQARSLGSIQHEEHTEREKKQNQMINKKSFSEKSIVEEKQDDFVHRNESRHSDFTNTSTIKTNFENSFYNDENGDGYVMKTVKRREILQRYDNESNSDNNSVASSTDPYDCIIVDDHMTSKKQQELDRRRQAQIMENDIKTNEENAYMPEFEEIEVTPIKPHVRTTLLNDKAKKPSRERSPPKVMEFKTFKDSSKEIKTFSASSETLKYKDNEQPTDRYGNPLERNNNESRKEEIYNDDNGTLKYKKRSNEEKSRKHNFENGNTRDYPQTETRHQKKDARLFENTEPRIDYSLDRRQSKKEFIESKNRDNKARNRVNRSYDDSSLQYNGRKEERYYESNISYFSDQERRSSRYDYETDSKKAEKSKMRQEEARLEARRINERRTDVPYSESDDQKFNEALKQQRPQLLPRTKLLKRTADGNELPEDGHTFGPWYDLWGRETAMYK-