Monarch geneset OGS2.0

DPOGS203993
TranscriptDPOGS203993-TA3360 bp
ProteinDPOGS203993-PA1119 aa
Genomic positionDPSCF300005 + 1353308-1373333
RNAseq coverage235x (Rank: top 43%)
Annotation
HeliconiusHMEL0138690.066.54% 
BombyxBGIBMGA002138-TA0.082.14% 
Drosophilafw-PA0.053.55% 
EBI UniRef50UniRef50_D6WIC10.062.08%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WIC1_TRICA
NCBI RefSeqXP_002431650.10.062.99%furrowed, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3503968360.064.60%PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Bombus impatiens]
NCBI nr blastxgi|3503968360.064.70%PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Bombus impatiens]
Group
Gene OntologyGO:00054882.8e-27binding
GO:00071551.3e-05cell adhesion
KEGG pathwayhsa:64033e-57 
 K06496 (SELP)maps-> Malaria
    Cell adhesion molecules (CAMs)
InterPro domain[87-234] IPR0065851.2e-68Fucolectin tachylectin-4 pentraxin-1
[80-229] IPR0089797.7e-34Galactose-binding domain-like
[236-378] IPR0161872.8e-27C-type lectin fold
[249-377] IPR0161866.2e-24C-type lectin-like
[671-738] IPR0160604.5e-19Complement control module
[236-375] IPR0013046.2e-19C-type lectin
[935-990] IPR0004361.2e-14Sushi/SCR/CCP
Orthology groupMCL10594 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203993-TA
ATGTATTATTATCTATATCTGTACATTATATGTATTGAGTTTGTTTTGTTATGTTTCATTGGGTTTTTTGTTTTAGGCGACAGACGGTGTGGCCATCCAGCTGTGCCGCCGAATGCAAAGGTTTCTTTGGCTTCAGACACTGACATAGTACCGGGAACAGTGGCCACCTATGAATGCGATGACGGCTACGAGCTTTTCGGTGCACATCAAAGAGAATGTACATTAAGGGGTGACTGGACTTCCGAACCGCCATTTTGTGGAACCAACGTTGCCTTCAGAAAACCAGCAAATCAATCAACCACTGTACGTGGTGGAAGCGCTAGCAACGGCAATGATGGTGAAAAGACCACCGAACATGACGGCAAAAGATGCACAGAAACACAAAGAGAGGCTTCACCCTGGTGGCAGGTCGACCTGCTACGTCACTACGCCGTCAAAGTGGTTAGAGTCACTACTAGGGGCTGTTGCGGTCACCAACCGCTTCAAGATCTGGAGATCAGAGTAGGCAACAGCAGCAGTGATTTACAAAGAAACCCACTATGTGCTTGGTTTCCTGGCACCATTGACGAAGGTATAACGAAGACGTTCACTTGCGCCCGCCCCCTCATAGGTCAGCACGTTTTCCTCCAGCTGGTTGGAGTAGAGAGCTCCCTTTCTTTATGTGAAGTAGAAGTATTCACCACAGAAGAGTTCTCAAACGATCGATGCGCCCCGATTGGAGCCTCAGCAGATATTGAGTTGGCCGCTTTTTCTCGAAACTGTTACGAATTTAATGGAGCTAAGGGAGCATCTTTTGAGGAAGCGAGAAAACAATGTCAGGAACACGGAGGAGACTTAATACATGGGTTTCAGGGCGCTACGTCAAGTTACCTTATTCAGGAGTTAGAAAGGCGCAGGCCAAATTTAAAGACTCTGTTGGTCTGGATCGGAGCTCAAAAGGAACCTGGTCTTACTTCCAGAACTTGGAAATGGGTTGACGGAGAAACTGTTACAAAGCCAACGTGGGGAAAGGACCAGCCGAATAATTATAACGGCGAACAAAACTGCGTGGTGCTTGATGGTGGGCGCTCCTGGCTGTGGAACGACGTCGGCTGCAACCTGGACTACCTACACTGGATCTGCCAGTACCTGCCCCCTACTTGTGGAAGTCCAGATAAGCTGTTGAACACGACTATTGAAGATAATGACTACCATGTAGGGTCTTCGATAAGGTATAAATGTCCACAAGGTCACATGCTGATTGGAGATAAAACTAGAGAATGCAAAAAAGATGGATTCTGGTCTGGAGCAGCACCAAGTTGTAAATACATTAACTGCGGTGGATTGACTCCAATTCAAGATGGTAGCGTCGATTTAGTGGATGGGCACACCACTTACGGAGCGAAAGCTATTTATTCGTGTAAAGAGAATTACACGCTAGTGGGTAACGCTGAACGAATGTGTAAGGATCAAGGAATTTGGGACGGAGAGGCACCCAAGTGTCTGTTTGACTGGTGTCCGGAGCCACCACCAGTCTCAGGAGCTACGGTCACCACTAGTGGTCACAAAGCAGGATCGTTGGCCACCTACACTTGCCAGAATGGATTTATACTTTTTGGTTCACCAAGTATCACTTGTAATCTCGGTGGAACATGGGGCGGCACCCCACCTTCATGTAAATATGTAGACTGCGGCACTCCAGCACAAGTCCATAAAGGGTCCTTCAGACTATTGAACGGCACGACTACGTACGGCTCGATAGCCCAATTCACCTGCGAACCCGATTACTGGTTGGCGGGAGCTGAAGTACTCACATGCTATCGTGATGGCAAATGGTCACATGATATACCTTCCTGTGAATTGATAAGTTGTTCGGACCCGGAGGTGCCGACTGGTGGCTATATGGAAGCATACGATTACAACGTCCACTCAACCATTGACTTTCATTGTGAGAAGGGACATAAACTCATTGGTGAACCAAGTCTCACGTGTCAGCCTGACGGAGAGTGGTCAGGAGAATCGCCTAAGTGTGAATATGTGGACTGCGGTAAATTGCCACCTCTGCCTTACGGTTCGGCAGAACTTTTAAATGGTACTACGCATTTAGGAAGTATCATCCAATACTCATGTACTACCAACTACAGACTGGTCGGACCAGTAAGAAGGATATGTACTGAAGATTTCCAATGGAGTGATTCATCACCAAGATGTGAAGAAATAAGATGTCCAGAGCCAATCGTAGCGGAAAACAGCATCGTATCCGTAACTGGTAACGATCGCATGCACGGACGCACGCTCATTCGTACACGATCAAGCACTCAGGGCAATACGTACAGAATCGGTGCCTTGGTAAAATACCGCTGTGAGCGTGGGTACAAGGTGGTGGGCGAGAGTCTATCAACTTGTGAAGATAATGGACAATGGAGTGGTGTCAGACCTAAATGTCAATACGTTGACTGTGGAAATCCGGGTCGCATACAAAATGGCAAAGTCACATTGGCTACAAACGCGACGTACTATGGGGCAGCAGCCTTGTACGAATGCGACGAACATTGGCAACTAGATGGTGTCTCAAGGCGATTGTGTCAAGATAACGAAACCTGGAGTTCTGAAGCACCTGTATGTAAAGAAATAACCTGTGTGGATCCTTCAATCCAAATAAAGGGCAGTATTGGTTTATTGGTTGTGACGTCAACTCTCAGCATTGGAGGCGAAGCACACTACCGCTGTGAACGGGGATACAGCCTAAAAGGAAATGAAACTAGAACTTGTCTACCGAAAGGACAGTGGGCCGGAGCACCTCCTGTTTGCATACCGATAGACTGCAAGTCACCCGGCACTGTAGACAACGGCAGAGTGATTATTTCAAATAGTTCGACAATCTTCGGCAGCTCTATAGAGTATCATTGCTTACCGCAATATCAGCGAGTTGGACCATTCCTTCGCAAATGTTTAGACGATGGCAAATGGTCAGGAGAAGAACCCAAGTGCGAATTGATCACGAACGAAGCTGCTGAAAATGGCGCTCTACCACTCAGTGTTGGAGTTGGTTGCGGTATCGTTCTATTTTTGCTTATGTTGCTCGGAGTCATCTATTTAAGACTACGTAAAGCAACGCCAGTCAAGAACACTGAAAATATAGAAGGAGCTGAACGGAAAGAAGACCAAAACGCAGCCGTAATGAGCTACGCAACCCTCCACGATACTAACGGACGGCATATTTACGACCACGTAACGGACAATCTGTACGATTCACCGTACGGCGAGAGTTTGGCCGAGAACTCCGCGTACGGTAGACGCAGTGACACCGAATCCGCATACGAACCAGAACCCACCGGCCCCAACGCTGTAGTCACCATCAATGGAGTGGCCGTTCGTTGA

Protein sequence:

>DPOGS203993-PA
MYYYLYLYIICIEFVLLCFIGFFVLGDRRCGHPAVPPNAKVSLASDTDIVPGTVATYECDDGYELFGAHQRECTLRGDWTSEPPFCGTNVAFRKPANQSTTVRGGSASNGNDGEKTTEHDGKRCTETQREASPWWQVDLLRHYAVKVVRVTTRGCCGHQPLQDLEIRVGNSSSDLQRNPLCAWFPGTIDEGITKTFTCARPLIGQHVFLQLVGVESSLSLCEVEVFTTEEFSNDRCAPIGASADIELAAFSRNCYEFNGAKGASFEEARKQCQEHGGDLIHGFQGATSSYLIQELERRRPNLKTLLVWIGAQKEPGLTSRTWKWVDGETVTKPTWGKDQPNNYNGEQNCVVLDGGRSWLWNDVGCNLDYLHWICQYLPPTCGSPDKLLNTTIEDNDYHVGSSIRYKCPQGHMLIGDKTRECKKDGFWSGAAPSCKYINCGGLTPIQDGSVDLVDGHTTYGAKAIYSCKENYTLVGNAERMCKDQGIWDGEAPKCLFDWCPEPPPVSGATVTTSGHKAGSLATYTCQNGFILFGSPSITCNLGGTWGGTPPSCKYVDCGTPAQVHKGSFRLLNGTTTYGSIAQFTCEPDYWLAGAEVLTCYRDGKWSHDIPSCELISCSDPEVPTGGYMEAYDYNVHSTIDFHCEKGHKLIGEPSLTCQPDGEWSGESPKCEYVDCGKLPPLPYGSAELLNGTTHLGSIIQYSCTTNYRLVGPVRRICTEDFQWSDSSPRCEEIRCPEPIVAENSIVSVTGNDRMHGRTLIRTRSSTQGNTYRIGALVKYRCERGYKVVGESLSTCEDNGQWSGVRPKCQYVDCGNPGRIQNGKVTLATNATYYGAAALYECDEHWQLDGVSRRLCQDNETWSSEAPVCKEITCVDPSIQIKGSIGLLVVTSTLSIGGEAHYRCERGYSLKGNETRTCLPKGQWAGAPPVCIPIDCKSPGTVDNGRVIISNSSTIFGSSIEYHCLPQYQRVGPFLRKCLDDGKWSGEEPKCELITNEAAENGALPLSVGVGCGIVLFLLMLLGVIYLRLRKATPVKNTENIEGAERKEDQNAAVMSYATLHDTNGRHIYDHVTDNLYDSPYGESLAENSAYGRRSDTESAYEPEPTGPNAVVTINGVAVR-