Monarch geneset OGS2.0

DPOGS210955
TranscriptDPOGS210955-TA2022 bp
ProteinDPOGS210955-PA673 aa
Genomic positionDPSCF300004 - 1193302-1195571
RNAseq coverage145x (Rank: top 54%)
Annotation
HeliconiusHMEL0080900.058.15% 
BombyxBGIBMGA006382-TA9e-16851.20% 
DrosophilaCG14304-PA1e-6738.29% 
EBI UniRef50UniRef50_A0NC904e-7050.00%AGAP009479-PA n=1 Tax=Anopheles gambiae RepID=A0NC90_ANOGA
NCBI RefSeqXP_001230737.28e-7150.00%AGAP009479-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582883452e-6950.00%AGAP009479-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571297693e-7834.23%hypothetical protein AaeL_AAEL011586 [Aedes aegypti]
Group
Gene OntologyGO:00080612.2e-12chitin binding
GO:00060302.2e-12chitin metabolic process
GO:00055762.2e-12extracellular region
KEGG pathway 
InterPro domain[500-566] IPR0025572.2e-12Chitin binding domain
Orthology groupMCL20445 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210955-TA
ATGAATGGAAGAATGGAAACAGCAGCCTCTAATTCTGTTAATACTTTAAATACTGAAATCGGTATAACGCAAAAACTTGGTACAAATAAGGGTCTTAAATTAAATTCAACTCAGTTGTCCATTAATGAAAGGTACAGAAGATTAATACCCTATATGACATTTTACTATGCTAATGATTTATTGCCACCCACAACAGAATCTTACACTAATAATGTAGAAGTTGAAAAGGCTGAAATTATTGAAGCTGGAACTTCGAGGCCTTTAGGGCGAGAACCAAAAATTATTTATTCACAAAGACAAAAAGATATTCCCAGGTACCAAGGAAACCGGTTAACCCCATTTAATATAGCCTCATCAAGTCCGAGTAGATTGTTCTATAAAGACGTTTTCCCTGTTGTGTACCAACATATACCAAAAGCAAATCACAACTACAGTCCTGCCTCGGAGAATAGGGATCATTTATATGACAGCTACTTCCCGAAACCCATTAAAGCACCGAGTGTTCCATTTACGAAGCCACAAACCAGAAAGCCATACTATAACTATAACCATGAAAATATACCCAACATTCAGTATATTCAATCCAATCAAGGAGAATCTCCAAAATATAAACTTGTGCCTTATGACCAGGCTCCACCAGTAAACGTTGAAAAAAATGGCAATTACAATAAACAATACAATGTTCCTGTTCTAATACCTGAGGAACCTGTTTATATCAAACCCAGACCGCATGTTTATCAGCCCGCGCAACATTTCTATGAAAATAGCTACCAACCAAAGCCAAGAAAACCTCCAACGACAATTTCAGAAGTCTATTACGAACGAAGGCCATATGAACCAGTATTATCGGAACCAGTGATAGAAAGTGGTTTTAAGCCTATTATTAAATCTCAAATAACATCTACTGAAAATCCTGTGTATACATCGACTGCGCAAGATATACCATATGACGATTACTATCAGGAAAAGCAAGAACAAGGCCAGTTGATTCAACCAGCTCAAATAGAGTCTGAAGTAACAAAATATAGACCTCAATATGTTGTAGAACAGCCGCCTACCCAGAACGAACACTACAGTTCTTCAAAATCGGTGGCTTTAGCGGATTTACTTAATTCATTACAGATAAATAAATCTATACCGAAACCAATAACTAGAGAAAACGTCGGAGCTTCAATTAAGACATTATTACAAGTTTTGAATGCGTTAAGAGCAATACCTCAGGAGAATGACGTAGAAACATCCGTATTAAGCACACCTAAGCCGTTTGAAGCGATTGAAACACCTGTCCGATCGACTCCGCATACCGTTGTTGCTACAACCGCAAGACCACAAAATTCTGATATCCATGAACCTTTGCTTGCTACCATTCACACGCCCTCGCAACATATTGATGAATATCCAACTGGCGGCAGTAGCTCTCAGCGTTTTCCTCTTCCAGTTACATCTGAGGAGGAGGGTGGGACTCCCGGTAAACCAGATGTCGACTATCCAATTTTAACCGTTATACCTGAAACCAGTTTTAATTGTAAAACGCAACGTTATAAAGGATTTTTTGCTGATCCCGAAACAAGATGTCAGGTATGGCATTATTGTGATTTGAATGGTGGTCAAGCGTCATTCCTGTGTCCTAACGGGACGATATTTTCTCAAGCGGCACTAACGTGTGATTGGTGGTTTAATGTACGCTGTTCACAAACCGCTCAACTGTACGTGCTAAATGAAAGTCTATACAAATATATTTTGCCACATTCACCTAAGTTCCCCGAAGACTACAGCGGACCCTTAGTAGATAAGTACCTGTCGTTAAAGTTTAAAGAAATGGAAGAACAGTTCAGGAAGAATAAAAATAAAAAAGCCGAAAAAATGCAAGATGACGATTCAAATGACTCAAAAGAAACTGATGATTCTGTGATTGAAAGTCGAAGACAAGAAAACAGTCAGAATGACTCTGTAAACCAACCTCACGTAATTGTCGAATCGCCTGGCAGTAGTGGCAACGTTCAGAGATTACAAGATGAATAA

Protein sequence:

>DPOGS210955-PA
MNGRMETAASNSVNTLNTEIGITQKLGTNKGLKLNSTQLSINERYRRLIPYMTFYYANDLLPPTTESYTNNVEVEKAEIIEAGTSRPLGREPKIIYSQRQKDIPRYQGNRLTPFNIASSSPSRLFYKDVFPVVYQHIPKANHNYSPASENRDHLYDSYFPKPIKAPSVPFTKPQTRKPYYNYNHENIPNIQYIQSNQGESPKYKLVPYDQAPPVNVEKNGNYNKQYNVPVLIPEEPVYIKPRPHVYQPAQHFYENSYQPKPRKPPTTISEVYYERRPYEPVLSEPVIESGFKPIIKSQITSTENPVYTSTAQDIPYDDYYQEKQEQGQLIQPAQIESEVTKYRPQYVVEQPPTQNEHYSSSKSVALADLLNSLQINKSIPKPITRENVGASIKTLLQVLNALRAIPQENDVETSVLSTPKPFEAIETPVRSTPHTVVATTARPQNSDIHEPLLATIHTPSQHIDEYPTGGSSSQRFPLPVTSEEEGGTPGKPDVDYPILTVIPETSFNCKTQRYKGFFADPETRCQVWHYCDLNGGQASFLCPNGTIFSQAALTCDWWFNVRCSQTAQLYVLNESLYKYILPHSPKFPEDYSGPLVDKYLSLKFKEMEEQFRKNKNKKAEKMQDDDSNDSKETDDSVIESRRQENSQNDSVNQPHVIVESPGSSGNVQRLQDE-