Monarch geneset OGS2.0

DPOGS204348
TranscriptDPOGS204348-TA1521 bp
ProteinDPOGS204348-PA506 aa
Genomic positionDPSCF300142 + 270349-272005
RNAseq coverage496x (Rank: top 25%)
Annotation
HeliconiusHMEL0070109e-8670.79% 
BombyxBGIBMGA007250-TA3e-3249.08% 
DrosophilaMur89F-PA2e-3636.90% 
EBI UniRef50UniRef50_Q7PQ782e-7537.88%AGAP004367-PA n=2 Tax=Anopheles gambiae RepID=Q7PQ78_ANOGA
NCBI RefSeqXP_001948722.14e-5335.19%PREDICTED: similar to AGAP004367-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|3479717726e-7537.88%AGAP004367-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479717721e-9336.40%AGAP004367-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00080613.4e-10chitin binding
GO:00060303.4e-10chitin metabolic process
GO:00055763.4e-10extracellular region
KEGG pathway 
InterPro domain[54-107] IPR0025573.4e-10Chitin binding domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204348-TA
ATGGAACTTCAACGACAATTAGCAGCACTAGTATTACCTACTACCTCTCAACCTTCTGTTTCATCATCAACGCCTATGGATTCTTCGACAACTATAAAAGATCCTAATTCAAACTGTTCCCAGTCTAATTCTGATCATGAAAATAATAAAAACTATACATGTACTAAGGCCGGCTTTTACGCTGATCCAAATGATTGCAAAAAGTTTTACCGTTGTGTAGATTGGGAGAATAACGGAGAAAAGTTTTCAATATATCACTTTGAGTGTGGTGATGGAACAATTTGGGATCCAGCTTTAGAAACCTGTAACCACGAGGATTCAGTTTATCCACCTCGAAATTGCAATGGCAAAGAACAACAAAATCAAACTAGTAGCGTTTCTCCTTCAACAACAGAAGTATCTACAACGACGTCATCTGAAACAGAATCTACTGGTATTGTAACAGAATCTGAAACAACGAGTAGTACTACTTCACAGACAACAGTACAATCATCTACTACAACATTTTCAACAGCAGAATCAACATCTACAACCACTCAAATGACGACGACACAACCTTCTACAACTCCATCTAAACAAACAACTGCCCAAACCTCAACTACAGAAACTTCTACAGTGATAATGTCAACAAGTACTGAGCCTCCAACTAAACCTAGCTCCACAACAATTGCTTCGACTACAATCGAATCTTCTACGCAACAAACATCAACTACAAACAGTGAAGAGCCAACGACAGATTCATCAACACAAACTACTACTGATATGACAGAGATCACTACTTCTGTATCGACAACTACAATTATGGATGAATCTACAACTATGAGTACTCAAACATCGTCAACAATGTCACAAGCCAGTACCACAGCAATTACAACTACGACTGAAGAGATTTCATCATCTACTACTGAAAGTCAAAGTACAGCTATGGAATCTACCACCGTAACAAATGGTCAGGATAATAGTACAGAATCAAACAAAGATTGTCCTGATACTGAAAAGGATCAAAATTTATATGTTTGTCCAAGTTCTTTCAAAAGACATCCAAAATATTGTAACTTGTTCTATCAATGTACGGAGGATAATGATAATCATGAAGTAAAAATAGCTGTATTTAATTGTCCCAATAACACCATTTACGATGAAAATAAGGTTCAGTGTGTAGAAGAAAAACAAGCAAGTAAAAAATGCAACGGGCAAATATCTCAGAGAAATAGAATAAAACGATTGGGAGCCTATTTTAATGAACCGGTAATTGTATCCAAAAATAGTTTAAGATGTTCAGAAGCTGGACATTTTCCATTTGAAAAACGAGAACAATGTTCATCAGCCTTTTTAAAATGTGAATACGCAACAAGTGGACAATTGAGGGGATACGTGTACAAATGCCCAGAAGGATTTGTGTATTGGTCTATTAGCAGACGATGTGAAGCTATAAGGAAAGTCAGAGATTGTAAATTATCGTCATATAATTGGAATAACAGATACGATGTACCTGTGGAACGAAATAATATAGCATATTGA

Protein sequence:

>DPOGS204348-PA
MELQRQLAALVLPTTSQPSVSSSTPMDSSTTIKDPNSNCSQSNSDHENNKNYTCTKAGFYADPNDCKKFYRCVDWENNGEKFSIYHFECGDGTIWDPALETCNHEDSVYPPRNCNGKEQQNQTSSVSPSTTEVSTTTSSETESTGIVTESETTSSTTSQTTVQSSTTTFSTAESTSTTTQMTTTQPSTTPSKQTTAQTSTTETSTVIMSTSTEPPTKPSSTTIASTTIESSTQQTSTTNSEEPTTDSSTQTTTDMTEITTSVSTTTIMDESTTMSTQTSSTMSQASTTAITTTTEEISSSTTESQSTAMESTTVTNGQDNSTESNKDCPDTEKDQNLYVCPSSFKRHPKYCNLFYQCTEDNDNHEVKIAVFNCPNNTIYDENKVQCVEEKQASKKCNGQISQRNRIKRLGAYFNEPVIVSKNSLRCSEAGHFPFEKREQCSSAFLKCEYATSGQLRGYVYKCPEGFVYWSISRRCEAIRKVRDCKLSSYNWNNRYDVPVERNNIAY-