Monarch geneset OGS2.0

DPOGS215873
TranscriptDPOGS215873-TA1998 bp
ProteinDPOGS215873-PA665 aa
Genomic positionDPSCF300029 - 612344-618319
RNAseq coverage501x (Rank: top 25%)
Annotation
HeliconiusHMEL0048930.070.02% 
BombyxBGIBMGA000421-TA0.056.42% 
DrosophilaCG13643-PB7e-2345.22% 
EBI UniRef50UniRef50_Q171Z52e-3248.95%Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q171Z5_AEDAE
NCBI RefSeqXP_001652778.13e-3348.95%hypothetical protein AaeL_AAEL007483 [Aedes aegypti]
NCBI nr blastpgi|1571161715e-3248.95%hypothetical protein AaeL_AAEL007483 [Aedes aegypti]
NCBI nr blastxgi|1571161717e-3928.27%hypothetical protein AaeL_AAEL007483 [Aedes aegypti]
Group
Gene OntologyGO:00080612.4e-06chitin binding
GO:00060302.4e-06chitin metabolic process
GO:00055762.4e-06extracellular region
KEGG pathway 
InterPro domain[22-75] IPR0025572.4e-06Chitin binding domain
Orthology groupMCL26829 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215873-TA
ATGTCTCCCGTGTGGCTCGTGCTCGCATTACTCCCTATCATCGGTGGAGCTCAAAGGTTCAATTGTCGTGGAAGAATTCTCGGAGCGTACTATGCAGATGCTAAATCAGGATGCAAAGCGTTTCATGTTTGTGTGAGGGTTGCGGGCGGGGGTATCAGAGACTTTAGATTCTTCTGCCCCCCTGGTACACTGTTCCACCAGGAGGCCCAAACTTGTACCGACTGGGGTGATGACGACCCTCTAGCTTGTCCCGCAGACATCTACGATGGTTCGTTTGACCTGTACAAGATAGGATCTGGATTCGATACGAAAAAGATCTCGTCTTCTGGAAATCGTGAAGATGAAGCCGAATTCGGCCTACAGAGATCTGAAACCGGAGATCGCCGTCTATCTCAAAATGCTCCTAGTGGTGGATCTGACCTCAGAGCAGCGCATTCCTCTGATTTCTTCACAGGACAACGCGACCGCGGCCGTGACGAAGTGGTTGCTCAGACAAAGGCACCGGCATCTATTGCAGTCACTCGACAGTCATTTAGAAGGATCACAACAACCCGCCCTCCTTATACAAGCACTCTTTTCTACCAAACGTCACCGTCTCCTACAACCTTGCCTCCCCAACCGTCGTCTCAACCCCAATACGAGATATCCAAGCGTAAATTTTTAAGGAAGCGTCCTGTTTATACCTCAACAACTTTACCAACAACTGTACAAAACACTTACTCTCCGCAAACTTACACTGAGCCTCAACAAAATTTTAATCGTAAAGTTCCACAACAGAACAAAATATTTCCAACAACAGTCAATATCCCTCCACAGTATAAGGACGAATACGTTGAAGTTAGCAAAGTAGTTCCAAAACAATATAATAATAGATTTTTCCCTAATAACCCTACTCCGACTCCTTTCGCTGAAACAACGCTTGCCCCACAGAATACTAAAAAAGATGGATTTGGCGATTCCTTTAATTATGATACACAGTCTACTCCGGCTAATCACAACGAGAATCGTCCATTCAAAGTAAGAAATAATTTTAATGTACAAAATGATGTATCGAATGAACAGGATTTCGTCCGTATAAGAAACTACAACGGTAACAACAATAGCAACAGAGCTCCCGCTACTTCAACAGTAGATTATAATTCAGCTCGTAGCACCAGTACACCAGTTTATAAAAATGTCAACAGCCTTTCTTATGAAACGGAAAAGAAAAATTTCGCTCCATTTCTTGGCGCTAAGCAGACCTTTTATAATAATCCCACCACCACCCCTTCTACTACCACTATTTATACGACTACCCGCGCTGATCTTCCACCCAACATAAACACCGTAGCTTACAATACAAACATAGGATTCAATACGCAATCCTCGAACTACGCGGATAGCGGTGAAGACGATGGACAATATCGACCACCACAAGGTGAAGACGATGGTCAGTATAGACCAGAGCTATACGAGAGAGAATCGGAGCTGCTGTCTGGTGCACACTCGCTAAACATCGCAGCAAGTGGTAACAGGCTTCCGGAGGACCAAAAAGCCAGAAAAACTTCAAAAGCAGTAGGAAAAACATCACCACCGAGGCCATTTAGACCCTCCCAGACCTCAACTCTCCCGCCCTCTGAGTACACTACTACATACAGGCCGCAATCTGAATCATCAACTCATAGGACGTTTGATTATTATCAACCATACACAACCACTTCCAGACCTTACGAGGCACCTTCAGCGATCGCATACACCTTCGCTCCCTCCAACCCACCAGTTAAAGTAACCACGACATTGGCTCCCAGAACAACTGAACGCTCACAGTCTAGAGAAACAGTTCCGCCAACAACATCTTCCTTACCATACTCTAAAAAATCCACTCGACCTCCCCACTATTCAAAAGCCGCAAACCACAGAGAGGACAATAGTTATGACTATGCCTATTACGATTCTGATCCCGGCTTCTCGGAATACGACCAAATAGAGGAGTTTGGTAGAACTAAGTCTAGACTTTAA

Protein sequence:

>DPOGS215873-PA
MSPVWLVLALLPIIGGAQRFNCRGRILGAYYADAKSGCKAFHVCVRVAGGGIRDFRFFCPPGTLFHQEAQTCTDWGDDDPLACPADIYDGSFDLYKIGSGFDTKKISSSGNREDEAEFGLQRSETGDRRLSQNAPSGGSDLRAAHSSDFFTGQRDRGRDEVVAQTKAPASIAVTRQSFRRITTTRPPYTSTLFYQTSPSPTTLPPQPSSQPQYEISKRKFLRKRPVYTSTTLPTTVQNTYSPQTYTEPQQNFNRKVPQQNKIFPTTVNIPPQYKDEYVEVSKVVPKQYNNRFFPNNPTPTPFAETTLAPQNTKKDGFGDSFNYDTQSTPANHNENRPFKVRNNFNVQNDVSNEQDFVRIRNYNGNNNSNRAPATSTVDYNSARSTSTPVYKNVNSLSYETEKKNFAPFLGAKQTFYNNPTTTPSTTTIYTTTRADLPPNINTVAYNTNIGFNTQSSNYADSGEDDGQYRPPQGEDDGQYRPELYERESELLSGAHSLNIAASGNRLPEDQKARKTSKAVGKTSPPRPFRPSQTSTLPPSEYTTTYRPQSESSTHRTFDYYQPYTTTSRPYEAPSAIAYTFAPSNPPVKVTTTLAPRTTERSQSRETVPPTTSSLPYSKKSTRPPHYSKAANHREDNSYDYAYYDSDPGFSEYDQIEEFGRTKSRL-