Monarch geneset OGS2.0

DPOGS209208
TranscriptDPOGS209208-TA1452 bp
ProteinDPOGS209208-PA483 aa
Genomic positionDPSCF300061 + 866031-883198
RNAseq coverage505x (Rank: top 25%)
Annotation
HeliconiusHMEL0147741e-14859.34% 
BombyxBGIBMGA001328-TA3e-15462.47% 
DrosophilaCG14880-PB1e-4879.41% 
EBI UniRef50UniRef50_Q177A42e-4767.46%Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q177A4_AEDAE
NCBI RefSeqXP_001657575.13e-4867.46%hypothetical protein AaeL_AAEL006201 [Aedes aegypti]
NCBI nr blastpgi|1571125806e-4767.46%hypothetical protein AaeL_AAEL006201 [Aedes aegypti]
NCBI nr blastxgi|1949012742e-6032.73%GG20015 [Drosophila erecta]
Group
Gene OntologyGO:00080612.8e-09chitin binding
GO:00060302.8e-09chitin metabolic process
GO:00055762.8e-09extracellular region
KEGG pathway 
InterPro domain[20-85] IPR0025572.8e-09Chitin binding domain
Orthology groupMCL26289 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209208-TA
ATGAATCACGTGTCGCGTCTGTCTGTGGGCGACGAAAGACGGCAGACAGGTAATCGCGTTAGCAGTCCAGCGACGCGCTTCACTTGTCAAGGACGCGTTGCCGGCTACTACGCTGACGTCGAGACCGGCTGCCAGGTATACCACATGTGTGATGGTCTCGGCCGCCAGTTCAGCTACTCCTGCCCCAACACCACTCTGTTCCAACAACGCATGCTGGTGTGTGATCACTGGTACATGGTGAACTGCTCCAACTCAGAGAGCGACTATGACGCCAACCTTCTCATAGGTCAGAGGGACAAGCCGTTCGTATCTGATGAAGATATGCGTCTTCGAACTCCTCGCCCTGACATCCTCAACGTGCCTTCCAACAGCAACTATTATGATGGCCTCAAGGAGGCTGAGTCCAAGTTTTCAATACATCCCAGTAATAGTATAGTCGGTATAGCTGACTCCTTATCTGGCGAGGATAATGAATTGGATGACGGGAAGCCGAGCTACAGACCGCCGAGCTCGTGGTCTATGAAGTACTTTAAAACGACTCCCGTTAAAAATATAGACAGACAGACAAATACAGATCAATATAATAGTTTTGATAGCAAGAAGGAAGCGGACCAACCGCGACCGATCAAAGCCAAAGTAAACATTTCTCCACCTTCACAAGACCTCAGCCCGCCTCTACCCCCAAAAGATCAATCAACCACAGCACAACCAGCCAAGGACCCCGTGTATAATTTCATCAAGCGCTTCGATCCAAACTCTCCCGACTCATTGAAGACATCCATGACTCAATCCGAGATAATAAACCTCAACAAACATCTACCCGAGGGTCAGGTCAGCTCGGAGGAGGAGAGGACTCCGCGGAAATATAAAAACTTCGGTAACAACATTAACGTTCTCGACGGAGACAAAAAGAAAGGTTCCAGCGATGGGACGAGGTTCGACACCGTGAACATCAAAACGAAGCCAGAAGTCTCCGAAAACGCTACATCTAAAGTTCCAGTACCATCACAAGTATTATTACCGCCCAAGAGAGAATTCAACCCACCAGCATCTACGACGATGGGTCCACCCATCTACTACGAATGGAAATGGGCAGTTCCGGCCTTTGAGTTAGCTCCCCCCAAACTGAATAACGAGACTAACATCACCAACGTCAAACCCGTCAAACCTATTGAGAGACCGTTCAGTGTTGTACCCAAATCTACACCCAGAGAGGTGGAAGTCACACCACGGAATACCGAGTACAATATAAGTTCATATTTCGTTCCCGACTACGTGTTCCCATTGGACGGACCCCATCCGGGATACGGCGACGATGACGCGCAGACCTCGTTCCAGGTGCAGGTCTCTAGACCGGGAAGGTCAAGTTACGGCGAAAACCCAGCGTGCCCTCAGTGTCATCCAGCTTATCTTGAACCCGGTACGTGCGAGCCCTGTGTCGTAAAACGGTAA

Protein sequence:

>DPOGS209208-PA
MNHVSRLSVGDERRQTGNRVSSPATRFTCQGRVAGYYADVETGCQVYHMCDGLGRQFSYSCPNTTLFQQRMLVCDHWYMVNCSNSESDYDANLLIGQRDKPFVSDEDMRLRTPRPDILNVPSNSNYYDGLKEAESKFSIHPSNSIVGIADSLSGEDNELDDGKPSYRPPSSWSMKYFKTTPVKNIDRQTNTDQYNSFDSKKEADQPRPIKAKVNISPPSQDLSPPLPPKDQSTTAQPAKDPVYNFIKRFDPNSPDSLKTSMTQSEIINLNKHLPEGQVSSEEERTPRKYKNFGNNINVLDGDKKKGSSDGTRFDTVNIKTKPEVSENATSKVPVPSQVLLPPKREFNPPASTTMGPPIYYEWKWAVPAFELAPPKLNNETNITNVKPVKPIERPFSVVPKSTPREVEVTPRNTEYNISSYFVPDYVFPLDGPHPGYGDDDAQTSFQVQVSRPGRSSYGENPACPQCHPAYLEPGTCEPCVVKR-