Monarch geneset OGS2.0

DPOGS208061
TranscriptDPOGS208061-TA1362 bp
ProteinDPOGS208061-PA453 aa
Genomic positionDPSCF300203 + 468261-473103
RNAseq coverage34x (Rank: top 74%)
Annotation
HeliconiusHMEL0121293e-7274.23% 
BombyxBGIBMGA001486-TA1e-6165.26% 
DrosophilaCpr30F-PA1e-2350.85% 
EBI UniRef50UniRef50_B2DBI63e-5664.86%Cuticular protein CPR76a n=1 Tax=Papilio xuthus RepID=B2DBI6_9NEOP
NCBI RefSeqNP_001166684.14e-6165.98%cuticular protein RR-2 motif 76 [Bombyx mori]
NCBI nr blastpgi|2905606328e-6065.98%cuticular protein RR-2 motif 76 precursor [Bombyx mori]
NCBI nr blastxgi|2244951147e-9869.50%insect intestinal mucin 2 [Mamestra configurata]
Group
Gene OntologyGO:00080611e-17chitin binding
GO:00060301e-17chitin metabolic process
GO:00055761e-17extracellular region
GO:00423021.5e-13structural constituent of cuticle
KEGG pathway 
InterPro domain[31-97] IPR0025571e-17Chitin binding domain
[314-366] IPR0006181.5e-13Insect cuticle protein
Orthology groupMCL19792 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208061-TA
ATGCTTCAAACAGCCTTTTCCGACCAATATATATCAGGAATCTTCATAACACTTCTGTTGCTGACTGCAGCAACAGCAGAACAAATATTCGACACTGAGTGCCCATCGGAACAAGAACAGGACTGGTCTATCGAGAAGTTGTTGCGTCATGATGATTGTGAAAAGTTTTACAAATGTACCTTTGGAAAACCAGTAGAAATGTCCTGTCCAGCTGGTCTCTGGTTCAACCTCGACTTATGGCAGTGTGATTGGCCAGCTAATGTAGACTGTACCGGCAGGAATGAACCTACTCTTCCTCCACCGGAAACCACATCTCCTATACCAACTACACCATCGCCCACAACACCTTCCACTCCTGCACCAACGACAACAACAACTACCACTACTACTACTACTACGACGCCTGCACCTACAACCACTACTACAACTACCACAACGCCGGCACCTACAACCACGACCACAACAACAACGACCCCTGCGCCCACAACCACAACCACCACAACTACACCTGCGCCCACAACCACGACAACTACAACGACACCAGCACCCACAACTACTATAACCACAACAACTCGCAAGTGCAGACCGAGAACTACGACAACACCTGTACCGACATCAACAACAACAGTGGCACCGACTGAACCAGATTTTTTGGAAAACGGTTGCCCAGTAAATCCACACATACATTGGCTGCTACCTCATGAGTCGGATTGTAATTTGTTCTACTACTGCGTTTGGGGAAGATTAGTGCTACGGCAATGTCCTGCAACTCTACACTTCAACAGAGTTATACAGAAAGCACAGTCACAGTCCGACACGCAATACATTACAATGGCCGCTAAGTTGTTCGTCGTCCTCGCTCTGGCCGCAGTCGCCTCAGCCCTCCCCGTGGTGCCGGTCGCCAAATACGCGTACGCCGAGCCCGAAGCACCCGCCCACTACGAGTTCCAGTACTCTGTGCACGACGAGCACAGCGGGGACGTGAAGCAGCAGCAGGAGTCCCGTGAGGGAGACGTCGTCCACGGATCATACTCGCTGGTGCAACCTGACGGAGTCCACCGCATTGTAGAGTACAGCTCTGACGCACACAACGGTTTCAACGCCAACGTACGTTACGAAGGACAACCCATCCAGGCCCCAGTCCCCGCTAAGATCGCCTATGCTGCTCCCGTCGCCAAGCTCGTCCACGCCGCCCCTGTCGCCAGAGTAGCGTACTCCGCCCCTATCTCCTACGCCGCCCCCGTCGCTAAAGTAGCATACGCTGCTCCCGTAGCTAAGGTAGCTTACTCAGCTCCCATCTCCTACGCTGCTCCTCTCGCCCACGTCTCATACTCTTCTCCCGCCATCTCCTACCACCACTAA

Protein sequence:

>DPOGS208061-PA
MLQTAFSDQYISGIFITLLLLTAATAEQIFDTECPSEQEQDWSIEKLLRHDDCEKFYKCTFGKPVEMSCPAGLWFNLDLWQCDWPANVDCTGRNEPTLPPPETTSPIPTTPSPTTPSTPAPTTTTTTTTTTTTTPAPTTTTTTTTTPAPTTTTTTTTTPAPTTTTTTTTPAPTTTTTTTTPAPTTTITTTTRKCRPRTTTTPVPTSTTTVAPTEPDFLENGCPVNPHIHWLLPHESDCNLFYYCVWGRLVLRQCPATLHFNRVIQKAQSQSDTQYITMAAKLFVVLALAAVASALPVVPVAKYAYAEPEAPAHYEFQYSVHDEHSGDVKQQQESREGDVVHGSYSLVQPDGVHRIVEYSSDAHNGFNANVRYEGQPIQAPVPAKIAYAAPVAKLVHAAPVARVAYSAPISYAAPVAKVAYAAPVAKVAYSAPISYAAPLAHVSYSSPAISYHH-