Monarch geneset OGS2.0

DPOGS208060
TranscriptDPOGS208060-TA1665 bp
ProteinDPOGS208060-PA554 aa
Genomic positionDPSCF300203 + 456090-462215
RNAseq coverage13x (Rank: top 83%)
Annotation
HeliconiusHMEL0121265e-12358.82% 
BombyxBGIBMGA001486-TA5e-6164.29% 
DrosophilaCpr30F-PA4e-2450.41% 
EBI UniRef50UniRef50_B2DBI61e-5263.74%Cuticular protein CPR76a n=1 Tax=Papilio xuthus RepID=B2DBI6_9NEOP
NCBI RefSeqNP_001166684.14e-5764.92%cuticular protein RR-2 motif 76 [Bombyx mori]
NCBI nr blastpgi|2905606328e-5664.92%cuticular protein RR-2 motif 76 precursor [Bombyx mori]
NCBI nr blastxgi|1892343851e-7737.89%PREDICTED: similar to Ccp84Ad CG2341-PA [Tribolium castaneum]
Group
Gene OntologyGO:00423021.9e-13structural constituent of cuticle
KEGG pathway 
InterPro domain[37-89] IPR0006181.9e-13Insect cuticle protein
Orthology groupMCL19113 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208060-TA
ATGGCAGCCAAGTTGTTCGTCGTCCTCGCTCTGGCCGCAGTCGCCTCAGCCCTCCCCGTGGTGCCGGTCGCCAAATACGCGTACGCCGAGCCCGAAGCACCCGCCCACTACGAGTTCCAGTACTCTGTGCACGACGAGCACAGCGGGGACGTGAAGCAGCAGCAGGAGTCCCGTGAGGGAGACGTCGTCCACGGATCATACTCGCTGGTGCAACCTGACGGAGTCCACCGCATTGTAGAGTACAGCTCTGACGCACACAACGGTTTCAACGCCAACGTACGTTACGAAGGACAACCCATCCAGGCCCCAGTCCCCGCTAAGATCGCCTATGCTGCTCCCGTCGCCAAGCTCGTCCACGCCGCCCCTGTCGCCAGAGTAGCGTACTCCGCCCCTATCTCCTACGCCGCCCCCGTCGCTAAAGTAGCATACGCTGCTCCCGTAGCTAAGGTAGCTTACTCAGCTCCCATCTCCTACGCTGCTCCTCTCGCCCACAAAGCACAGTCACAGTCCGACTCGCAATACATTACAATGGCCGCTAAGTTGTTCGTCGTCCTCGCTCTGGCCGCAGTCGCCTCAGCCCTCCCCGTGGTGCCAGTCGCCAAATACGCGTACGCCGAGCCCGAAGCACCCGCCCACTACGAGTTCCAGTACTCTGTGCACGACGAGCACAGCGGGGACGTGAAGCAGCAGCAGGAGTCCCGTGAGGGAGACGTCGTCCACGGATCATACTCGCTGGTGCAACCTGACGGAGTCCACCGCATTGTAGAGTACAGCTCTGACGCACACAACGGTTTCAACGCCAACGTACGTTACGAAGGACAACCCATCCAGGCCCCAGTCCCCGCTAAGATCGCCTATGCTGCTCCCGTCGCCAAGCTCGTCCACGCCGCCCCTGTCGCCAGAGTAGCGTACTCCGCCCCTATCTCCTACGCCGCCCCCGTCGCTAAAGTAGCATACGCTGCTCCCGTAGCTAAGCTCAACCTCTTCTTGACTTCGGAGATAAAGGACCTTGACAGAAGACAGCCTTATCAAATTGAATTACAAAGTTACAATCCTTACGAAGAATTTGAGTCACTTATATCCGATGGTCCCAAAGACATTGAAAACTGTTACATAAATACTGATAATAATAGTACTATAGTACAATTGTTCGTCGTCCTCGCTCTGGCCGCAGTCGCCTCAGCCCTCCCCGTGGTGCCAGTCGCCAAATACGCGTACGCCGAGCCCGAAGCGCCCGCCCACTACGAGTTCCAGTACTCTGTGCACGACGAGCACAGCGGGGACGTGAAGCAGCAGCAGGAGTCCCGTGAGGGAGACGTCGTCCACGGATCATACTCGCTGGTGCAACCTGACGGAGTCCACCGCATTGTAGAGTACAGCTCTGACGCACACAACGGTTTCAACGCCAACGTACGTTACGAAGGACAACCCATCCAGGCCCCAGTCCCCGCTAAGATCGCCTATGCTGCTCCCGTCGCCAAGCTCGTCCACGCCGCCCCTGTCGCCAGAGTAGCGTACTCCGCCCCTATCTCCTACGCCGCCCCCGTCGCTAAAGTAGCATACGCTGCTCCCGTAGCTAAGGTCGCTTACTCAGCTCCCATCTCCTACGCTGCTCCTCTTGCCCACGTCTCATATTCTTCTCCCGCCATCTCATACCACCACTAA

Protein sequence:

>DPOGS208060-PA
MAAKLFVVLALAAVASALPVVPVAKYAYAEPEAPAHYEFQYSVHDEHSGDVKQQQESREGDVVHGSYSLVQPDGVHRIVEYSSDAHNGFNANVRYEGQPIQAPVPAKIAYAAPVAKLVHAAPVARVAYSAPISYAAPVAKVAYAAPVAKVAYSAPISYAAPLAHKAQSQSDSQYITMAAKLFVVLALAAVASALPVVPVAKYAYAEPEAPAHYEFQYSVHDEHSGDVKQQQESREGDVVHGSYSLVQPDGVHRIVEYSSDAHNGFNANVRYEGQPIQAPVPAKIAYAAPVAKLVHAAPVARVAYSAPISYAAPVAKVAYAAPVAKLNLFLTSEIKDLDRRQPYQIELQSYNPYEEFESLISDGPKDIENCYINTDNNSTIVQLFVVLALAAVASALPVVPVAKYAYAEPEAPAHYEFQYSVHDEHSGDVKQQQESREGDVVHGSYSLVQPDGVHRIVEYSSDAHNGFNANVRYEGQPIQAPVPAKIAYAAPVAKLVHAAPVARVAYSAPISYAAPVAKVAYAAPVAKVAYSAPISYAAPLAHVSYSSPAISYHH-