Monarch geneset OGS2.0

DPOGS215887
TranscriptDPOGS215887-TA1518 bp
ProteinDPOGS215887-PA505 aa
Genomic positionDPSCF300029 + 2217-9106
RNAseq coverage510x (Rank: top 25%)
Annotation
HeliconiusHMEL0023152e-8552.53% 
BombyxBGIBMGA000266-TA6e-8269.58% 
DrosophilaCG34461-PA1e-2452.99% 
EBI UniRef50UniRef50_Q9BPR73e-7969.58%Cuticle protein n=18 Tax=Obtectomera RepID=Q9BPR7_BOMMO
NCBI RefSeqNP_001166667.12e-8069.58%cuticular protein RR-2 motif 98 [Bombyx mori]
NCBI nr blastpgi|2905609055e-7969.58%cuticular protein RR-2 motif 98 precursor [Bombyx mori]
NCBI nr blastxgi|2905609052e-8567.04%cuticular protein RR-2 motif 98 precursor [Bombyx mori]
Group
Gene OntologyGO:00423026.4e-15structural constituent of cuticle
KEGG pathway 
InterPro domain[257-309] IPR0006186.4e-15Insect cuticle protein
Orthology groupMCL10531 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215887-TA
ATGCCCGTTTGTACAGAAGCGCGGTGGTTAGAGCAGCGAGGTGAGCGAGGATGGAGCGGGCTGCAGGCGCAAATATACTATACGCAAGATGCAAGGTCGTTCACTAAACAAGCTGCCGCCTTCGCCGCAGGCAATGTTACAATTACTATAAAACTATTGCTCAGCAGCAGATACATCATTCAGTACTCTGTTGCTAATACAACAATATTAAAAATGTTCTCTCGTTTGGTAGCTCTTTGCGCTGTTGTAGCGGTGTCGTCAGCTGGTCTCCTGCCAGCAGCCGTACACTACTCCCCAGCCTCCGCCGTCTCTTCTCAGAGCATCGTACGTCATGACCAGCCTCAAGCACATGTCGCTAAATTAGCCGTCGTAGCACCAGTTGCCTACCACGCTGCCCCTGCTCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGTCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCTCCTACCATGCTGCACCCGTCGCTAAACTTGTAGCTCAACCCGAAGAAATCGCCTACCCCAAATACGAGTACTCTTACTCCGTTGCTGACGGACACAGCGGAGACAACAAACAACAGCAAGAGTCCCGCGACGGTGATGTTGTGAAGGGATCCTACTCATTCCATGAAGCTGACGGCTCCATCAGGTCTGTGGAATACAGCGCTGACGACAAGAACGGTTTCAACGCAGTAGTACACAACTCCGCTCCCACCGCCGCGCCCGCCCTCATCAAGGCTGCCCCACTTGTACTCAAGGCTCCCGTCTACGCTTCCCCTCGGTCCGTACACTACTCCCCAGCCTCCGCCGTCTCTTCTCAGAGCATCGTACGTCATGACCAGCCTCAAGCACATGTCGCTAAATTAGCCGTCGTAGCACCAGTTGCCTACCACGCTGCCCCTGCTCCAGTCGCCTACCACGCTGCCCCTGCCCCAGTCTCCTACCACGCTGCCCCCGTCGCTAAACTTGTAGCTCAACCCGAAGAAATCGCCTACCCCAAATACGAGTACTCTTACTCCGTTGCTGACGGACACAGTGGAGACAACAAACAACAGCAGGAGTCCCGCGACGGTGATGTTGTGAAGGGATCCTACTCATTCCATGAAGCTGACGGCTCCATCAGGTCTGTGGAATACAGCGCTGACGACAAGAACGGTTTCAACGCAGTAGTACACAACTCCGCTCCCACCGCCGCGCCCGCCCTCATCAAGGCTGCCCCACTTGTACTCAAGGCTCCCGTCTACGCTGCCCCTGTACCCCACTACTACCATCACTAA

Protein sequence:

>DPOGS215887-PA
MPVCTEARWLEQRGERGWSGLQAQIYYTQDARSFTKQAAAFAAGNVTITIKLLLSSRYIIQYSVANTTILKMFSRLVALCAVVAVSSAGLLPAAVHYSPASAVSSQSIVRHDQPQAHVAKLAVVAPVAYHAAPAPVAYHAAPAPVVYHAAPAPVAYHAAPAPVAYHAAPAPVAYHAAPAPVAYHAAPAPVAYHAAPAPVAYHAAPAPVAYHAAPAPVAYHAAPAPVAYHAAPAPVSYHAAPVAKLVAQPEEIAYPKYEYSYSVADGHSGDNKQQQESRDGDVVKGSYSFHEADGSIRSVEYSADDKNGFNAVVHNSAPTAAPALIKAAPLVLKAPVYASPRSVHYSPASAVSSQSIVRHDQPQAHVAKLAVVAPVAYHAAPAPVAYHAAPAPVSYHAAPVAKLVAQPEEIAYPKYEYSYSVADGHSGDNKQQQESRDGDVVKGSYSFHEADGSIRSVEYSADDKNGFNAVVHNSAPTAAPALIKAAPLVLKAPVYAAPVPHYYHH-