Monarch geneset OGS2.0

DPOGS215905
TranscriptDPOGS215905-TA1365 bp
ProteinDPOGS215905-PA454 aa
Genomic positionDPSCF300029 + 419035-423123
RNAseq coverage582x (Rank: top 22%)
Annotation
HeliconiusHMEL0079913e-9159.03% 
BombyxBGIBMGA000277-TA4e-6079.38% 
DrosophilaCpr62Bc-PA1e-2652.59% 
EBI UniRef50UniRef50_G6CZ374e-4667.76%Cuticular protein RR-2 motif 87 n=6 Tax=Obtectomera RepID=G6CZ37_DANPL
NCBI RefSeqNP_001166675.12e-5879.38%cuticular protein RR-2 motif 87 [Bombyx mori]
NCBI nr blastpgi|2905606263e-5779.38%cuticular protein RR-2 motif 87 precursor [Bombyx mori]
NCBI nr blastxgi|2905606265e-6479.38%cuticular protein RR-2 motif 87 precursor [Bombyx mori]
Group
Gene OntologyGO:00423024.4e-15structural constituent of cuticle
KEGG pathway 
InterPro domain[380-432] IPR0006184.4e-15Insect cuticle protein
Orthology groupMCL35075 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215905-TA
ATGACATCTTCAAGTTCTCTACTTTTCCTGTTGGTGACGATCTTGTATCAAGATATCACTTCTGGGGCATCATTTTCAAATGTTATAGTGCAAAGAGGATCTGAAGTACTTCCCATAGAATACATCAACTATAATCCATCTGCATTAACATACACGTCTCAAGTGGCTCGTTTATCTCAAGCACCATTACAATCAGTAATTTCTGAACCGCTAGTATCAAGTGTTTTATCTCCATGTGTGTCGCCTTGCGTCATTCCTAATGCAGCTCCCGTCCAATCCACGGTTAATGCTAAAATTGGTGCTTATAAGACATCAGCGGCGAATGTACCTGTCATATTGACAAATAAAGACGAATCAAGGGGATACCAATATGCTTACGCAGTTTTTGACGTAGACACCGGTGACAAGAAAACTCAAAGCGAACGCAGTGACGGGTCAGTAGTTCAAGGGCACTATTCGTTCATTCAACCTGACGGCTTCTTGCGCGAAGTCGTTTATACGGCGGACGATTTGAAAGGATTCAACGCTATAGTACGTAACATATCTCCTGAACCGGAACACAAACATACTTCTGAAACAGAAAAATCAGAAGAGAAACACATCATACCGCCATGCAAAGACAATAAAAATGAACATTTAACACATGCACATGAGAGAAATGAAGAAAGTCATCCAGCTCATGAACAAGAAAATCACGAAGCAAGTTCATCAGAGGAAATGCATGTAAAACAGGAACATTTAGAGGTAGAAAATTTGAATGAAAAAAAAAGTGAAGAAGCAAGTGCAGAAAAAAATGAAGAAAATGTTGATAAAAGCCATGAGCATAGTGCTGAAAAAAGCGAAAAAGATAGTGAGGAAAACTCGGGTGAAAAATCTGTCGAAAAATCAAACGAAGAAGTCCATCCAATTGTAGCATTTGGAGCTATGTTGGCTGCTGCCAACGCTGGTCTTCTTCATGGTCATGGACACGCTGTGTCCTCCCAAAGCATTGTCCGTCATGATGAAGGACACTACGCGGCGCCCATCGCATATGCTGCCCCAATCGCTCATGCTGCTCCAGTTGCCTACGCCCCAGTCGCTCACTACGCTGCACCCGCTGCCCATTACGATGGACATGATGAATATGCCCACCCCAAATACGACTTCGCATACTCAGTAGCAGACCCTCACACCGGTGATCACAAGTCACAGCATGAGAGCCGCGACGGTGACGCCGTCCATGGCTACTACTCCCTGGTACAGCCTGACGGCTCCGTACGTAAAGTGGAATACTCTGCTGATGACCACAATGGATTCAATGCCATCGTACACAACTCAGCTCCCTCTGTGCATGCCGCACCAGTGCCAGCCTACCATCACTACTAA

Protein sequence:

>DPOGS215905-PA
MTSSSSLLFLLVTILYQDITSGASFSNVIVQRGSEVLPIEYINYNPSALTYTSQVARLSQAPLQSVISEPLVSSVLSPCVSPCVIPNAAPVQSTVNAKIGAYKTSAANVPVILTNKDESRGYQYAYAVFDVDTGDKKTQSERSDGSVVQGHYSFIQPDGFLREVVYTADDLKGFNAIVRNISPEPEHKHTSETEKSEEKHIIPPCKDNKNEHLTHAHERNEESHPAHEQENHEASSSEEMHVKQEHLEVENLNEKKSEEASAEKNEENVDKSHEHSAEKSEKDSEENSGEKSVEKSNEEVHPIVAFGAMLAAANAGLLHGHGHAVSSQSIVRHDEGHYAAPIAYAAPIAHAAPVAYAPVAHYAAPAAHYDGHDEYAHPKYDFAYSVADPHTGDHKSQHESRDGDAVHGYYSLVQPDGSVRKVEYSADDHNGFNAIVHNSAPSVHAAPVPAYHHY-