Monarch geneset OGS2.0

DPOGS215900
TranscriptDPOGS215900-TA1332 bp
ProteinDPOGS215900-PA443 aa
Genomic positionDPSCF300029 + 342369-374197
RNAseq coverage76x (Rank: top 65%)
Annotation
HeliconiusHMEL0219362e-6253.24% 
BombyxBGIBMGA000266-TA3e-5049.10% 
DrosophilaCG34461-PA3e-2453.39% 
EBI UniRef50UniRef50_Q9BPR78e-4849.10%Cuticle protein n=18 Tax=Obtectomera RepID=Q9BPR7_BOMMO
NCBI RefSeqNP_001166667.11e-4849.10%cuticular protein RR-2 motif 98 [Bombyx mori]
NCBI nr blastpgi|2905609052e-4749.10%cuticular protein RR-2 motif 98 precursor [Bombyx mori]
NCBI nr blastxgi|2236713104e-6657.25%TPA: putative cuticle protein [Bombyx mori]
Group
Gene OntologyGO:00423022.5e-15structural constituent of cuticle
KEGG pathway 
InterPro domain[367-419] IPR0006182.5e-15Insect cuticle protein
Orthology groupMCL18037 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215900-TA
ATGTCTACAGGAATACCTCTGTCTAAACCTATGCGTATATTATGTTTCGCTGCCCTCGTCGCCGTTGCTGCGGCACAATACGGCTACGGTCATGGGTCAGCGTACTCGTCTAATCACATCTCCCGACACGATGGACATCCACAACTGGTCCATGGACATCACGGTCACCATGATTACCATGTATTGTGCTTATTGGGACTGTTAGCTGTAGCAATGGCTAATTTAGTCCCCGAATACGAGAGGGCTGACATAATAAAAAAAGACGATCATCCAGTCGTCAATGATAGGGATTATCATAATGACTATTATCAGAAAAACTACCGCTATGGACATGAATACAACCACTATGACTACTACGTATTGTGCTTATTGGGATTGTTAGCTGTAGCAATGGCTAATTTAGTCCCCGAATACGAGAGGGCTGTCTCTTCACAATACATTATAAAAAAAGACGATCATCCAGTCATCATTGATAAGGATTATCATAATGACTATTATCACAACAACTACCGCTATCGACATGGATACAACCACTATGACTACTACGTTCTGTGCTTGAGTGCTGTTCTGGCAGTGGCCACAGCAGGCCTTATCGCTGAACCTCATTACTCCTCTGCTGCCGCAGTTTCTTCTCAAAGTATCGTTCGTCACGATCAACCTCATGCTGTAGCTGCCGCTCCAGTTGCAATCCACTCTGCTCCAGTTGCCATTCACTCTGCTCCTGTTGCCTACCATGCAGCACCAGTACACTATTCATCTGCCAGAGCTGTATCTTCTCAGTCGATCCAACGTCACGACCAACAGCGTGCTGCTATTGCTGTTGCACCCGTAGCCCACTACGCCGCTGCTCCAGTTGCTGTTCACTCTGCTCCTGTTGCTTACCACGTAGCACCAGTACACTATTCATCTGCCGGAGCTGTATCTTCTCAGTCCATCCAACGTCATGACCAACCCCGTGCTGCTATTGCAGTGGCTCCCGTAGCTCACTACTCAGCTGCTCCAGTCGCTCACTACGCAGCTGCCCCAGTAGCTCACTACTCTGCCCCTATCGCCCATGCTGCATATGCTGCCCACGAAGAAATCGACTCTCACCCTCAATACGACTTCTCTTACTCCGTACATGACGGACACACCGGCGACAACAAGTCACAGCACGAGAGCCGCGACGGTGACGCAGTGCACGGCGAGTACTCCCTGGTAGAGGCTGACGGATCTGTACGTACCGTTCAATACAGCGCTGATGATCACTCTGGCTTCAACGCCGTCGTCAGCCACTCAGCTCCGTCAGCTCACGCCGTACCAGCACCAGCTCATGTGCTTGCTCACCATTAA

Protein sequence:

>DPOGS215900-PA
MSTGIPLSKPMRILCFAALVAVAAAQYGYGHGSAYSSNHISRHDGHPQLVHGHHGHHDYHVLCLLGLLAVAMANLVPEYERADIIKKDDHPVVNDRDYHNDYYQKNYRYGHEYNHYDYYVLCLLGLLAVAMANLVPEYERAVSSQYIIKKDDHPVIIDKDYHNDYYHNNYRYRHGYNHYDYYVLCLSAVLAVATAGLIAEPHYSSAAAVSSQSIVRHDQPHAVAAAPVAIHSAPVAIHSAPVAYHAAPVHYSSARAVSSQSIQRHDQQRAAIAVAPVAHYAAAPVAVHSAPVAYHVAPVHYSSAGAVSSQSIQRHDQPRAAIAVAPVAHYSAAPVAHYAAAPVAHYSAPIAHAAYAAHEEIDSHPQYDFSYSVHDGHTGDNKSQHESRDGDAVHGEYSLVEADGSVRTVQYSADDHSGFNAVVSHSAPSAHAVPAPAHVLAHH-