Monarch geneset OGS2.0

DPOGS206758
TranscriptDPOGS206758-TA744 bp
ProteinDPOGS206758-PA247 aa
Genomic positionDPSCF300316 + 86251-87074
RNAseq coverage21x (Rank: top 79%)
Annotation
HeliconiusHMEL0111722e-4154.70% 
BombyxBGIBMGA009722-TA5e-2038.29% 
Drosophila% 
EBI UniRef50UniRef50_P088304e-1838.29%Chorion class CB protein M5H4 n=2 Tax=Bombyx mori RepID=CHCB1_BOMMO
NCBI RefSeqNP_001112374.19e-1938.29%chorion class CB protein M5H4 precursor [Bombyx mori]
NCBI nr blastpgi|454768056e-1838.68%chorion protein [Lymantria dispar]
NCBI nr blastxgi|1029437e-2740.95%chorion class C protein (clone pc404-H12) - polyphemus moth
Group
Gene OntologyGO:00073045.7e-27chorion-containing eggshell formation
GO:00052135.7e-27structural constituent of chorion
GO:00072755.7e-27multicellular organismal development
GO:00426005.7e-27chorion
KEGG pathway 
InterPro domain[3-190] IPR0026355.7e-27Chorion protein
Orthology groupMCL10869 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206758-TA
ATGTCCACGAAAACAGTTGTATTTGTCTGTTTACAGAGCTTATTTATACAGGGTCTCTTGGCTCAATGTATTGGAGCATACAACGGCCTGGCCGCTGGTGGTTGGCCCGCTTCTAATGCCGTGGCCTGGGAGAACGGCATGAACTGGCCGGGAAGTGCGTTGTCTTGGGAAGCTGGTGTTCCTTACGGAGCTGGACCTTGTGCTGCTTCATCACTCGGCGCTTCCTACTCTGCAGCATCTTTGGCCGCTGCTCCATTAGCCGCTGAATGGGGTGCTGGTTACTCGCCAGCAGGTCTCGCTGCCTCTAACGGTGGTGGACTCGCCATAAACAGTTACTCCCCCATCGCTCCTACTGGCGTTTCTATGAACTCTGAAAATATGTACGAAGGACCGTTGGCTGTTTCTGGTGCTGTTCCGTTCCTTGGCGCTGTGGCTTTAGAAGGAAATCTACCTACAGGAGGAGCAGGTGCTGTAGCGTACGGATGTGGAAATGGCAACGTAGCTATGTTGAGTGAAGACTATGCTGGCGCCGGTTTCGGAGCTGGTTTGGCTGGACCAGCTTATGGTTACAATGGTCTCGCTGGTCCACTCGCCCTGGAAGCCGGTAACCTTGGACCCGCTTATGGTTACAATGGGCTCGCTGGTTCACTCGCCCTGGAAGCCGGTAACCTTGGACCAGCTTATGGTTACAATAATTACAACAGCGCAAGACTTGGAGGTTGCGGATGTGGCACTCTCTACTAA

Protein sequence:

>DPOGS206758-PA
MSTKTVVFVCLQSLFIQGLLAQCIGAYNGLAAGGWPASNAVAWENGMNWPGSALSWEAGVPYGAGPCAASSLGASYSAASLAAAPLAAEWGAGYSPAGLAASNGGGLAINSYSPIAPTGVSMNSENMYEGPLAVSGAVPFLGAVALEGNLPTGGAGAVAYGCGNGNVAMLSEDYAGAGFGAGLAGPAYGYNGLAGPLALEAGNLGPAYGYNGLAGSLALEAGNLGPAYGYNNYNSARLGGCGCGTLY-