Monarch geneset OGS2.0

DPOGS206742
Transcript: DPOGS206742-TA (930 bp)
Protein: DPOGS206742-PA (309 aa)
Genomic position: DPSCF300316 - 99855-102100
RNAseq coverage: 39x (Rank: top 73%)
Annotation
Heliconius: HMEL011172; 4e-27; 62.73%
Bombyx: BGIBMGA009715-TA; 3e-15; 47.73%
Drosophila: (no entry)
EBI UniRef50: UniRef50_P08830; 2e-10; 43.82%; Chorion class CB protein M5H4 n=2 Tax=Bombyx mori RepID=CHCB1_BOMMO
NCBI RefSeq: NP_001112374.1; 3e-11; 43.82%; chorion class CB protein M5H4 precursor [Bombyx mori]
NCBI nr blastp: gi|45476805; 3e-10; 47.27%; chorion protein [Lymantria dispar]
NCBI nr blastx: gi|212536284; 1e-31; 32.51%; conserved hypothetical protein [Penicillium marneffei ATCC 18224]
Group: (no entry)
Gene Ontology:
  GO:0007304; 1e-18; chorion-containing eggshell formation
  GO:0005213; 1e-18; structural constituent of chorion
  GO:0007275; 1e-18; multicellular organismal development
  GO:0042600; 1e-18; chorion
KEGG pathway: (no entry)
InterPro domain: [2-90] IPR002635; 1e-18; Chorion protein
Orthology group: MCL10869; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206742-TA
ATGTCTCCTATCTCACCCACTGGAGTTTCTATGACGTCTGAGAATGCATACGAAGGACCGTTGTCCGTTGCTGGCACTATTCCATTCTTGGGTGCTGTGGCTTTAGAAGGAACTCTTCCGACTGGTGGTGCTGGTTCTGTCTCTTATGCCTGCGGTAATGGAAACGTTGCTATGATCAATGAAGACTATGCTGGTTACGGCGCGGGTCCTCTCGGTTCCGGTTACGGGTATAATGGATTTGGTCCCGCGGCTTATGAAGGTTATAACGGACTGTCTACCCCACTAGCTGTGGAAGCTGGTCGCGCTGGAGTAGGCTATGGTTATAACATTGTGATTGATCAAACATCAGGAGCATCGATATCATTCGGAGGAAGTTATCAAATCGGTCTTGAAGACGATTTATCTGAAACTGCTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCAGCATCAGAAGGGATTGCGGCATCAGCTGCGGTGGAATTTGCTGCGGATGTTGAAGATAATACTAACATCACGTCCGACAGCAATATTTATGCTGATTCAAATTATACAGAGATGTCAGATGATGTATTAGACATAATTACATTAGATGAATATCTGTCCGTGGACGAAACTTCGTCGGAAAATGTTAGTGATGTTGAACGTTTGTATTTTTATGTTTTAGAATGA

Protein sequence:

>DPOGS206742-PA
MSPISPTGVSMTSENAYEGPLSVAGTIPFLGAVALEGTLPTGGAGSVSYACGNGNVAMINEDYAGYGAGPLGSGYGYNGFGPAAYEGYNGLSTPLAVEAGRAGVGYGYNIVIDQTSGASISFGGSYQIGLEDDLSETAAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASEGIAASAAVEFAADVEDNTNITSDSNIYADSNYTEMSDDVLDIITLDEYLSVDETSSENVSDVERLYFYVLE-
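
The transcript and protein records above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the pipeline used to build this geneset: it assumes the standard genetic code (NCBI translation table 1), which this page does not state, and the helper name `translate` is hypothetical. The stop codon is written as `-` to match the protein record above.

```python
from itertools import product

# Standard genetic code (assumed; NCBI translation table 1), with codons
# enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Raises ValueError if the length is not a multiple of 3, and KeyError
    for codons containing anything other than A, C, G, or T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 24 and last 12 nucleotides of DPOGS206742-TA, copied from the record.
print(translate("ATGTCTCCTATCTCACCCACTGGA"))  # MSPISPTG
print(translate("GTTTTAGAATGA"))              # VLE-
```

Passing the full DPOGS206742-TA sequence to `translate` and comparing the result with DPOGS206742-PA gives an end-to-end consistency check of the two records.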