Monarch geneset OGS2.0

DPOGS201056
TranscriptDPOGS201056-TA909 bp
ProteinDPOGS201056-PA302 aa
Genomic positionDPSCF300497 - 4385-5456
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0112053e-2142.56% 
BombyxBGIBMGA003248-TA3e-1552.50% 
Drosophila% 
EBI UniRef50UniRef50_P089301e-1244.95%Chorion class CB protein PC404 (Fragment) n=2 Tax=Antheraea polyphemus RepID=CHCB2_ANTPO
NCBI RefSeqXP_002163775.13e-1442.71%PREDICTED: similar to DNA-directed RNA polymerase (ISS), partial [Hydra magnipapillata]
NCBI nr blastpgi|2211265185e-1342.71%PREDICTED: similar to DNA-directed RNA polymerase (ISS), partial [Hydra magnipapillata]
NCBI nr blastxgi|1608975252e-2337.96%hypothetical protein Daci_2082 [Delftia acidovorans SPH-1]
Group
Gene OntologyGO:00073047.8e-30chorion-containing eggshell formation
GO:00052137.8e-30structural constituent of chorion
GO:00072757.8e-30multicellular organismal development
GO:00426007.8e-30chorion
KEGG pathway 
InterPro domain[1-161] IPR0026357.8e-30Chorion protein
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201056-TA
ATGTCTTCGAAAATCATTGTTTTGTGTGTTCAAGCCTTTTTCATACAGAATATTTTCGGACAATGCATCGCTAACGTCGGCAGTAACTACAACTTAGGTAATTGCGATGTCCTCGCTGCGAGAAGGAGCTACGATCTACCTAACTGTGGAAGTTCCAACGCCCAATGGGCCGGGGCTCAGTTGGGTTTCGTTGAGGGTCTGACTGCTAGTTCTGGTGGTGGACTCAATGTTCAAACATCCTCTCCGTTTGCCCCTGGTAGTCTTTCCATACTCTCTGAAAATCAAATCCAAGGGCCTGTTGAAGTTAGCGGCACTTTACCATTCCTTAGCGCTGTGGCATTTGAAGGTTCACTACCAACACGAGGATCTGGAGAAGTGCTTTATCAATGTGGTAATGGGAGAGTTGGGATTTTGGAGGAAAATAACCAAATATCTGCAATTAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGGAATTTTAGGTGGAAATTCTGATTATTTGGGTATGGGATGGAATACAGCGTCTGCTTTTAATGGAATAAGCGGCTGTTTAACCCCTGAACCAGTCGCTCTCGGTTGGAATGGCAACCGCAGATCTGGTTGTAACTGTCTTTATTAG

Protein sequence:

>DPOGS201056-PA
MSSKIIVLCVQAFFIQNIFGQCIANVGSNYNLGNCDVLAARRSYDLPNCGSSNAQWAGAQLGFVEGLTASSGGGLNVQTSSPFAPGSLSILSENQIQGPVEVSGTLPFLSAVAFEGSLPTRGSGEVLYQCGNGRVGILEENNQISAINSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSGILGGNSDYLGMGWNTASAFNGISGCLTPEPVALGWNGNRRSGCNCLY-