Monarch geneset OGS2.0

DPOGS213668
Transcript: DPOGS213668-TA (1353 bp)
Protein: DPOGS213668-PA (450 aa)
Genomic position: DPSCF300219 - 413515-417221
RNAseq coverage: 3x (Rank: top 91%)

Annotation (source: best hit, E-value, identity, description)
Heliconius: HMEL003107, 2e-17, 37.55%
Bombyx: BGIBMGA008610-TA, 2e-24, 44.04%
Drosophila: (none listed)
EBI UniRef50: (none listed)
NCBI RefSeq: XP_001844992.1, 3e-19, 47.14%, conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp: gi|170034259, 5e-18, 47.14%, conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastx: gi|307211171, 3e-74, 52.32%, hypothetical protein EAI_17577 [Harpegnathos saltator]

Group: (none listed)
KEGG pathway: (none listed)
InterPro domain: [216-237] IPR002395, 5.9e-09, HMW kininogen
Orthology group: MCL27839, Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213668-TA
ATGAGGGGGTTGTTGGTGTTAGGAGTGATATGTTTAGTGGCGTTGTGTACAGCCCACGAGGCTGAGGCCGACGACCAGGCTGTGGCGGCCAGCAAACATGAGCATGGTGGCGGCCACGAACACCATGCCCACCATCATCATGAACATGGCGGCAAAGGCCACAAAGGACACAAGGGACACCACCACCATCACAAGGGTGACGAAGGTCATCACGGCAAGCATCATCATGAAGGTCATCATCATGAACATGGCGGCGGTCATAAGAAACATTGGGATGAACACGATCATCACGGTGAACACCACGAGAAAGGCCACCACCACAAGGGAGAGAAACATGGACACCACGAACACCACGATAAGGGCGAAAAGACCGACGGATACCACAAGAAATATCACAAAGACCACTTCCATAAGGACCACCACTTCCACGATGGACACCATCTCGAAGGCAAACACCACAAACACGGACATCACCACAAACACCACGAAGATCATGGCGGTCACCACAAGAAGGGAGGTCATCATCATTCCGGTCACCATGAAGACCATTATGGCAAACACGGCCACCATGACAAACACCACTACGATGAGGATCACCATGGACACAAGGGTCATCACGGACATGATGAACACCATCATCACCACCATGACCATGGCAAGAAAGGAGGCCACGAAGACCACAAGCACTGGGCCCACGAGGCTGAGGCCGACGACCAGGCTGTGGCGGCCAGCAAACATGAGCATGGTGGCGGCCACGAACACCATGCCCACCATCATCATGAACATGGTGGCAAAGGCGATAAAGGGCACAAGGGACACCACCACCATCACAAGGGTGACGAAGGTCATCACGGCAAGCATCATCATGAAGGTCATCATCATGAACACGGCGGCGGTCATAAGAAACATTGGGATGAACACGATCATCACGGTGAACACCACGAGAAAGGCCACCACCACAAGGGTGAGAAACATGGACACCACGAACACCACGATAAGGGTGAAAAGACCGACGGATACCACAAGAAATATCACAAAGACCACTTCCATAAGGACCACCACTTCCACGATGGACACCATCTCGAAGGCAAACACCACAAACACGGACATCACCACAAACACCACGAAGATCATGGCGGTCACCACAAGAAGGGAGGTCATCATCATTCCGGTCACCATGAAGACCATTATGGCAAACACGGCCACCATGACAAACACCACTACGATGAGGATCACCATGGACACAAGGGACATCACGGACATGATGAACACCATCATCACCACCATGACCATGGCAAGAAGGGAGGCCACGAAGACCACAAGCACTGGGGTTTCCATCATGGAAAGCATTAA

Protein sequence:

>DPOGS213668-PA
MRGLLVLGVICLVALCTAHEAEADDQAVAASKHEHGGGHEHHAHHHHEHGGKGHKGHKGHHHHHKGDEGHHGKHHHEGHHHEHGGGHKKHWDEHDHHGEHHEKGHHHKGEKHGHHEHHDKGEKTDGYHKKYHKDHFHKDHHFHDGHHLEGKHHKHGHHHKHHEDHGGHHKKGGHHHSGHHEDHYGKHGHHDKHHYDEDHHGHKGHHGHDEHHHHHHDHGKKGGHEDHKHWAHEAEADDQAVAASKHEHGGGHEHHAHHHHEHGGKGDKGHKGHHHHHKGDEGHHGKHHHEGHHHEHGGGHKKHWDEHDHHGEHHEKGHHHKGEKHGHHEHHDKGEKTDGYHKKYHKDHFHKDHHFHDGHHLEGKHHKHGHHHKHHEDHGGHHKKGGHHHSGHHEDHYGKHGHHDKHHYDEDHHGHKGHHGHDEHHHHHHDHGKKGGHEDHKHWGFHHGKH-
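
The transcript and protein lengths reported in this record (1353 bp and 450 aa) can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the transcript record is a complete coding sequence (it begins with ATG and ends with the stop codon TAA), and the helper names `expected_protein_length` and `residue_count` are hypothetical, not part of any MonarchBase tooling.

```python
def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon.

    Assumption: the sequence is coding from the first base to the last,
    with no untranslated regions.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


def residue_count(protein: str) -> int:
    """Length of a protein string, ignoring a trailing '-' or '*' stop marker."""
    return len(protein.strip().rstrip("-*"))


# 1353 bp is 451 codons; dropping the stop codon leaves 450 residues.
print(expected_protein_length(1353))  # 450
```

Passing the full DPOGS213668-PA sequence to `residue_count` and comparing the result with `expected_protein_length(1353)` gives a quick consistency check between the two FASTA records.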