Monarch geneset OGS2.0

DPOGS201446
TranscriptDPOGS201446-TA1374 bp
ProteinDPOGS201446-PA457 aa
Genomic positionDPSCF300006 - 920089-924973
RNAseq coverage6317x (Rank: top 2%)
Annotation
HeliconiusHMEL0159710.084.50% 
BombyxBGIBMGA002604-TA0.074.18% 
DrosophilaCG10527-PA4e-7447.77% 
EBI UniRef50UniRef50_E2BC986e-13752.51%C3 and PZP-like alpha-2-macroglobulin domain-containing protein 8 n=4 Tax=Neoptera RepID=E2BC98_HARSA
NCBI RefSeqXP_001944506.11e-11446.59%PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum]
NCBI nr blastpgi|3287884133e-13953.30%PREDICTED: hypothetical protein LOC412543 isoform 1 [Apis mellifera]
NCBI nr blastxgi|3287884131e-14453.30%PREDICTED: hypothetical protein LOC412543 isoform 1 [Apis mellifera]
Group
KEGG pathwayspu:5941308e-07 
 K00457 (HPD, hppD)maps-> Tyrosine metabolism
    Phenylalanine metabolism
    Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain[388-458] IPR0066168.4e-31DM9 repeat
[36-136] IPR0220412.2e-29Farnesoic acid 0-methyl transferase
Orthology groupMCL16045 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201446-TA
ATGGCAAACATTCTGGATGTTGCGACGGAGGACAACCTTCAATACCATTTCTTCCCAATTGCCAGTGGCTCCGTCCAGTTCAAGATAAGATCACCCAATGACGCTCATATTGCCCTCACCATGGGCCCTCAGGAATCCGACCCGATGGTTGAGATATTCATCGGCGGTTGGGGCAACACAAAGAGTGTCATCAGACGGAACAGAACGAAACCAGAGAAGGCTGAAATTGACACACCCAACATTCTTAACGGTGGAGAGTTCCGTGGCTTTTGGGTGCGTTGGGACGGCGGCATTATTTCAGCGGGTCGCGAAGGAGAAGCAATTCCCTTCATTTCTTGGCAAGATCCGGAACCTTTCCCCATTGGTTTCGTCGGTGTCTGCACTGGCTGGGGCGCTAGTGGCACATGGAAGATTGAAGATGGAGCCGAGTTTAATACTCCGGACAAGTTAGAGTATAAATTCGGGCCGGTGGCGGCAGGATCTTTGGAATTCGAATATCGAGGACCTCACAATTGCCACGTCTGTTTGACATCGGCACCCGCAGAAATCGATCCAATGTACGAAGTTATCCTTGGTGGCTGGGAAAATACACAATCTGTGATCAGACACTGCAGACAAAAACCGGATAAGGTTACCGTTCCAACACCGAGTCTTATGAACGCCAATGAGTTTAGGAAGTTCATCTTCGAATGGCGCTGTGGCAGATTGACCGTCCGCGACGGAGGAACTGGTGCAATTCTCATGGAATGGGTGGATCCAACGCCTTTCCCTGTATTGCACTTCGGTGTCCGCACCGGTTATGGTGCAAGGGGCAATTGGCGTATTTCTCATTTCTACAAAGGGAACGCGCAGCCACTGCCTCCACCACCATCGGCTCCTTCCGCTGCTGCTTTATACAGCCCACCACCCGGCTACCCTGGCGTCGCTCCAGTACCTGGATCAGGTGCTTCTGGCGTGTGGGTGGACGCTACCTCAGGACTAGTGCCTCCAGGTGCGGTTGTAGGCGGACAAGATTGCTCCGGCGAAGCTTTATACGTAGCTAGAGCGCAACATGAAGGGGCACTACTCCCTGGAAAACTTGTAGGGTCTCACGGTTGCGCGTATGTTCCATGGGGTGGACAAGAACACGGAAAACCTGAATATCAGGTTTTGGTTGGAGGTCCTAACAACTGGATAGCAACCAGCGGTTCCAACATTCCCCCCGGAGCCCTACCGGGAGGACAATCCGAAGATGGGGAGACTCTGTACGTCGGCAGAGTGAATCACGAAGGCAGCATCACAACAGGAAAAGTTCAGCAGTCTCATGGCGTTTGCTACATTTCATTTGGAGGCCAAGAACTCGGCTTCCCTGACTACGAAGTCCTCGTCCAATAA

Protein sequence:

>DPOGS201446-PA
MANILDVATEDNLQYHFFPIASGSVQFKIRSPNDAHIALTMGPQESDPMVEIFIGGWGNTKSVIRRNRTKPEKAEIDTPNILNGGEFRGFWVRWDGGIISAGREGEAIPFISWQDPEPFPIGFVGVCTGWGASGTWKIEDGAEFNTPDKLEYKFGPVAAGSLEFEYRGPHNCHVCLTSAPAEIDPMYEVILGGWENTQSVIRHCRQKPDKVTVPTPSLMNANEFRKFIFEWRCGRLTVRDGGTGAILMEWVDPTPFPVLHFGVRTGYGARGNWRISHFYKGNAQPLPPPPSAPSAAALYSPPPGYPGVAPVPGSGASGVWVDATSGLVPPGAVVGGQDCSGEALYVARAQHEGALLPGKLVGSHGCAYVPWGGQEHGKPEYQVLVGGPNNWIATSGSNIPPGALPGGQSEDGETLYVGRVNHEGSITTGKVQQSHGVCYISFGGQELGFPDYEVLVQ-