Monarch geneset OGS2.0

DPOGS208902
TranscriptDPOGS208902-TA1320 bp
ProteinDPOGS208902-PA439 aa
Genomic positionDPSCF300009 - 639541-641188
RNAseq coverage21x (Rank: top 79%)
Annotation
HeliconiusHMEL0146780.086.46% 
BombyxBGIBMGA002476-TA0.078.78% 
DrosophilaCG6123-PA4e-6051.37% 
EBI UniRef50UniRef50_D6WB521e-9449.58%Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6WB52_TRICA
NCBI RefSeqXP_001605223.13e-8145.42%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|2700024914e-9449.58%hypothetical protein TcasGA2_TC004561 [Tribolium castaneum]
NCBI nr blastxgi|2700024912e-9749.58%hypothetical protein TcasGA2_TC004561 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL15656 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208902-TA
ATGATGCGGCGTGGTTCTATGGCGATCCCCGGCGGGGGCGAGCCACTTATAGTTGTTGAAGAAAGTGGGGCTGAAGAAGAGGCCTGCTATCGGCCTCCCGCGCCTTTCCGAGGCTTTTCTGAGGATCAAGAGTCGCCGCCAGATCCCTACCACTTGTCACCATGGCGAGACAACCGCAAGCATTCCCTTCCAACACCAGCTTGCACATCTGGACCCACTGCTAGCCAGGTACGTCGCTTATCAGAGCGAGGAGAAGGCGCAGCTCGTGAAGCTAGAGAAGCAGCCTTTTTGGCTACTTTGAGCCAGGCTCCACCACAGACGGGCGGTCGAAGGCATTCCGTAGTTACAATATCAAGAGTGCCTCAGGCACTCTTTGGACGAGGCCGACGAGAATCTATTGCGGCTTTCCCCGCCCTGGGCCATCGCCGGGACTCCGGAGCTGGAATTAAAAAATGTCCACCAGCCACTGATGCGCTGGGTAGTACTCATAATCTTCAATTGGACATAATGGATGATATCGTACAGGCACGCAAAGTGCGAATGCGTCTATGGAACACATCGAATGAAAAGGTTTGTGAAGTACAACCTTTAGATGAGCGATCTCCAATTGGCAGTTCGGTGAGATATACAAACCGAGGACGCAGACATTCAGATTTCGTTGGATCTCCACTACCACCAATTCCTTCCCGTCGTCGCGCTTCTGAAATGCCCCCTCCACCTCCGATACCTCCCCGTAGCGGTGCTGGTGTGGTTTGTACTGACACAGATCTCAAGCTGATGTTAAATGCTCTAACTTCCTCTGCTACGGAGATAGATAGATGTGGTAAACCTGATCGTAACAGACGTTTGGCGGACATGAGATCAAGCAGTTTCGATGCTTCTACGCTACGCGAAAAGCTTTCAGATTCAGGTACTGGATGGTTTGCAAGAAGGCATCAGACATTGGCTACCAAGAAGAAAGAAAACGAAGTTAAGAAACCTAAAGTGACGTTCGCCCCAGACTCCAAGTCGGCTCCAGGAGATGCTGCCGTTGTGTGGGATAAGCCAACGGGATCTGTTGTGGATGCAAGTGCTCTTGGCAGCGCCATAGAGGTTTTTCTAAGGAGTGGAAACACCGTCAATCCAGCTCCTTCGTCATCAGGAATATCAATACCTGTAGAAATAACAAAACCTGAAAATGAAGCTCGACCTACACCCAGTACGAGTCGAACTTCAGGACGCGAAAATGAGCGTTGGTTCTCAAACAGACCGGAGGAGGAGGAAACCGGAGAAGGTTGCGATGCGTCCCTTTGCACCTCTCTGAAGGACCTCTTCGTGTAG

Protein sequence:

>DPOGS208902-PA
MMRRGSMAIPGGGEPLIVVEESGAEEEACYRPPAPFRGFSEDQESPPDPYHLSPWRDNRKHSLPTPACTSGPTASQVRRLSERGEGAAREAREAAFLATLSQAPPQTGGRRHSVVTISRVPQALFGRGRRESIAAFPALGHRRDSGAGIKKCPPATDALGSTHNLQLDIMDDIVQARKVRMRLWNTSNEKVCEVQPLDERSPIGSSVRYTNRGRRHSDFVGSPLPPIPSRRRASEMPPPPPIPPRSGAGVVCTDTDLKLMLNALTSSATEIDRCGKPDRNRRLADMRSSSFDASTLREKLSDSGTGWFARRHQTLATKKKENEVKKPKVTFAPDSKSAPGDAAVVWDKPTGSVVDASALGSAIEVFLRSGNTVNPAPSSSGISIPVEITKPENEARPTPSTSRTSGRENERWFSNRPEEEETGEGCDASLCTSLKDLFV-