Monarch geneset OGS2.0

DPOGS209249
TranscriptDPOGS209249-TA1725 bp
ProteinDPOGS209249-PA574 aa
Genomic positionDPSCF300111 - 386864-392314
RNAseq coverage186x (Rank: top 49%)
Annotation
HeliconiusHMEL0167352e-7967.08% 
BombyxBGIBMGA007065-TA5e-8362.46% 
Drosophilaect-PA2e-3133.95% 
EBI UniRef50UniRef50_F4W7162e-5848.65%Replicase polyprotein 1a n=3 Tax=Acromyrmex echinatior RepID=F4W716_ACREC
NCBI RefSeqXP_001947169.12e-5542.95%PREDICTED: similar to ectodermal CG6611-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|3838558586e-5848.83%PREDICTED: uncharacterized protein LOC100877435 [Megachile rotundata]
NCBI nr blastxgi|3454933053e-8735.29%PREDICTED: hypothetical protein LOC100121893 [Nasonia vitripennis]
Group
KEGG pathway 
Orthology groupMCL16964 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209249-TA
ATGAGGTTGTCGTTGTTGTTAGCGGCGGTATGCTGCTTAGTGTTGTCGGCTTATGCGAGCCCTATCACACGGCCTGATGATGAGCAAAAGGAGGTAGGCAGACAATCCGCTGCACCACCTGCTGCACAACCAGCGGCAACAGACGATGATGATGATGACGATGATGATGCGATGCGATCTCGACGACCCATTATCAGATTCTTCGATGATATCTTAGGAGGAGGAGAAGACGATGACGATGATGATGAATCTACAGGAGTACAATCTGTAGTCGCTAGTGCTGATACCTCCCCTGCCGCACCAGCTCCAGCCGCTCCTGTTGAAGAAGCTGCAGCTGTTTCTGAAGAAGCCGAAGTTTCAGAAGCTGGAGAAGCTGCTGTCGAGGGTGATGAACAAGAAGGTGCACAGAAAGAGCCAGCTAGTGCTGAGGTAGAAGCTCCCGTTGCTGCTAGCTCGGAAACTGTTGCTAACGCTGATGCCGATGATGATGAAGAGGAGGAGGAAGATATCGCTGATGCTTTTGATGGTGAAGAGGATGATGATGATGATGAAGAGGATGATGAAGATGAGGATGAAGATGATGATATCTCCGAAGTTCTCAGCAGGAAGAGTTCGAAACAAGCACGGGTCGACAGCGCGTACAAGGCCTGTCGGCCGCTTACTTTCTCATCCGAGACTGGTTCATTGATCCCACCCGATTTCTTTTACAACCTTGACACTTTCTTTGACCTTGTCCTGTACTACTCTGAAAATGACCTTGACGGACAATTTAACCACCGTAAAACGATAGAAAGTGCTTTAGAAGAAGAACCCCCAAAGAAGCCTGCACCAGTTCCAGCAATTTATATTGTGAAGTATAATAGGTTTGTGGATAGTATTCTTGAAAAAATGAATGATATCTTGAAGCGTTCTTATGACCCTGTCAACGTGAAACTCCAACCCATTGATGCTAATAAGAAAACCACAAAACCGAAGAAGAATAAGACAAAGACAAATAAAAGAACATCAAATAAAAAGAAGAACTCTGGTCGTGGTGGAATTACTAATAAAATGGCTGAAAATGTGACAAACCAAATAGAACAGAAAAATGAAATTGAAAACGCTGTTACTAAAGACGAAATCATTGCAAATGAAAATATAGAACAAATGGCCATCGAGTCAAGAGCTTCGAAAAATCCTAACAGTAAAGCAAAAGCAACCACTGCAAAACCAAAACCTAATAAGACTACGAAGCCAAAGGCGAAACCTAAACCACCACCAAAAACAACGACAAAGAAGCCAGCCATTAAAAACAAAGTGAAAACCAGCGAAAAGAGCAAACCCAGAGCCAAGGGGACACTCTACGGACTCTCGACTTTAAGAAGAAGTGGTGATGTTGCAGTTAGCATCATGTCTAACCACACAACTATTAAAAGTAATTTCGCCGTAGGACCGCTTATATTGAGAGTTGAAAAAGAGGTCGGACGGAAGAAGGAAATCAAATCGGCTACAGCAACAACAGCCGAAATGTTGGGAAAACTAACATTGAGAGTTAACAATCAAGGCGTTGCAACTTTACATTCCATCAAAGTCCTGCAACCTAAACAGGTTCGAGTGGAAAGCAATCATGAGAGGACGAGGGAATTAGTTTGGCAGCGTAGCGCCAGAATCGCGCATGTAGTGTCTGAAAAACTGAGATCGGCTTCCAAGCCAATGTTTCTTCACCAAAGCGTCGTGAGGCAGTGA

Protein sequence:

>DPOGS209249-PA
MRLSLLLAAVCCLVLSAYASPITRPDDEQKEVGRQSAAPPAAQPAATDDDDDDDDDAMRSRRPIIRFFDDILGGGEDDDDDDESTGVQSVVASADTSPAAPAPAAPVEEAAAVSEEAEVSEAGEAAVEGDEQEGAQKEPASAEVEAPVAASSETVANADADDDEEEEEDIADAFDGEEDDDDDEEDDEDEDEDDDISEVLSRKSSKQARVDSAYKACRPLTFSSETGSLIPPDFFYNLDTFFDLVLYYSENDLDGQFNHRKTIESALEEEPPKKPAPVPAIYIVKYNRFVDSILEKMNDILKRSYDPVNVKLQPIDANKKTTKPKKNKTKTNKRTSNKKKNSGRGGITNKMAENVTNQIEQKNEIENAVTKDEIIANENIEQMAIESRASKNPNSKAKATTAKPKPNKTTKPKAKPKPPPKTTTKKPAIKNKVKTSEKSKPRAKGTLYGLSTLRRSGDVAVSIMSNHTTIKSNFAVGPLILRVEKEVGRKKEIKSATATTAEMLGKLTLRVNNQGVATLHSIKVLQPKQVRVESNHERTRELVWQRSARIAHVVSEKLRSASKPMFLHQSVVRQ-