Monarch geneset OGS2.0

DPOGS213778
TranscriptDPOGS213778-TA3378 bp
ProteinDPOGS213778-PA1125 aa
Genomic positionDPSCF300212 + 462780-466386
RNAseq coverage1967x (Rank: top 6%)
Annotation
HeliconiusHMEL0104020.074.57% 
BombyxBGIBMGA009263-TA0.068.88% 
DrosophilaCG13003-PB3e-5342.50% 
EBI UniRef50UniRef50_UPI00021A6D061e-10331.75%UPI00021A6D06 related cluster n=2 Tax=unknown RepID=UPI00021A6D06
NCBI RefSeqXP_001603237.12e-5940.13%PREDICTED: similar to RE03018p, partial [Nasonia vitripennis]
NCBI nr blastpgi|3407150105e-10331.75%PREDICTED: hypothetical protein LOC100646063 [Bombus terrestris]
NCBI nr blastxgi|3504172032e-12532.00%PREDICTED: hypothetical protein LOC100740269 [Bombus impatiens]
Group
KEGG pathway 
Orthology groupMCL18348 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213778-TA
ATGACAGTCGATTTTGTCGCAGACCTCGCGGCCTTCCTGCTTCGTGGTCATTACTTCTTAATTTCCAAATTAAACGGTTGGTTCTCAATGCCACCTAGATCGAAGATGGTGAGAGGGGCGACGTCAGTGCTGACCATAGCGCTGGCATGGGCGTGCGTGGTTCGTTCTAGCGTCCTTGTGCCCACCTACGTGTTGCCGCCTGACTCGTTCGTTCGTGACCAACGATCTCTCGACACACGATCTGACAGCGACACCTCCGAATCACAGCGTATGATAAAACGGCTGATCGGAGGGCATATGAAAATCAAAGCTGAAAAAGATCATGGCAACCCTAAAAACTCGATTAAACTGGACTCAAAGAGTGAAAGTTCTAAATTCGCTGGAGTAATCTTGCCGGACCTTTCAGGCGGCACCGTGGGTTGGAACCAAGGTATCTGGGATGACGGCATGCTCATGTCCAGCGCGGGAGCGGGGAAGGATACGGGAAAAATGAAAGAGGATGATGAGGACAAAAAAACCTTATCAGACCAAGTGGCTGAAGGAAAATACGGATTAATACAAAATGAAATATTCTCCAAGGCACCGAAACGTCCTGGTATATTAAGCTATGAGCCCAATTCCGAAACAAGAACTAAAGATAATGTGCAAAGTCTTGGCGGCCTTAAAAAGGAAGAGATATGGCTCGCTGAGGACCACCTCCTGGTTTTAAAAGGAGGCTCATTCCCAGAACGTTCCCAGGAAGCTGAAAATAACCCGTGGCCACCAATAGACAACTATGAGGCGCCCAGGAGGCAAGTAAAGCTACCACAGAAACCAAAAGTACCACCGCCCTTTCCGGTTCGACTCTCCGACGACGGACCGCTCGTCTTTCTCACACCTAACGGCAGTATCCCTGCGCCCATTTACCCACCCTTCCCTACAGGAGAAGGCGAAGGCCCTCTACCTCCTTCGTCATTCCTTATTCCAGACGATGCGCCCTACCCGGAAAGTGATTATCAAAAAAACACTTCTGCCGGGCCCTCTTCGTTGCCAAGTCCTCCGTTCCCATTCCTATCCGGGAATGCGAGCGAAGGAGCATTTCCCTACCCTCCATCCATAAACGGCTCTTTTCCAGAAGGCTTTCCTCCGGGCGCAGCATTTCTACCACCACCCAGCAACCAAACCGATCTATACGACGAAGACGACCCTTCGATATATTATCCGCCGCCATACAATTTTTCCTATCACGCTGATTACAAAAGCAATGTACCAGCTGGACCTCTGGTACCTGGGATAATTTTACCACCTCCGCCAGATTTCTTTGCGCCTCTTGAGGAAAAAACAACAACGGAGACGGACAAACCTTCAAGGCCATCTCCAACCCCGACGTACTCAAGAGCAAAGACTACTACTAAACAAATAGTAAATAGAGGCAAATATAAAACAAGACCAACAACAACTGAAAGTGCACAAACGAGTGAGCTTCCTACAACGACGTCAATGCCAACAGTTGAAGTTGTAACTTCCCCGAAATACAGAAAAGTCCAACGCTTGCCACTACCTCCAAGGCAATCTTTCAGATCGCAAAATCTTCCAAAGGAACCAGTCACTAAACCCAAGACTGATAAGCCAGTTTACAAAACACGAACAAAATTGACATCAAAACCCCTTTCTGTGTCAGTAATTTATGATTACCCACAACAGGCTTACGATAATAATCCTCCCACAACTACGGAGAAACCTTACATCTATTACGAAGTGCCACAAAAAACAAAAGAAGTCGCCAATGACATAACGTCAACAGCCGTACCATTACGAGCATATTATTCAAACCATCAAAACGACGAGGTCCCAACGACAAAGTTGCAACCAGTGTACAATCGGAAACCGAATGAAGATGCTATCGCTTCGTTTTACTTCTTTGACGAACAACCTAAAAGTTCACCCAGACCTGATAACTTTTATGATGGAAGAAATTACTACAAAACTGTACCCAGTCAAACTCCATACAATCCTCAACAAAATAGCAACTCTCAAACCGGATACAGACCCACAGTCGATGTTGAATACGGATCTATCGACCAGGAAGCCCTATTTTTATGGCCGCAGAAACAAGGACCGAAAACATTAACACAAGAATACTTCAGCATACAAAAACCAAGACAACAGGTTTATGTTCAACAGATTAAACAGAGACCTGATCCATTCTATCAACAAATAGCAGATATACAACAAACTATCGAGTTATATACAACTAAGAGACCGAAATCGCATAGAACACACACCAGCAAACCTCAACACACGAACCCCAGGCCGGTGTATCAATTTAGTTTTGAAACAAACCCTCGACCTGAGAAACTGACCTTCAGAGCACCTAAACTTGACCCCGAACCATTCAGACCGATGGTTAGTTACAGTAAACCATTCAATTTACAAAACGAGTTCAATGCCATCACACCGTCTGCCTCTCCTGTTTACCATCAGCAATATCTAGTAGAAAATGTTCAAGTTACAACTGAATCTCCCACTTCATCGAGATATTATCCTAAAACCAAGACAAGGGACGATGATTACGAAGATACTGTATATCAAAAGGACACAGTTCCACAGAACAACATAAACCAGATCCCAGTAAGAACGGGAAAACCTACGATATCAATAAATAAACATCCGAGCACAACACCAAATCCAATTAGTAATGGCTATTACACTAAACAGGATGAGAAGTATTTTGACGACATCACAAAAAATTCATTTGATGTCTTTGGTCAAAAACTGGAAGACACGCAAGACGTGAATGGAGTAGCCGTCACAGAACCGATTGGCACCGTGAAGACGCCGATCGATAACAACATAAACCAACAATACTACGAAGTTATTAACGCGAATCCAAACGCACCGACTCTGAGCAAGGACACTATCGTCAACGACCGATTTCCGCGACCGACGGTCAATCCTTACAGTGTACCGATTGACCATCGACCACACAACGAAGAACTGATCGAGCAACCGAAACCGATTTCTCTATACGGTGACACGTTGGTCAACGAGAAACTTCCACGACCCATGATAAATCCAGACAGCGAGTTCATACCAATACCGGATCCTAATTACAGGAAACCCCAGCAGTATAGACAACAGAGCCAGGTTCAGAGACCTCAGTATACTGGTGAACAGTATGACCTGAACGGTCCTTCGTTAGCTGGAGATACCGCCGTGAATTACAAGCGGCCCCTACCACCAGTCAATCCGGACTCGGAGTGGATAGGACCAGTGAATTCTGGCGAAGGTCGCCCCGGATCATACGTATCGTATCGTCTGCCAGGCGACGGCGCCCACGTTTACTTCCTAACACCCCAGACGGCACAAAGATACAGAAAACCGGGTTACGGTCGTTGA

Protein sequence:

>DPOGS213778-PA
MTVDFVADLAAFLLRGHYFLISKLNGWFSMPPRSKMVRGATSVLTIALAWACVVRSSVLVPTYVLPPDSFVRDQRSLDTRSDSDTSESQRMIKRLIGGHMKIKAEKDHGNPKNSIKLDSKSESSKFAGVILPDLSGGTVGWNQGIWDDGMLMSSAGAGKDTGKMKEDDEDKKTLSDQVAEGKYGLIQNEIFSKAPKRPGILSYEPNSETRTKDNVQSLGGLKKEEIWLAEDHLLVLKGGSFPERSQEAENNPWPPIDNYEAPRRQVKLPQKPKVPPPFPVRLSDDGPLVFLTPNGSIPAPIYPPFPTGEGEGPLPPSSFLIPDDAPYPESDYQKNTSAGPSSLPSPPFPFLSGNASEGAFPYPPSINGSFPEGFPPGAAFLPPPSNQTDLYDEDDPSIYYPPPYNFSYHADYKSNVPAGPLVPGIILPPPPDFFAPLEEKTTTETDKPSRPSPTPTYSRAKTTTKQIVNRGKYKTRPTTTESAQTSELPTTTSMPTVEVVTSPKYRKVQRLPLPPRQSFRSQNLPKEPVTKPKTDKPVYKTRTKLTSKPLSVSVIYDYPQQAYDNNPPTTTEKPYIYYEVPQKTKEVANDITSTAVPLRAYYSNHQNDEVPTTKLQPVYNRKPNEDAIASFYFFDEQPKSSPRPDNFYDGRNYYKTVPSQTPYNPQQNSNSQTGYRPTVDVEYGSIDQEALFLWPQKQGPKTLTQEYFSIQKPRQQVYVQQIKQRPDPFYQQIADIQQTIELYTTKRPKSHRTHTSKPQHTNPRPVYQFSFETNPRPEKLTFRAPKLDPEPFRPMVSYSKPFNLQNEFNAITPSASPVYHQQYLVENVQVTTESPTSSRYYPKTKTRDDDYEDTVYQKDTVPQNNINQIPVRTGKPTISINKHPSTTPNPISNGYYTKQDEKYFDDITKNSFDVFGQKLEDTQDVNGVAVTEPIGTVKTPIDNNINQQYYEVINANPNAPTLSKDTIVNDRFPRPTVNPYSVPIDHRPHNEELIEQPKPISLYGDTLVNEKLPRPMINPDSEFIPIPDPNYRKPQQYRQQSQVQRPQYTGEQYDLNGPSLAGDTAVNYKRPLPPVNPDSEWIGPVNSGEGRPGSYVSYRLPGDGAHVYFLTPQTAQRYRKPGYGR-