Monarch geneset OGS2.0

DPOGS213153
TranscriptDPOGS213153-TA3378 bp
ProteinDPOGS213153-PA1125 aa
Genomic positionDPSCF300016 + 1239188-1246786
RNAseq coverage147x (Rank: top 54%)
Annotation
HeliconiusHMEL0103210.088.71% 
BombyxBGIBMGA007911-TA0.077.70% 
DrosophilaCG42748-PG2e-8948.04% 
EBI UniRef50UniRef50_B4IIJ58e-13538.93%GM16161 n=9 Tax=Drosophila RepID=B4IIJ5_DROSE
NCBI RefSeqXP_002043555.12e-13538.93%GM16161 [Drosophila sechellia]
NCBI nr blastpgi|1953541333e-13438.93%GM16161 [Drosophila sechellia]
NCBI nr blastxgi|3287891447e-16642.74%PREDICTED: hypothetical protein LOC725508 [Apis mellifera]
Group
KEGG pathway 
Orthology groupMCL11549 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213153-TA
ATGGTTTCCATAGTGCTGTTGACGACAGAGGAGGTTTTACAAAGGAATGGCTTTTGGTACTGTCTTGATTCAATTTGGATCACGCAAGAATTGGAGCCCTTTCAAGAGGTGGAACCGGAACAGGTGTACGCTAAACCTATGCCGAGGGATATTTACTACGCCAGATACCCAGATATACCAAGAAGCGAATCCGAGGACGAGATTCCGTTAACCGTCAAAGGTTTTTGGAACAAACATGCAGAGTGGAAGTCTAGAATATGCATCAGAAACCTATGTTCCCAAAACAAAAACATGACACATTCAACTTTTAGTGCGAAATCTATTAATAACAATACCACTATTGACGTGAGCTGTAAAGCCGTGACTTCGAATGTTAAGAAAAAAAACCGTCCAGAAGCTTTGGATGATGATGAACGGATTCGTGATTTTTTTAACCGTCTGATAGAAAGTGTTCCGCCCCCGCCCGTACAGGATGTATCGGAATTTGCCAGCTCTATGTCTACTTCGCGATCAATTCAACACAATATAGAATGTATTGATCTACCCGACTACGCTGACTTTGAAAGCGATAGATTGCTTCAGCGTTTCGATGAGATCCCCGGTCCTTGTACTTCAACGTTAAAGAAAACTACGAACGGAACACATAACAGTTTCAGCTCTTCTGACTCTGATCAGGAATGGTACGAAACTATTTACGGAGTATCAGAATTGTTGGATCCTCAAGGATATCCGGGCAAGGGAAAACTTACATCGGATTCTGACTGTCTGATATCCTGGTCCGAGATACTCGAGATTTGCGGTCCAAGCAGCATTTACAGTTATTGTAACAGCACTGAAGAAACTGAAAATATAATAAGTCAGTCTCACCACATGATGAATAATTTATATGATATATCAGGTGATGACGACGAGGAAGCTTATGCCAAATTTTCTATTAATAAACTTCGACAAACAATAAACACGAAAGGAAAGATACAGGATTCGTCATCTATGGAAAATTTAAACTATGAGGAAAATGTCTTGCTAATGTCCATAGTTGAAATCAACCCCTCCTTGCATGAAACTTTCTACCGATTAGCTCCATCAGATAGTGACGAAGATTTGCCTCCGGAAAACGAATTGCAGAAGAATTCCGACCGTGTGCTCTCTGAGCACGAACTTCGGGTTCAAAGATCATTGCAGAAGCTCAATGTGCCAGAATGGTACAAAAACGCCCCAGCCCCACGCGAGGGCTTCCTTCTTCGGAAGCGGTTGTCCGACGCTTCATCAGCAGCTCGGTGGAGTGGTCTTAACTCGAAGACCACATCACTGGGCAGTCTCGGTGCTAACAACGCCCAGCCCCCGCCACCTCAGCTATCCCCTCACACAACCAGTTTCGGTAGATGGTCTACCAGTCGACTTAACTCGAATCAAACCTCCCCGTGTTCGTCTACCCGCAGCAGTGTCCGCGGCGCCAGTCCCCTGTGCTCGCCGTCTGCTCGTTCTTCTTTCAGCGCTCGTCAGCCCTACCTCGGCTGGAGGAGTCAGGAGAGACTCAACTCCACTCCACGAACGCCGCATGAGAGGCTAGCTTCTTCTCTTCTTCAGCAGTCCGCTTCGGCGAAAGCCGCAGAGGAGATACAGACGTCGATAAAGGAAGTGACGTCAGCGATCGTCCACTACGTATCAGGCTTAGAGCCGGCAAACGGTGACGTAGAGAGGCAACCCTCGCCGCGCTCCAGTCAGAAGTTGTACTGGCTCGAAAGCTCGTTTGTTGGTACAAAACCGCTGGAATCTCCTCAGACTCCGCTGGTGGTGTCGGAGTCCCTGCCCCCGGCCCACCCCCGGCCGCCCTCCTCCCTGCGCCTCGAGCACCGCGCGGCGCCCGACTGGCAGGATTCTCAGCGTTCTCTAAACCTGGGCGTAACTCGCCCGTCCCCGGGTTCAACGACCCTCGAAGACGTGCTGGATTCACTTCTGGGGCTTCCTTCTCAGCCCACCAGAGTTCCCACTCCTCAACCCAGCCCAAGGAAATCAGCGAGTCCATATTACTTGCTCAGCGGAAAGAGTCCTCGCTACGAGCAACTGGCTAGCAGCGGCCACAGCAGCATTTCGCCAGCTTCCCGTTTCCTCGGCACACCCAGTGAAGCCCCCAGCTCTCTCACCGACCCCAGCCCGGACCACGATCGACCAGGGAAGGATACAGTTGATACTCCTGCTATGCAGGAAACGAGGCGATCGCGGTCTCAGGGCGAGCACTCTCGCAGGAGGAGTGAGCCCTTTGCTCGTCCTAGTACATCCCTCGACCGCCGCACCAGTCTCGACGTTGCCGCCCTGCGCGAACAGAACCTCACCCATCACGACTCCAGAACCTCCCTCGCCTCACTCCAAACCGAAAACAGCACAGATGACACCTGCGTCAAGTGCAAGTATCCTAAATGTTCGTCCCGAGCCCCATTACCGGATGCCAAGAGGCATTACAAAACTTGCCACAACTGTACCACCATGTACTGTTCCAAAGAATGTAGACGAGCCCATTGGGAAAAGCACAGGAAGGTCTGTCTACACTCTCGTGCGAGCAGTTTGTGTAGACAAATTATATCGGCCGCGAAAGAGGATTCCGACTCTCTGCATCAAATCAGTACGATCGCTCATAAAGGATATCTGGCCCAAGGTAGAGGTGTTGTCAAAATCTTCTTCACGAGTCCAGAGGCGGCTGAGAAGTTCACCACGCACGGCTACCAGTACTTGAGCGAACCGGAATTCGTCAAATGGACTTCTCTGCAGCCAAATGAAATGGGTGCCGAACTATACACCGAGGTCGTAAAACTTTGCAAGGCTTACAATCCAGAAACGAGAGTAATTTTATACGTAGCGGTGTGTATCATTAGCGAAGTACCAACAAAAGGCGCCGTTAAATGGGAGAGACAGATGGTGTCCAGGTGTGCTAAACTTCGTTTAAGCAAAACGGTTTCATCGGCCATCCAAGAACAGAACAGAAAAAGAAATAGAAGAGACAGTAAGGGAACACCCGATGACCGCGAGACCTTGATACTGACTTCCAAATTAACGAACGCGGGGGAGAAGAACGCAGCGACAGCGCACAAGTTCAGGGAGATATGCTTCAGGAACATCCTGAACGAATTGGAGAGTCGCGGCGTGGTGATGAAGAAACATTTCCCCGAAGTGTACTCGCGGCTGGCGGCTTACGTGGACGGCACCAGCGACAGGTTCATACCTATGACGATATACCCGAAAGACGTCACCAGTGGACGGTCCTTCGTCTGCGTCATCATGCCGGACAACGACACGGAATGCGGAACTGCGATCGATAGCAAAGTGACTACAGTGGATGTTGGGGTGGATCGCTCCAAACACCAGCTCTCAACGCCGATGTAG

Protein sequence:

>DPOGS213153-PA
MVSIVLLTTEEVLQRNGFWYCLDSIWITQELEPFQEVEPEQVYAKPMPRDIYYARYPDIPRSESEDEIPLTVKGFWNKHAEWKSRICIRNLCSQNKNMTHSTFSAKSINNNTTIDVSCKAVTSNVKKKNRPEALDDDERIRDFFNRLIESVPPPPVQDVSEFASSMSTSRSIQHNIECIDLPDYADFESDRLLQRFDEIPGPCTSTLKKTTNGTHNSFSSSDSDQEWYETIYGVSELLDPQGYPGKGKLTSDSDCLISWSEILEICGPSSIYSYCNSTEETENIISQSHHMMNNLYDISGDDDEEAYAKFSINKLRQTINTKGKIQDSSSMENLNYEENVLLMSIVEINPSLHETFYRLAPSDSDEDLPPENELQKNSDRVLSEHELRVQRSLQKLNVPEWYKNAPAPREGFLLRKRLSDASSAARWSGLNSKTTSLGSLGANNAQPPPPQLSPHTTSFGRWSTSRLNSNQTSPCSSTRSSVRGASPLCSPSARSSFSARQPYLGWRSQERLNSTPRTPHERLASSLLQQSASAKAAEEIQTSIKEVTSAIVHYVSGLEPANGDVERQPSPRSSQKLYWLESSFVGTKPLESPQTPLVVSESLPPAHPRPPSSLRLEHRAAPDWQDSQRSLNLGVTRPSPGSTTLEDVLDSLLGLPSQPTRVPTPQPSPRKSASPYYLLSGKSPRYEQLASSGHSSISPASRFLGTPSEAPSSLTDPSPDHDRPGKDTVDTPAMQETRRSRSQGEHSRRRSEPFARPSTSLDRRTSLDVAALREQNLTHHDSRTSLASLQTENSTDDTCVKCKYPKCSSRAPLPDAKRHYKTCHNCTTMYCSKECRRAHWEKHRKVCLHSRASSLCRQIISAAKEDSDSLHQISTIAHKGYLAQGRGVVKIFFTSPEAAEKFTTHGYQYLSEPEFVKWTSLQPNEMGAELYTEVVKLCKAYNPETRVILYVAVCIISEVPTKGAVKWERQMVSRCAKLRLSKTVSSAIQEQNRKRNRRDSKGTPDDRETLILTSKLTNAGEKNAATAHKFREICFRNILNELESRGVVMKKHFPEVYSRLAAYVDGTSDRFIPMTIYPKDVTSGRSFVCVIMPDNDTECGTAIDSKVTTVDVGVDRSKHQLSTPM-