Monarch geneset OGS2.0

DPOGS215458
TranscriptDPOGS215458-TA1794 bp
ProteinDPOGS215458-PA597 aa
Genomic positionDPSCF300098 - 566527-568320
RNAseq coverage91x (Rank: top 63%)
Annotation
HeliconiusHMEL0083570.068.22% 
BombyxBGIBMGA007488-TA4e-13573.29% 
DrosophilaCG10428-PA2e-11141.51% 
EBI UniRef50UniRef50_UPI00021A88B32e-12039.97%UPI00021A88B3 related cluster n=1 Tax=unknown RepID=UPI00021A88B3
NCBI RefSeqXP_001601512.18e-11840.68%PREDICTED: similar to GA10313-PA [Nasonia vitripennis]
NCBI nr blastpgi|3838520872e-12041.58%PREDICTED: glutathione S-transferase C-terminal domain-containing protein homolog [Megachile rotundata]
NCBI nr blastxgi|3304178733e-11440.68%glutathione S-transferase C-terminal domain-containing protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL13727 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215458-TA
ATGAAAAATGCAATTTATTTAGAAACCTCTACGTCTCCAGAGGATTATTATCCAAGTAATAAAATAAAAATATCTTTAGAGTCCTTAATAACTTATTCGGTATATAAATTCTGTGTATCTGATATTCAATTGTATTTTGTACATAATAAAAACACACCAGAGTCAGGCTTGGTTGAAATTATTTGTGAAAATATAGATTTTGTAGATTTGAAAATTGTACCATGGCAAGTTAAATCGTGTATATACCCTGTTGCTTTTTATGATGATACTATTATAACTGGTTTGTGTGCGGTGTGTAGACATATATGTAAGTTCAGATTGAGTCCTCGTACCTATCATGAATATGAGGAAGGTCTACTCGCCTTTCGTAGAGGTTGCCTACAAGCTCCCAATGAAGTATCTATATGGACAAAATTTTGTGAAGTAGAACTTATCAACACAGTCAAAGAGTTATTTAACATAACTAGCTTAAGGGAAGTACCAAAAAGTTTAGTGAGGTTTGAAAATCATTTGAATAAACCTGTGAGAATTCACAACATATTTAAAGTAGCTAATGATTTGAGCAAGGATAATTCTAAAAGTATAGAACATGTTGACAAATTCCAAAAAATGAAATCACAAAAAGATTCTAGAATTTCAAAGCAGCGGAAGTGGAAATCAAATGTAAAGGAAGATCAAACAAAGGTTAAGTGCCTCGGGAGAGAAGATCTTGATATACATCACCAATACGCTGAGGGACCATTTTTTACTTTAGCTGATTTGGTATTACTACCTTCATACTATATTGTTATAAAGTTCTTTGGAGAAAAACTTTTCCAATCCTTATTACCACTTTCACATAAATGGCTACTAAATGTGAAAAATGTAAAAGAAGTTGATGAATTGTTTAAATTCTTAAGCACAATTCAAATTCAGCCGCTTGAAATCAGCCAAATTGATTTTCCAGATGTAGAGGATGTTAGTCTATATAAGTCAGATCCTAAAAGACACAACCCTAAAAAAAGATTGTTTACCAAACCTGACGACATAGAGAAAGCATTAAATGTTTTAGTAGAGGGTATGGAACTACTTATGAGTAATTTAGAATTTGAAAAAACATTTGACTGGGACGACATACCAGATGGTGCAAATCCTGAGGCCGGTCATTTGCCTGATGAACGAGTGGAACGGAAATCACAGCAACTACAGAATCTAACCCTTGCTGTGCTAGCTATGGCAAAAGATGGGGATCATATTGTTGACTTTTGCAGTGGTAGTGGGCATTTGGGTATACTGATTGCACATCTCTTGCCAAAATGCACAATAATTTTACTTGAAAATAAAGAGCAGTCTTTATTAAGAGCTAGAGAAAGAGTCCACAAAATGGGACTAAAGAATGTTTACTTTTTCCAATGCAATTTGGATTTCTTTATCGGAAAATTTGATATTGGCATTGCATTACATGCATGCGGAATAGCCAGTGATTTGGTTTTGGACAAATGTTTGAGTTCCAAAGCTAAGTTTGTGTTATGTCCCTGTTGCTATGGTTCCATACACGCTACTGATAGGCTGGTGTATCCGAGAAGTGCTGCATTTAAAAAAATGAGTATTGAACAATATTTATGCATCGGTCACACCGCTGATCAAACGCACAAGGAACATCCGCTCACAGTAAGAGGAGCGAGATGTATGGCAGTTATTGATTCTGATCGAGCCAGGTTAGCCGAGGAACACGGCTATACAGTAACACTATCAAGACTAAAACCGCTGTCGTGTACTCCTAAGAATAATCTACTTATTGGTGTACCTTTATAA

Protein sequence:

>DPOGS215458-PA
MKNAIYLETSTSPEDYYPSNKIKISLESLITYSVYKFCVSDIQLYFVHNKNTPESGLVEIICENIDFVDLKIVPWQVKSCIYPVAFYDDTIITGLCAVCRHICKFRLSPRTYHEYEEGLLAFRRGCLQAPNEVSIWTKFCEVELINTVKELFNITSLREVPKSLVRFENHLNKPVRIHNIFKVANDLSKDNSKSIEHVDKFQKMKSQKDSRISKQRKWKSNVKEDQTKVKCLGREDLDIHHQYAEGPFFTLADLVLLPSYYIVIKFFGEKLFQSLLPLSHKWLLNVKNVKEVDELFKFLSTIQIQPLEISQIDFPDVEDVSLYKSDPKRHNPKKRLFTKPDDIEKALNVLVEGMELLMSNLEFEKTFDWDDIPDGANPEAGHLPDERVERKSQQLQNLTLAVLAMAKDGDHIVDFCSGSGHLGILIAHLLPKCTIILLENKEQSLLRARERVHKMGLKNVYFFQCNLDFFIGKFDIGIALHACGIASDLVLDKCLSSKAKFVLCPCCYGSIHATDRLVYPRSAAFKKMSIEQYLCIGHTADQTHKEHPLTVRGARCMAVIDSDRARLAEEHGYTVTLSRLKPLSCTPKNNLLIGVPL-