Monarch geneset OGS2.0

DPOGS200551
TranscriptDPOGS200551-TA2058 bp
ProteinDPOGS200551-PA685 aa
Genomic positionDPSCF300119 + 66190-77564
RNAseq coverage231x (Rank: top 44%)
Annotation
HeliconiusHMEL0168713e-12175.65% 
BombyxBGIBMGA010765-TA0.076.98% 
DrosophilaCG34355-PE0.064.79% 
EBI UniRef50UniRef50_B4N8300.063.61%GK11110 n=8 Tax=Endopterygota RepID=B4N830_DROWI
NCBI RefSeqXP_973211.10.067.03%PREDICTED: similar to CG34355 CG34355-PA [Tribolium castaneum]
NCBI nr blastpgi|910899390.067.03%PREDICTED: similar to CG34355 CG34355-PA [Tribolium castaneum]
NCBI nr blastxgi|910899390.067.65%PREDICTED: similar to CG34355 CG34355-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[147-253] IPR0195453e-55DM13 domain
[283-446] IPR0050181e-25DOMON domain
Orthology groupMCL15953 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200551-TA
ATGGCGTCTATAATTGTTCCAGGAGCGCTAGCATTAAGGCGACCGGAGCCCTACTATGGACGGATCATCGGACGGCTCACTCAATATGCCCACGGGATCCGAGGCACAGTGTACGCCGTGGACGAGAGCACCATCTTCGTAAAAGGCTTCGCATATGACGGAACTGGCCCCGACGCCTATTTCTGGGTGGGGGACACGCCCCAGCCGTCACCAGAGGGTACTCTGGTACCCTATCCCGAAGACTATCCCAGTCGCGACCCCCCGGTACTGTCATCACATACTAATTCAGATATCTTGCTTCGTCTGCCCGCCGGGAAACGACTGAGGGACATCAAATGGATCAGCGTGTGGTGTCGAAGATTTACCGTGAACTTCGGGGACGTCTTCCTTCCCCCCGGCCTGGACCCTCCTCGTCCGCGAGTGCTCCCGGAGTTCAAGCGCCTGGCCCACGGCCTCCGGTCAGGGAACATCAGCGTGTTGGACGCCAAGACCTTCTACATCCCCAACTTGCACTATGACGGCGCCGGTCCTGACGCTTACTTCTGGGTCGGTAACGGCAGTGAACCCAATCCGTTTGGTACTAAAGTCCCTAACGAGATGGGATCCCTCAGCCCCCTCCGAGGATACCAGGGTGAGGATATAGAGCTGGTGCTGCCGGGCTCTCTGTCCGCCCACGACATCGACTGGCTCGCCGTGTGGTGTGTGGAGTACAGGCACAACTTCGGTCACGTCTACATCCCCAAGGATCTGGACGTGCCACCAGCTCTCGGACAGACCAAGATCACGACAAGTTCCACCACGTCCCAAGTACCATTTAACGCCGCTAATAATTGCAAGGAAATTCTGGACAAGCGGCTCCAGGTCCGATGGGAAGTTCAGGGTGATCACGTTCAAGTGACGCTAGCAGCTCGTCTACGTAAAGAACACTACATGGCCTTTGGGCTGTCAGGAGCTGAGGGAAGACCTGCCATGCTGGGAGCTGATGTAGTAGTCGCCTTCTGGGACACTAAGAACAACCAGCCGCGAGCCTTCGACTACACAATCAGTCACCTGGCGCAGTGTGACGGCGAGCGAGGCGTGTGCCCTGACGCCCGGCTCGGAGGCAGCGAGGACGTCGCTGTCGTATCGGGTCATCAGAAAGACGGCGTCACCACTATAACCTACCGCAGGCCGCTCGCCGCCACCGAACACACCAGGGACAAGGAGATCCTGACCTCGAGGTCTCAGTCCGTCATCGCCGCCTTCGGACCTCTCAACTCACGATACGAGGCCAACGCACATTCCTTCATGGACACCACCAGAAGTGACGTGCAACTTGACTTTGGAGCTCAGAACGACAACTCATGTGTGGGTCTAGCGACGCTGGACGCGGCAGGTCCCACTCCCTGGTCGCCGCGAGTTCTGGCCGGGGTCACTAACATGACAGCTCGTATAGGGCCGGCCGGGGGTCGGAGAGGGTACACGCCCATCACTGGTCATCCGTCGTGGGGCATCGCGTGGTACATTAACGACCAGCTGATACCCGAGATATATGTGGAGCGTGGCAAGACATACACCTTCCTGGTGGAGGGAGGAGACGACAGGACCAACCCTGCTAAGTTCCACCCGCTGTACATCAGCGACTCCTCGGAGGGGGGCTTCGGGCAGAAGAGACCCGAGGAGCAAAGGAAGCAGCGCGTGTTCGCGGGAGTCGCCTTCGACAACGAGGGCTACCCTTACCCCACGGCCGTGGGCCGGTACTGCGAGTGGACCCACAAGACGACGGACCAGTCCGCGGCGTCCGACACCTTCGAGCAGTACATGAGGACCTTACAGCTGGAGTGTAACGACGGAGAACCCGCCACACTCAACTGGACCGTGGCCCACGAGACGCCAGACCTCGTCTACTACCAGTGCTACACTCATAACAATCTAGGATGGAAGATCCACGTGGTAGACCCTGGGACCCCGATGCCAATACCGGGAGACAAGAGCGCTATCGTGAACGAAGGAAGCCAGCTGTCAGCTGTCATTGCAACTGTCATATCTACTTTCATTATGACAGCTCTCAGCAGATAA

Protein sequence:

>DPOGS200551-PA
MASIIVPGALALRRPEPYYGRIIGRLTQYAHGIRGTVYAVDESTIFVKGFAYDGTGPDAYFWVGDTPQPSPEGTLVPYPEDYPSRDPPVLSSHTNSDILLRLPAGKRLRDIKWISVWCRRFTVNFGDVFLPPGLDPPRPRVLPEFKRLAHGLRSGNISVLDAKTFYIPNLHYDGAGPDAYFWVGNGSEPNPFGTKVPNEMGSLSPLRGYQGEDIELVLPGSLSAHDIDWLAVWCVEYRHNFGHVYIPKDLDVPPALGQTKITTSSTTSQVPFNAANNCKEILDKRLQVRWEVQGDHVQVTLAARLRKEHYMAFGLSGAEGRPAMLGADVVVAFWDTKNNQPRAFDYTISHLAQCDGERGVCPDARLGGSEDVAVVSGHQKDGVTTITYRRPLAATEHTRDKEILTSRSQSVIAAFGPLNSRYEANAHSFMDTTRSDVQLDFGAQNDNSCVGLATLDAAGPTPWSPRVLAGVTNMTARIGPAGGRRGYTPITGHPSWGIAWYINDQLIPEIYVERGKTYTFLVEGGDDRTNPAKFHPLYISDSSEGGFGQKRPEEQRKQRVFAGVAFDNEGYPYPTAVGRYCEWTHKTTDQSAASDTFEQYMRTLQLECNDGEPATLNWTVAHETPDLVYYQCYTHNNLGWKIHVVDPGTPMPIPGDKSAIVNEGSQLSAVIATVISTFIMTALSR-