Monarch geneset OGS2.0

DPOGS201181
TranscriptDPOGS201181-TA2022 bp
ProteinDPOGS201181-PA673 aa
Genomic positionDPSCF300262 - 52536-56782
RNAseq coverage3560x (Rank: top 3%)
Annotation
HeliconiusHMEL0225790.069.64% 
BombyxBGIBMGA014270-TA0.067.58% 
Drosophilal(1)G0193-PA9e-13647.02% 
EBI UniRef50UniRef50_D6WVE49e-16844.53%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WVE4_TRICA
NCBI RefSeqXP_001355344.22e-15141.19%GA15301 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|2700121063e-16744.53%hypothetical protein TcasGA2_TC006209 [Tribolium castaneum]
NCBI nr blastxgi|2700121061e-16444.44%hypothetical protein TcasGA2_TC006209 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL14584 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201181-TA
ATGAGACTCGCAGCGCTTCTAGCGCTGGTGGCGGCGGTGAGCGCGACCGAGCTCACCGCTGTAGACTATATGAGAACCAAGTTCTACAGGCTCGAGGAGGAGCTGTGGAAGAACGTTACCGACCCCGAGTGGAGTGTGGGGGGTGCGGGGGGAGACGTCGAGCTGACCAAGGCCTTCGCCGCCTTCGACGTACAACTGCAAGCGATCGGACGAGCCCGTCGACCGCCGCTGGAGTCGTGGCTGTGGGTGAAGGCGGTGGAGAGGTTCCAGATCATAGATGGATACTACAAGGAATTCGTGTCGTTCGTCCGTCGCCAGTCTGCGGCGGGCGCGGTGCCAGCCCCGGTCAGGGAATGGCTGGACCTGGCCGAGGGCATCCTCATGGACCCCAAGTCGTCCGTGGCGCAGGCCGTTAGAAAGATACACGACCTGCTGGAGCAGGGGAACATGTTCCGCGGGACCTTTCAGGAGGAGCATCCCGACCTGTGCGAGCTGCAGCTGTCCTCTCACCAGCTCATATACGACATGTACAACACCATCTCACTCACAGAGATCAAAGGATACGCCATGATGCAGTTCTCATGGATGCTGCTAAGGATATATGGAAAAGGGAATTTCACTCAGGAGGCGAGTCTAACACGGCAGAGATACAGCGAGAGGACGTCTCGGACGGCGGCGGCGGCCAGGTCTGCTCTGGCGATGGCCAAGAGAGATCTCTACAGGTGTGACCCGCAACAACATAAGGAAGGCGAAACGTACGCGGAAGTGACTCGTCTGTTACAAGGCTACATAGAAAACGAAGTCAACATGAACTCCGACCAGACGTGCAAAGAAAACTGTGCCTTCTACACACTCGCCGAGCAACACTCGTGTTACGACGAGAAGATGTTCTGCTCGCAGCAGCCCAAGTGCAAGGGCAGGATCATCAGCTGCCAGTACATCGACTCCGACATGTGGATATGCAAGGCGAGTAAGAACAGTATTCGTCGATACGAGTGGATCGAATATGAGAACGGTCGCACCCTGGGGAAAGTCGGGAGCTGCATGAGGGGGACCACTAAGGTGGATTCTTGGTGGCGTTGGTTATTCTGGCACTGTTCTTACTGTATGTGTATCTGCGACGAGGCCGGTCCCAACTCCCATCGTTATTTTAGTCTGTGGGACACCACCTCCGACGTCGACTCCTGGTGGACTTGGAAACTCCTACACTGCTCCTACTGTTTGTGTCTTTGCGAAGATCATATCAGCATTTCAGAGCGCACTTTTAGTTTACGAGAAACATTGGCTGACATGACCGCCAACAAGGTCATCACCGGGCTGAGGCTAGTGAAGTATGGAAGGGTTTTCCACCTTCAGATAAGCGAAGGGACTCTCGGAGAGAGAGGTTCTATCACTCCTAGTGGCTGGGTGCCCATACAAAAGTTCGACATTACTGACGCCGGCGTCAAGGAGGGCGTCGACTACCACACCCTCACTTATGAGAGGAGAGCGATTGATTTGGACGAACTCGATTCACCATCAGGTCACGTTCTGACAGGAGTCAGATTCCGTATGATCGGTGCCCATCTTCACTTTGAGATACGTTCGACTCCTTTCAACTATACAACAGGTCGCCTGTCACCGGACAGGAGCCAGTGGATCAGTAACGATAACACTGATGGAGCCGACACCAAACCCAGGGTCCGCCTGGATCTATACAAGCCGGACCTGCCAACCCGCAGTTTATCACCGCTGCCGGTGGACTCTCAACACGATCAGTACATAGAGTTCACTCATAGCGACTTCGACGCGGACGCGGCGCAGAGTACGGTGCCTTTTATCGACATCCAACCGCTCGAACCTTCCAAGGGCTCGGCCTTGTTGAGCGGCGCGGGCATCATCCACCGCGGGGCCCGCGGCTCCGGCGGGTTCATAGCCACCAAACTGTTCACGTACGACTACTCGAGACACGTCAAGGCCGAGCTGCCTCCTAACAGCTTCGAGGACACCGAGGCGGACCTGCCGCCGCTCAATACCTTCTAG

Protein sequence:

>DPOGS201181-PA
MRLAALLALVAAVSATELTAVDYMRTKFYRLEEELWKNVTDPEWSVGGAGGDVELTKAFAAFDVQLQAIGRARRPPLESWLWVKAVERFQIIDGYYKEFVSFVRRQSAAGAVPAPVREWLDLAEGILMDPKSSVAQAVRKIHDLLEQGNMFRGTFQEEHPDLCELQLSSHQLIYDMYNTISLTEIKGYAMMQFSWMLLRIYGKGNFTQEASLTRQRYSERTSRTAAAARSALAMAKRDLYRCDPQQHKEGETYAEVTRLLQGYIENEVNMNSDQTCKENCAFYTLAEQHSCYDEKMFCSQQPKCKGRIISCQYIDSDMWICKASKNSIRRYEWIEYENGRTLGKVGSCMRGTTKVDSWWRWLFWHCSYCMCICDEAGPNSHRYFSLWDTTSDVDSWWTWKLLHCSYCLCLCEDHISISERTFSLRETLADMTANKVITGLRLVKYGRVFHLQISEGTLGERGSITPSGWVPIQKFDITDAGVKEGVDYHTLTYERRAIDLDELDSPSGHVLTGVRFRMIGAHLHFEIRSTPFNYTTGRLSPDRSQWISNDNTDGADTKPRVRLDLYKPDLPTRSLSPLPVDSQHDQYIEFTHSDFDADAAQSTVPFIDIQPLEPSKGSALLSGAGIIHRGARGSGGFIATKLFTYDYSRHVKAELPPNSFEDTEADLPPLNTF-