Monarch geneset OGS2.0

DPOGS213419
TranscriptDPOGS213419-TA1521 bp
ProteinDPOGS213419-PA506 aa
Genomic positionDPSCF300271 - 161178-169553
RNAseq coverage311x (Rank: top 36%)
Annotation
HeliconiusHMEL0168080.081.68% 
BombyxBGIBMGA004393-TA2e-16966.47% 
DrosophilaCG17341-PB1e-6854.62% 
EBI UniRef50UniRef50_D6WS563e-9947.84%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WS56_TRICA
NCBI RefSeqXP_975694.22e-10049.30%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892395894e-9949.30%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|2700094891e-10650.00%hypothetical protein TcasGA2_TC008753 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL16441 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213419-TA
ATGTGCATCACTATTCGCGAGGGAGGCAGTGGTGTCCAGGGGGTGTCCAGGGGGTGTCCGGGGGGCGCGGACAACCTTACCCCTGAACCAGCCGACATGACTGATGACGACAACTTTCCTGGTCAGGGCTACGCGAAAGACGCTGAGAAGTTGAAAATGATGTTGCTGGCGTGGAATTATCACAGCCAGACGCAAGGCCGCAATGGTGATATAACAGAGAGCGGCCCCGACAGTATGGCGGATCTGTGGGCCTCTTACCACAGCGCTCTGGGGCTCGGCAGCAAACCTCCTAAGCCTCCCACTCCGCTAGCTACAGGACACTCGAGTCCCATCCACGTCGGTCTGGGTACGCCGGTCGAGCCGGCATCGACTCACGATGAAACCTCGTCATCTGATAGGACCAAGGACGATGATGATGGAGGCGCCTCCGATGATGACAGCGATGACCGCGTCGATCCCCAACACCATGATCCTGAACGACTCAAGGCTTTTAATATGTTCGTTCGTCTTTTCGTCGATGAAAACCTGGACAGAATAGTTCCGATCTCCAAACAGCCGAAGGAGAAGATACAGGCGATCATAGATTCGTGCACGCGACAGTTCCCGGAGTTCGCTGAGCGGGCGAGGAAGAGGATCAGGACGTACCTGAAGAGCTGCAGGCGGAATAAGAAGGTCCGCGGCGAGGGACCTGCGACCGGCAACGGGGGAAACACGCCCTCAGGGAACGCCTGGGACACAGCGGTGCGTCCGACCCCGGCCCACTTGACTTCGGTCCAGGCCGAGCACATTCTGGCGCAGGCGTGCGAGAACGAGAGCCTCAACGCCAAACGCATGCGACTAGGACTTGACCCTGTAAGCCAGCCCATGCCCACTGTGCCTCAACCCATGGCGATCGACACCACAGCTACGTCGTCATTCTTGAGCCTGTACAGCGGTTCCAGCTCCACCACCGTCACCAGCTCCACACACAAGGACACGCGCCCGGCGACCAGCCCCGCGCTGGAGAACACCAACAACAACAGCCTCAAGATCGACTCCAACGGCAACACCGGCAGCAAAACTGCGAGCCCAGCGTCATCACCAGCTCCGCTGTTCAGACCCTCGTTCCCGAACACGTTCGGAAATCCATCGACTTTCGCGAGGCAGAGTTCATCAACTAACCCGACATCAGCTCTGTCGAGCACCGCTCTGCTGTCTACACTTACTTCTCTCCCGCCAACAGGTGTTGGAGTGAACGCAGCCATGGCGGCCTATCAGAGCGCTCTCCTCTCAGCTATGGCGGCGCACACGCACACATTCCCGAACTTAACAGCTCCGACTGATCTGTCTCTGAAGACATCGACGGCTCCTCTTCACACGAGTTCTCTATCAGGATCCAGCACTAAGTCACTTCTCTCACACAAACTGTCCAACGCCGAGGTAGGGGCTGTGAGGCAACTCATCACGGGCTACCGCGAGTCCGCTGCTTTCCTACTACGATCTGCGGACGAATTGGAGGCTTTGCTGCTGAACCAGCCCTAG

Protein sequence:

>DPOGS213419-PA
MCITIREGGSGVQGVSRGCPGGADNLTPEPADMTDDDNFPGQGYAKDAEKLKMMLLAWNYHSQTQGRNGDITESGPDSMADLWASYHSALGLGSKPPKPPTPLATGHSSPIHVGLGTPVEPASTHDETSSSDRTKDDDDGGASDDDSDDRVDPQHHDPERLKAFNMFVRLFVDENLDRIVPISKQPKEKIQAIIDSCTRQFPEFAERARKRIRTYLKSCRRNKKVRGEGPATGNGGNTPSGNAWDTAVRPTPAHLTSVQAEHILAQACENESLNAKRMRLGLDPVSQPMPTVPQPMAIDTTATSSFLSLYSGSSSTTVTSSTHKDTRPATSPALENTNNNSLKIDSNGNTGSKTASPASSPAPLFRPSFPNTFGNPSTFARQSSSTNPTSALSSTALLSTLTSLPPTGVGVNAAMAAYQSALLSAMAAHTHTFPNLTAPTDLSLKTSTAPLHTSSLSGSSTKSLLSHKLSNAEVGAVRQLITGYRESAAFLLRSADELEALLLNQP-