Monarch geneset OGS2.0

DPOGS206580
TranscriptDPOGS206580-TA1881 bp
ProteinDPOGS206580-PA626 aa
Genomic positionDPSCF300108 + 540902-542782
RNAseq coverage210x (Rank: top 46%)
Annotation
HeliconiusHMEL0180820.089.17% 
BombyxBGIBMGA013741-TA0.084.71% 
Drosophilasano-PB1e-9540.83% 
EBI UniRef50UniRef50_E0VQG30.057.85%Putative uncharacterized protein n=3 Tax=Neoptera RepID=E0VQG3_PEDHC
NCBI RefSeqXP_002428357.10.057.85%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420154320.057.85%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|910856350.060.31%PREDICTED: similar to serrano protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL16024 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206580-TA
ATGGCAGCCACAGATGGCCAAGTTGTGGTCGCAGCATCCCGTTGGTCAGATGAAGGCCACTACCTCAAAGAGTTTGTTGTTAAGAACCGTTTACCCAATGTCGCAAAAATCATAAAAGGGCAATATGGAGGCCTCGGAGTACCAACCCTCCCTAGTCCTGGATTACAGAGCACCGCATTACTTGTATCCGCTGGAAAAAAGAAGAAAATAATTGCACAGGCTATAAAATTAAAAGAAGGGAGACGAATGGTTAGTGTTGGACCCAGAATAGCAATCCCTGAAACTTACAAAGGCTATTTCGAACTTCTTAGTGAAGAGGGAAGAGCAGTGCGATGTATGGAATCGGTATCTGAAATAGCGAGGAGAAAACTCGAAGATGGATGCTTAGTAAGAGAGCCGATAAGAATAATATGCGCTAAAACTGATTTAAATGGTGACATAACTGCCGACGGATCACGGACTTTGCCTGCTGGTGAAGTAATAATGCCTAGAGGAGAGGTATTTTTAGGTAAAAATAAATATTTGAAATGCACCGATTCTAAAGGTGATACCATTCTCTTGGGCTTAGATCAAAGAGGCAAGTTTTCTGCTGTAGCCAAAGAGGAGAATATCAGCGGAGTGCATACAGCGAAGACGTTACTAACTAAACGATTGCCTATTACTGTTAGACTCGTGCATGGTACTCCACCAAGAGGACTAAAAAGTGCTAGTTATTTCGTACCAGAATTGAGATTATTATCTTTATATGAAGAAGATCATGTATTCGCACTGCCCTTACAAAAAGAAGGGAATGCTTTGATAGCTCTACCACTAGCTGCACCCCTTAAAATGTTAAAATGTAAGAACGAAGAGCACATAAAAAATTTTATGGAATTCTCACGATTAGTTGAAAAATGCAATAGATTATTAGTGGACGTAGTAGATCGAATACATGTTTTAGATGGAAAGCTTGGTGATCCGAAAAGATTACTAAGCACACCCCTCAATGCTCCACCTTTAATAAAAACGGGTTACTTCCTAAGAAGGAGTGCATCATCAGACACAGCAAACCAGCACAAATACATAAACCGACACAGTCATATTTACAGTAGCCACAGAGATGAAAATAGTATTCCCGATGAGTATGATGAAATAGATCAAATATATGATTATGTAAGAGGCTTTGCACCATTACCCAAAAATATTAAAGCATCATATTGTAACGAGCCCACAAGACATGAGTCTGCACCAGCTTCCCCAGCTGCGACTCCCATCCATATGCCCTTGATTACAGAAATAAAACCCGAACCCCCACCCATAGAGACCATACCCACAAAGAAAACGTTCAACCAAAAGGCAGAGAAAAGAACACGCAAAAGTACCACAACAACAAAAGAAACGCCGATTATTGTATACAAAGAGAAGTGTTCTATAGCAGCACCGGTGCCTAATTTACCTAAGTTGTATATAAAAAACAATTTGAATACTAACAAAAGTCGGATGCTGTTGACTCAAAAGAGTCACTCCCCGACTAAAGAGGTACCTAGTCCAGGAACTAGTCCGATAAGACCTCTAACAAAAGGGGATTCCCCGATCTTCAACATACGATATAAAAGTCTTTCCAATATTCATCAAGCCATGGAATTAGATGGTACTTTAGATTCGAGTCATTCTGGTGGCAGAACATCTGGGGATTCAGGAAATGGTCCCAAACTCCCTGAAAAAAGATCTAGGCGATTGAGTAGACCGAAGTCTCTGACGAATTTAGTTTGGGAGCTAAAAGGATGTCCAGTGGAGACCGCTGAAAAACCAAAAGCAAGGACAAAAAATGACGCATACAAGAAAATTGGGAATAAATTATCTTTGGGGGCGCAAAAACGAGTGCCAACGTTGTATCTTTAA

Protein sequence:

>DPOGS206580-PA
MAATDGQVVVAASRWSDEGHYLKEFVVKNRLPNVAKIIKGQYGGLGVPTLPSPGLQSTALLVSAGKKKKIIAQAIKLKEGRRMVSVGPRIAIPETYKGYFELLSEEGRAVRCMESVSEIARRKLEDGCLVREPIRIICAKTDLNGDITADGSRTLPAGEVIMPRGEVFLGKNKYLKCTDSKGDTILLGLDQRGKFSAVAKEENISGVHTAKTLLTKRLPITVRLVHGTPPRGLKSASYFVPELRLLSLYEEDHVFALPLQKEGNALIALPLAAPLKMLKCKNEEHIKNFMEFSRLVEKCNRLLVDVVDRIHVLDGKLGDPKRLLSTPLNAPPLIKTGYFLRRSASSDTANQHKYINRHSHIYSSHRDENSIPDEYDEIDQIYDYVRGFAPLPKNIKASYCNEPTRHESAPASPAATPIHMPLITEIKPEPPPIETIPTKKTFNQKAEKRTRKSTTTTKETPIIVYKEKCSIAAPVPNLPKLYIKNNLNTNKSRMLLTQKSHSPTKEVPSPGTSPIRPLTKGDSPIFNIRYKSLSNIHQAMELDGTLDSSHSGGRTSGDSGNGPKLPEKRSRRLSRPKSLTNLVWELKGCPVETAEKPKARTKNDAYKKIGNKLSLGAQKRVPTLYL-