Monarch geneset OGS2.0

DPOGS201672
Transcript        DPOGS201672-TA    1194 bp
Protein           DPOGS201672-PA    397 aa
Genomic position  DPSCF300103 + 407825-409940
RNAseq coverage   213x (Rank: top 46%)
Annotation
Heliconius      HMEL012956          8e-169   72.12%
Bombyx          BGIBMGA005459-TA    1e-156   67.48%
Drosophila      CG11109-PB          2e-74    42.82%
EBI UniRef50    UniRef50_C3ZX25     2e-85    45.26%   Putative uncharacterized protein n=3 Tax=Eukaryota RepID=C3ZX25_BRAFL
NCBI RefSeq     XP_001599497.1      3e-79    42.90%   PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp  gi|260783662        6e-85    45.26%   hypothetical protein BRAFLDRAFT_116066 [Branchiostoma floridae]
NCBI nr blastx  gi|260783662        4e-83    45.41%   hypothetical protein BRAFLDRAFT_116066 [Branchiostoma floridae]
Group
KEGG pathway 
InterPro domain   [153-340]   IPR001678   3.8e-33   Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
                  [152-166]   IPR023267   6.3e-20   RNA (C5-cytosine) methyltransferase
Orthology group   MCL16944    Patchy

Nucleotide sequence:

>DPOGS201672-TA
ATGCAATTTAACGATACAAAGATTAAAAGTGTATTAGAACAGCAAAGGAGAGAATTAAACTGCCATTCTACTCCTGCGTATTATTTAATAAAACCGGACTGTCTCATCGTGGAACAATGGCCGGAAAATGTGGTAATAAAAAGAGGAAATATCGAGGTAGTAGTGGATTCAATGTGTGCAGCCGCAGTTTTAAGAGGCGCTCATGTGTATGCTCCTGGGGTGCTTGGTTTACCTACCAATTGCAGTCTCAATGAAAGAGTTGATATATATGGTGACTTAGATGGCCACTGTAAGAGAGGCTTAAAGGTTGAGTATAAAGGAAGAAAAATATATGTTGGTACCGGCTACATTAGAATGTTGCGTTATCAGTTATTTGACGATGGAGTCCAACCGAATGGTATAGCAATAGAAACCTTGCTCCCGGCATCAAGACTTCCTGTGATAAATGAATCTATGTATCCTAAAGGACATTTAGTATTACAAAACCTGCCATCTATCATAACAGGTTGGGTGGTCAATGCACAGCCCAATGAACATATATTAGACATGTGTGCCTCACCAGGAAATAAAACAACTCATTTAGCGGAAATGTCCAATAACCAAGCTCACATCACAGCAATTGATAAAACTGATAAAAAAGTTATTAAAATCCGTCAATCTTGCGAAACTCAGGGTGTCACCTGTGTTCATACATTTGCGTTTGATTCAACAAAATGTCATTCGGATGAAGCAAACAATGAAAAAGGGCCCCCATACAAATCAAACACTTTTGATAAAGTGCTATTGGACGCTCCTTGCAGTGGCTTGGGACAACGGCCTCTCTTGAACAATAAAATGACTGCCAAAATGCTTCAATCCTATAAGTTTGTGCAAAGAAAGCTTTTTGATTCAGCTGTGAAAGTTTTAAAAGTCGGGGGTAAGCTTGTATACAGTACATGTACAGTTACTGTAGAGGAAAATGAAGGAATGGTGAGCTGGGTATTGGATAAATATTCTTGTATGGAACTCATTCCCGCTGAACCACTACACGGTGGGCCGGGACAACCGAATGTAGGTCTGAATGATCAACAAAGAGTGATGGTACAGCGATTCGGTCCAGATAACGATCCACTCAGACCGGTGGAAGATTTGTACAGGAATACTATAGGTTTCTTTATAGCAGCCTTCACAAAAAATGATGAGCATTGCGCATGA

Protein sequence:

>DPOGS201672-PA
MQFNDTKIKSVLEQQRRELNCHSTPAYYLIKPDCLIVEQWPENVVIKRGNIEVVVDSMCAAAVLRGAHVYAPGVLGLPTNCSLNERVDIYGDLDGHCKRGLKVEYKGRKIYVGTGYIRMLRYQLFDDGVQPNGIAIETLLPASRLPVINESMYPKGHLVLQNLPSIITGWVVNAQPNEHILDMCASPGNKTTHLAEMSNNQAHITAIDKTDKKVIKIRQSCETQGVTCVHTFAFDSTKCHSDEANNEKGPPYKSNTFDKVLLDAPCSGLGQRPLLNNKMTAKMLQSYKFVQRKLFDSAVKVLKVGGKLVYSTCTVTVEENEGMVSWVLDKYSCMELIPAEPLHGGPGQPNVGLNDQQRVMVQRFGPDNDPLRPVEDLYRNTIGFFIAAFTKNDEHCA-