Monarch geneset OGS2.0

DPOGS215210
Transcript: DPOGS215210-TA, 1785 bp
Protein: DPOGS215210-PA, 594 aa
Genomic position: DPSCF300143 + 128544-135905
RNAseq coverage: 3x (Rank: top 91%)
Annotation
Heliconius  HMEL003867  1e-132  53.76%
Bombyx  BGIBMGA008716-TA  4e-146  54.37%
Drosophila  CG1234-PA  1e-95  38.51%
EBI UniRef50  UniRef50_E2AC18  1e-103  37.66%  Nucleolar complex protein 3-like protein n=4 Tax=Camponotus floridanus RepID=E2AC18_CAMFO
NCBI RefSeq  XP_972028.1  4e-101  39.31%  PREDICTED: similar to CG1234 CG1234-PA [Tribolium castaneum]
NCBI nr blastp  gi|307181441  5e-103  37.66%  Nucleolar complex protein 3-like protein [Camponotus floridanus]
NCBI nr blastx  gi|383866306  1e-101  37.79%  PREDICTED: nucleolar complex protein 3 homolog [Megachile rotundata]
Group
KEGG pathway 
InterPro domain: [369-515]  IPR005612  3.3e-16  CCAAT-binding factor
                 [59-120]  IPR011501  6.4e-13  Nucleolar complex-associated
Orthology group: MCL12696  Single-copy universal gene

Nucleotide sequence:

>DPOGS215210-TA
ATGGCCTTTGAAGCCGAGATGAAAAGAGACCTCCACAAGGAAATTTCTGCTTGGAAAGTAAACCGAATGAAATGTACGACCCTTGTGAAGTGTCAAGTGACATATGACCTCATTCTCCACATTCCCCAGAGTCCTTTGAAACTGATGAAGCTTTTCTTTAAAAATAATTCCTATGATGAACTTTGTATCCATGTGACCTCAGTCCGTAAGCTGGCCTCCTTGTCTGCTGCGGAGGTGTTCAGAGACATCCTGCCCGAGTACAAGCTCCGGCATCAGGACTACAGCGACGTTAAATTGAAGAAAGATACTCTGTCACTATACAAATATGAAAAGGAACTGCTGGAGTTCTACAAGAGGTACCTCCAGAGGTTGGAGAAAGCAGCCAGCGTACTGAGACCCAAGAAAGGAGATAAACGGAAACACGATGACCCTCGGGTCAGTCTTGGCCTGCTGTCCATCAAGTGTATGTGTACCTTGTTAGTGGCGAGACCTAACTTCAACTACGCCACCAATATAGCTCAGAGTGTGATCCCGTTCCTTGACACCACACCGGAGGCAAGAAACACTGTCACCGAAGCCTGCACCGACGTGTTCAAGGAGGACAATAAAGGAGAGATCACATTGGCTATAGTTCGTCTAATAAACCAGCTCGCTAAACGGCGAGGCTCCCGCCTGAACCCCGCGGCCTTGGACTGTTTGTTACAGCTGAAGATACAGGACGTGGAGCTGGACGAGGAACACGACCTCAAGATGAAGAAGAAAACGGAAGAGAAGAAGAAGAAGAGGATCGTCAACCTGTCCAAGAAGGAAAAGAAGAGAGCTAAGAAGTTGAAAGAGGTCGAACGAGAACTTCTGGAAACAGAAGCCAAGGAGAGCGAGTCGGCGAGGAGGAAACAGCTGACGGAAGTTACGAAAACCGTCTTCCATATATACTTCCGGTTACTGAAGAGCTCTCCCACCACTGGCCTACTCATCGCGGCTTTGAACGGGATCGCCAAGTTCACGCACGTCATCAACCTAGAGTACTACTCGGACCTGGTGTCTATCTTGTCTGGTCTGGTGCGATCGTCCGCCGACCGCTCCACCCGGCTGGTGGTGGTGGGGACTGTGCTGGCTGTCCTGGCTAAGGCCGGAGACGCTCTCAACGTGGACCCCGCCGTCTTCCACACACACCTGTATCAGGACATGCTGGCCGTACACGCCGGTTGCTCGCGGTCGGAGACGGCCACGGTGGTGGAGTGTGCGAGCGCGGTGTGCCAGAGACTGAGGAGGGTGTCGTGCGGCGTGCTGAGGGCGCTGGCCAAGAGATTGCTGACCATGTCCTCGCACGGACAACACCACGCCGCGCTCGCCTCACTCGCGCTCGTGTGGACGATCATGCAGCACAACAAGCACGTGTCCGCCTTGTTCGAGGCGGAGTGTGCGGGCTCGGGTAGGTTCGACCCGCTGTCGCCGTCCCCCGAGCACTGCGGGTCGCACGCCGCCCTCGGGTACGAGGGACCCATCCTGACCTCCCACTACCACCCGGTGGTGCGCGCTGCCGCCGCCGCGCTCCTCCAGAGAGGAGGCTGGCCTAAGGAGTTACACGGACTCACCCCGAAGCAGATCTTCGATCAGTACGACTGTTCCCAGCTGTGTTTCAAGCCGGCCATTCCTCCGCCCAAGCCCCGCGACCTCACCAAGCCGAAGACCTCCCAGACCTGGGCCAGACCGGACTTCAAGACCTACTGCGAGACCGTCGAAGACAATCTAAATATGGACATCGACAGCTTCATAGAAAGATAA

Protein sequence:

>DPOGS215210-PA
MAFEAEMKRDLHKEISAWKVNRMKCTTLVKCQVTYDLILHIPQSPLKLMKLFFKNNSYDELCIHVTSVRKLASLSAAEVFRDILPEYKLRHQDYSDVKLKKDTLSLYKYEKELLEFYKRYLQRLEKAASVLRPKKGDKRKHDDPRVSLGLLSIKCMCTLLVARPNFNYATNIAQSVIPFLDTTPEARNTVTEACTDVFKEDNKGEITLAIVRLINQLAKRRGSRLNPAALDCLLQLKIQDVELDEEHDLKMKKKTEEKKKKRIVNLSKKEKKRAKKLKEVERELLETEAKESESARRKQLTEVTKTVFHIYFRLLKSSPTTGLLIAALNGIAKFTHVINLEYYSDLVSILSGLVRSSADRSTRLVVVGTVLAVLAKAGDALNVDPAVFHTHLYQDMLAVHAGCSRSETATVVECASAVCQRLRRVSCGVLRALAKRLLTMSSHGQHHAALASLALVWTIMQHNKHVSALFEAECAGSGRFDPLSPSPEHCGSHAALGYEGPILTSHYHPVVRAAAAALLQRGGWPKELHGLTPKQIFDQYDCSQLCFKPAIPPPKPRDLTKPKTSQTWARPDFKTYCETVEDNLNMDIDSFIER-