Monarch geneset OGS2.0

DPOGS215209
Transcript        DPOGS215209-TA   2403 bp
Protein           DPOGS215209-PA   800 aa
Genomic position  DPSCF300143 + 119708-125935
RNAseq coverage   167x (Rank: top 51%)
Annotation
Heliconius        HMEL003867         0.0      59.48%
Bombyx            BGIBMGA008716-TA   0.0      57.13%
Drosophila        CG1234-PA          4e-112   37.62%
EBI UniRef50      UniRef50_E2AC18    1e-124   34.63%   Nucleolar complex protein 3-like protein n=4 Tax=Camponotus floridanus RepID=E2AC18_CAMFO
NCBI RefSeq       XP_972028.1        1e-116   37.43%   PREDICTED: similar to CG1234 CG1234-PA [Tribolium castaneum]
NCBI nr blastp    gi|383866306       8e-128   37.19%   PREDICTED: nucleolar complex protein 3 homolog [Megachile rotundata]
NCBI nr blastx    gi|307181441       3e-136   35.69%   Nucleolar complex protein 3-like protein [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain   [229-326]   IPR011501   4.4e-21   Nucleolar complex-associated
                  [575-721]   IPR005612   5.4e-16   CCAAT-binding factor
Orthology group   MCL12696    Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
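
The lengths and coordinates listed in the record can be cross-checked with a little arithmetic. The sketch below assumes that the transcript entry is the coding sequence including its stop codon and that the genomic coordinates are inclusive; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
transcript_bp = 2403
protein_aa = 800
start, end = 119708, 125935

# A coding sequence of N residues needs 3 * N bases plus one stop codon
# (assumption: the transcript entry is the CDS with its stop codon).
expected_bp = 3 * protein_aa + 3

# Assumption: the genomic coordinates are inclusive on both ends.
genomic_span = end - start + 1

print(expected_bp == transcript_bp)  # True
print(genomic_span)                  # 6228
```

Under these assumptions the genomic span (6228 bp) is longer than the transcript (2403 bp), which would be consistent with intronic sequence, although the record itself does not list the exon structure.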

Nucleotide sequence:

>DPOGS215209-TA
ATGGCGAAAAAAGGCAAACCAAAGATAAGTAAAATAAAAAGGAACAACCACACAACGAATAAAATGAAAAAACAGGGTAAATTAAAGCTCCAGAGGCACAGGACGAAGGTGCAAAAACAGAAACAGCCGGTGAAGGAGACACCAGAATATAGCAGTGACAGTGAGTCCTCGAGTAACGAGTGGGCGGACATGTTGGATGAGGAGGAACAGCAATACATAGTGAATAGACTGGCCAAGCAACCCAATCTGCTGTCAAACATCCCAGAAAAAGAAGACAATAAGAGAGGATCTAAACGCAAGAAAGACAAAGAACGCTTACCCAAAGTGAAGAAGCCAAGAACTGAAAATAGTATGGATGACAACATTGGATCGTCCGATGACTCTGACAGTGAAGTGGAAGTCAAGTATGAGAAGGACCTGGCACAGCGGCCAGTGAAGAAGATGAGATCACTGCTACCCATTAAAACCAAGGATGGCCTGGTGGAGAGAACTGAGGAGTGTGACGACTCAGATACGGAGTCGGCAGAGGAAGATCAGGCAGGTGTCGATGATGAAGTAGAAAAGGCAGCTGTTGAATCAGGCTCGGAACATGATTCAGATGAAGAGACGATGGAAAAGTCAGAAGATAACGAGGAGAAAGAGATTACTACTGTTGAACTGCTAGCAGCCAGACGGGACCGGCTGAGGCATGAGAAGTTGAGGATCGGAGCACTCTGCTCGTCTCTGTTGGAGAGCCCCGAAAAGAAGCTTAAAAACCTCTATCCAATCTTGTATCTGATGGATGAGCACTTGAAAGATGGCACCGCCAACCTGGTGTCAGTCCGTAAGCTGGCCTCCTTGTCTGCTGCGGAGGTGTTCAGAGACATCCTGCCCGAGTACAAGCTCCGGCATCAGGACTACAGCGACGTTAAATTGAAGAAAGATACTCTGTCACTATACAAATATGAAAAGGAACTGCTGGAGTTCTACAAGAGATACCTCCAGAGATTGGAGAAAGCAGCCAGCGTACTGAGACCCAAGAAAGGAGATAAACGTAAACCCGATGACCCTCGGGTCAGTCTTGGCCTGCTGTCGATCAAGTGTATGTGTACCTTGTTAGTGGCGAGACCTAACTTCAACTACGCCACCAATATAGCTCAGAGTGTGATCCCGTTCCTTGACACCACACCGGAGGCAAGAAACACTGTCACCGAAGCCTGCACCGACGTGTTCAAGGAGGACAATAAAGGAGAGATCACATTGGCTATAGTTCGTCTAATAAACCAGCTCGCTAAACGGCGAGGCTCCCGCCTGAACCCCGCGGCCTTGGACTGTTTGTTACAGCTGAAGATACAGGACGTGGAGCTGGACGAGGAACACGACCTCAAGATGAAGAAGAAAACGGAAGAGAAGAAGAAGAAGAGGATCGTCAATCTGTCCAAGAAGGAGAAGAAGAGAGCTAAGAAGTTGAAAGAGGTCGAACGAGAACTTCTGGAAACAGAAGCCAAGGAGAGCGAGTCGGCGAGGAGGAAACAGCTGACGGAAGTTACGAAAACCGTCTTCCATATATACTTCCGGTTACTGAAGAGCTCTCCCACCACTGGCCTACTCATCGCGGCTTTGAACGGGATCGCCAAGTTCACGCACGTCATCAACCTGGAGTACTACTCGGACCTGGTGTCTATCTTGTCTGGTCTGGTGCGATCATCCGCCGACCGCTCCACCCGGCTGGTGGTGGTGGGGACCGTGCTGGCTGTCCTGGCCAAGGCCGGAGACGCTCTCAACGTGGACCCCGCCGTCTTCCACACACACCTCTATCAGGACATGCTGGCCGTACACGCTGGTTGTTCGCGGTCGGAGACGGCCACGGTGGTGGAGTGTGCGAGCGCGGTATGCCAGAGACTGAGGAGGGTGTCGTGCGGCGTGCTGAGGGCGCTGGCCAAGAGATTGCTGACCATGTCCTCGCACGGACAACACCACGCCGCGCTCGCCTCACTCGCGCTCGTGTGGACCATCATGCA
GCACAACAAGCACGTGTCCGCCTTGTTCGAGGCGGAGTGTGCGGGCTCGGGTAGGTTCGACCCGCTGTCGCCGTCCCCCGAGCACTGCGGATCGCATGCCGCCCTCGGGTACGAGGGACCCATCCTGACCTCCCACTACCACCCGGTGGTGCGCGCCGCCGCCGCCGCGCTTCTCCAGAGAGGAGGCTGGCCTCAGGAGTTACACGGACTCACCCCGATGCAGATCTTCGATCAGTACGACTGTTCCCAGCTGTGTTTCAAGCCGGCCATTCCTCCGCCCAAGCCCCGCGACGTCACCAAGCCGAAGACCTCCCAGACCTGGGCCAGGCCGGACTTCAAGACCTACTGCGAGACCGTAGAAGACAATCTAAATATGGACATCGACAACTTCATAGAAAGATGA

Protein sequence:

>DPOGS215209-PA
MAKKGKPKISKIKRNNHTTNKMKKQGKLKLQRHRTKVQKQKQPVKETPEYSSDSESSSNEWADMLDEEEQQYIVNRLAKQPNLLSNIPEKEDNKRGSKRKKDKERLPKVKKPRTENSMDDNIGSSDDSDSEVEVKYEKDLAQRPVKKMRSLLPIKTKDGLVERTEECDDSDTESAEEDQAGVDDEVEKAAVESGSEHDSDEETMEKSEDNEEKEITTVELLAARRDRLRHEKLRIGALCSSLLESPEKKLKNLYPILYLMDEHLKDGTANLVSVRKLASLSAAEVFRDILPEYKLRHQDYSDVKLKKDTLSLYKYEKELLEFYKRYLQRLEKAASVLRPKKGDKRKPDDPRVSLGLLSIKCMCTLLVARPNFNYATNIAQSVIPFLDTTPEARNTVTEACTDVFKEDNKGEITLAIVRLINQLAKRRGSRLNPAALDCLLQLKIQDVELDEEHDLKMKKKTEEKKKKRIVNLSKKEKKRAKKLKEVERELLETEAKESESARRKQLTEVTKTVFHIYFRLLKSSPTTGLLIAALNGIAKFTHVINLEYYSDLVSILSGLVRSSADRSTRLVVVGTVLAVLAKAGDALNVDPAVFHTHLYQDMLAVHAGCSRSETATVVECASAVCQRLRRVSCGVLRALAKRLLTMSSHGQHHAALASLALVWTIMQHNKHVSALFEAECAGSGRFDPLSPSPEHCGSHAALGYEGPILTSHYHPVVRAAAAALLQRGGWPQELHGLTPMQIFDQYDCSQLCFKPAIPPPKPRDVTKPKTSQTWARPDFKTYCETVEDNLNMDIDNFIER-
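
The two sequences above can be checked against each other by translating the transcript. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The helper `translate` is a hypothetical name introduced here, and it writes the stop codon as "-" to match the notation of the protein entry. Only the first and last codons of DPOGS215209-TA are used as input.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna, stop_symbol="-"):
    """Translate an in-frame DNA string; stop codons become stop_symbol."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    return "".join(stop_symbol if r == "*" else r for r in residues)

# First eight and last seven codons of DPOGS215209-TA, copied from above.
head = "ATGGCGAAAAAAGGCAAACCAAAG"
tail = "GACAACTTCATAGAAAGATGA"

print(translate(head))  # MAKKGKPK
print(translate(tail))  # DNFIER-
```

Both outputs agree with the start (MAKKGKPK...) and end (...DNFIER-) of DPOGS215209-PA.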