Monarch geneset OGS2.0

DPOGS201660
TranscriptDPOGS201660-TA2040 bp
ProteinDPOGS201660-PA679 aa
Genomic positionDPSCF300103 - 514088-518132
RNAseq coverage374x (Rank: top 32%)
Annotation
HeliconiusHMEL0119270.056.11% 
BombyxBGIBMGA005412-TA0.065.62% 
DrosophilaCG11417-PA1e-15043.92% 
EBI UniRef50UniRef50_D1ZZY94e-16651.26%Putative uncharacterized protein GLEAN_07375 n=1 Tax=Tribolium castaneum RepID=D1ZZY9_TRICA
NCBI RefSeqXP_001601072.17e-18049.52%PREDICTED: similar to LD23562p [Nasonia vitripennis]
NCBI nr blastpgi|910810291e-16551.26%PREDICTED: similar to CG11417 CG11417-PA [Tribolium castaneum]
NCBI nr blastxgi|910810290.053.76%PREDICTED: similar to CG11417 CG11417-PA [Tribolium castaneum]
Group
Gene OntologyGO:00056342.4e-11nucleus
KEGG pathway 
InterPro domain[604-633] IPR0125802.4e-11NUC153
Orthology groupMCL13539 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201660-TA
ATGAGTGATATTTTGAAAGATAATCGATTTGCCAAATATTTGAATGATCCTCGGTACAGACAAATACCTAAACATGAAAGGAAAGTTAAAATCGATAAAAGGTTTCAATCTATGTTCAGTGATGAAAAATTTAAAGTTAAATATTCAGTTGATAAACGGGGCCGACCAGTAAATGAAACTTCTACAGAAAATTTACGCAAGTATTACGAATTAGAAGAATCAGATGATGAAACAAGTGATGAAGATTCAGAGACAGGGGAAACTAAAATTGAAACACAGAATGAACAGGGTTCTCAGGAAGACGAAGGAGAACAATCTAATAAAACAAAGAAAAGATTATTGGACTTAGATATTGATTATGCTAGGGGCGAAGGAGTTTTGATGACTGACTCATCAAGTGACGAAGAAAGTAGTGGCTCTGAAGATGATAGTGAATTGGAACATGAATGGGGTGAACTTGACGCTGATGCTGAAACGACAGAGGAATCCACCAAAAGGTTAGCTATTTGTAACATTGAACATGAATGGGGTGAACTTGATGCTGATGCTGAAACAACAGAGGAATCCACCAAAAGGTTAGCTATTTGTAACATGGATTGGGATAACATTAAGGCAACAGACTTAATGGTTCTGCTCAGTTCATTTTTACCTCCAGGAGGAGTTATACACAAAGTTTCGATATATCCTTCTGAATATGGACTGAAAAGAATGCAAGAAGAGGACATAAGAGGTCCCATTGAATTAACAGAAAACAAGGAACAAGAAATAAACTCAGACGACGGAGGAAATGAGGAAGGTTCTACATACCACATGGAGAAGCTACGGAGATATCAATTGAACAGGCTAAAATATTTCTATGCCGTTGTTGAATGTGATTCTGTATCAACTGCTGATAAATTGTACAGTGAATGTGATGGGATGGAGTATGAGAGCAGCGCAACTAAACTCGATATGAGATTTATACCAGATGATGTTACATTTGACCAGGAACCTCGTGAGACATGTAACAACCTCCCAGATTTAACTAAATACAGGCCTCGGCTATTTACAACCACAGCATTACAACAGGCTAAAGTCGATCTCACCTGGGACACTACAAACCCAAACAGAGCCGAAGCTATCAAGAGTGCTCTCAGTGGAAAAATTGATAATTTAGATCTCAAAGAATATCTAGCTTCCAGTAGTGAAGATGAAAAGAGTGAGGAAGAAAAAGAAAATTCAGACATTGAAGACAATGAAGATCCTATTCAGAAGTACAAAATGCTCCTAGAAGATATTGAGAAGAAAGAAGACAAGAAACAGAACAAGGACATGGAAATGGAGATCACATGGGGCTTAGGGGTTAAAGACAAGGCAGAACAGTTGGTCAAAAAAAAGATGACGGAAGATGATAAGAATTTAACACCATTTGAAAAAATGATGCTCAAGAGAAAGGAAAAGAAAAAAGAACGTAAAATGAAGAATAAACAAAGTGTTGAAGGAAATATATCTGGTGCCGAAAGTGACTCGGATATACCATCTGACGTGGACATGAATGATCCGTACTTCGCAGAGGAATTTAATAAAGCTGAGTTTAGAAAGAAATCCAAAAATAAAAAACAAACAGAGGTGACCGAGGATTCGGACGAAGGGAAAAGAAACGCGGAATTGGAACTTCTGTTAGAGGAAGACGACGGAAAACAACACTTCAGTCTCAAGAAGATACAAGAAGCTGAGAACATAACGAACAAGTCGAAACGAAAACGAAAACTGAAAGAGAAAATGAAGCAACAGAAAGCAACAGTTCCAGACTTTGAGATCAATGTCGACGATAACAGATTTTCAGCTCTGTATGAATCACATCACTACAACATCGACCCCACGGACCCGAACTTCAAGAAGACTAAGAATATGGAAAAATTGATTAAAGAAAAATTAAAAAGAAGACCAGCGGAGAGTATAGAGACGGAGTCCAAGCCGAAGAAGACGAGGGAAGAAACCGAGCTTAATATGCTAGTTAAAAATATTAAACGCAAAACTCAAGATGCATTAAAAAAATAA

Protein sequence:

>DPOGS201660-PA
MSDILKDNRFAKYLNDPRYRQIPKHERKVKIDKRFQSMFSDEKFKVKYSVDKRGRPVNETSTENLRKYYELEESDDETSDEDSETGETKIETQNEQGSQEDEGEQSNKTKKRLLDLDIDYARGEGVLMTDSSSDEESSGSEDDSELEHEWGELDADAETTEESTKRLAICNIEHEWGELDADAETTEESTKRLAICNMDWDNIKATDLMVLLSSFLPPGGVIHKVSIYPSEYGLKRMQEEDIRGPIELTENKEQEINSDDGGNEEGSTYHMEKLRRYQLNRLKYFYAVVECDSVSTADKLYSECDGMEYESSATKLDMRFIPDDVTFDQEPRETCNNLPDLTKYRPRLFTTTALQQAKVDLTWDTTNPNRAEAIKSALSGKIDNLDLKEYLASSSEDEKSEEEKENSDIEDNEDPIQKYKMLLEDIEKKEDKKQNKDMEMEITWGLGVKDKAEQLVKKKMTEDDKNLTPFEKMMLKRKEKKKERKMKNKQSVEGNISGAESDSDIPSDVDMNDPYFAEEFNKAEFRKKSKNKKQTEVTEDSDEGKRNAELELLLEEDDGKQHFSLKKIQEAENITNKSKRKRKLKEKMKQQKATVPDFEINVDDNRFSALYESHHYNIDPTDPNFKKTKNMEKLIKEKLKRRPAESIETESKPKKTREETELNMLVKNIKRKTQDALKK-