Monarch geneset OGS2.0

DPOGS209957
TranscriptDPOGS209957-TA3459 bp
ProteinDPOGS209957-PA1152 aa
Genomic positionDPSCF300148 - 114121-117756
RNAseq coverage252x (Rank: top 42%)
Annotation
HeliconiusHMEL0135580.084.95% 
BombyxBGIBMGA011274-TA0.079.90% 
DrosophilaCG7728-PA0.058.19% 
EBI UniRef50UniRef50_Q9VVC90.058.19%CG7728 n=14 Tax=Diptera RepID=Q9VVC9_DROME
NCBI RefSeqXP_002047245.10.060.17%GJ12043 [Drosophila virilis]
NCBI nr blastpgi|1953769290.060.17%GJ12043 [Drosophila virilis]
NCBI nr blastxgi|1951271270.061.01%GI13274 [Drosophila mojavensis]
Group
Gene OntologyGO:00422542.8e-43ribosome biogenesis
GO:00056342.8e-43nucleus
KEGG pathwaytet:TTHERM_007845703e-09 
 K00162 (PDHB, pdhB)maps-> Citrate cycle (TCA cycle)
    Glycolysis / Gluconeogenesis
    Valine, leucine and isoleucine biosynthesis
    Butanoate metabolism
    Pyruvate metabolism
InterPro domain[685-979] IPR0070348.5e-96Ribosome biogenesis protein BMS1/TSR1, C-terminal
[232-318] IPR0129482.8e-43AARP2CN
Orthology groupMCL14018 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209957-TA
ATGGCCGATGAGGCTATTTTCGATAAAAAGAAGTCACACCGTGCAAAGCATGCCGGCAGAAAAGCAGAGAAGAAAAAAAAGAAAAATCAAGTTGATCAGTCTAACCTAAGTGCGCGGCAAAGAAATCCAAAAGCCTTCGCAATTAATTCAGCGGTACGAGCTGAGAGACAGTTTAGGCGTCGCGAAGATGTTATATCTAAAAAGCAGCACATACCACTAGTGGATAAAACACCTTTGGAGCCTCCACCTATCGTCGTCGCTGTAGTCGGGCCGCCGCGAGTCGGAAAAACTACAGTCATCAATAACCTTATTAAAAGTTTTGTCAAAACAAATGTCACCAGTACCAATGGCCCCATAACGATAGTCACGTCTAAGAAAAGGCGTCTCACGTTAATAGAGTGTAACAACGATGTCAATTCTATGATAGATATCGCGAAGTGTGCGGATCTCGTCCTCTTGCTGTGTGATGCGAGCTTCGGGTTTGAAATGGAAATATTCGAATTCCTGAATATATGCCAGGTACATGGGATGCCAAAGATTATGGGTGTGTTAACACATCTGGACATGATAAAGAATGCCAAGAAGTTAAAGATGACCAAGAAAACATTGAAGCATCGTTTCTGGACCGAAGTGTATCCTGGCGCCAAATTATTTTATTTATCTGGTATCATTCATGGAGAGTACTTGAGAAATGAAATAAAAAATTTGTCCAGATTCATATCTGTTATGAAATTTAGGCCTCTGAGCTGGCGAATGACTCATGCTTATATTTTGGCTGATAGATTAGAAGATATCACTAGTCAGGATAGCATCAGGAAGGATCCAAAAATTAATAGAGATGTGGTCTTATATGGTTATGTGAGGGGTGTTCCTCTCATGAAGGATTCAATGGTGCATTTAGCAGGTGTTGGTGATATGAAGATCAGTGAACTCTCATACCTGCCAGACCCATGTCCTCTACCCAGCAGTGAGAAAAAAAGACATTTGATGGAGAGGGAAAGACAAATCTATGCTCCCTTCTCTGGTGTGGGAGGAATTGTCTATGACAAGGACGCAGTTTATATTGAACTCAAAGGATCTCACTCACATAAACAAGAAGATGAAGAAACAAATGAGAAGCAAGCCTTACTTAAAAGTGTTGTGGAGACAACAGAGACTGTTGATGAGCAAATGCAAGAATCTGGCTTCAGACTGTTCAGTGGAGGAACTGTCATTTATCCAGATATGGTAAAAGATGATAAAGACTTGCAGAACAGAGAATCTCAGAATAATGAGTCCAGCTCTGATGAATCTGATTCTGGAGATGAAAATGATTCTGACAATGACAGCGGCATAGATCAAAGTGAGAAAGACACCACAGGAAAGAGTTCTAAACTGCCTTGGGACAATAACTCAGAATCTGACGAAGATGATGAAGACGATGAAGATGACGGTAACGATGAAGCAAAAAACCTTCAAGTGGACAAACTGTACAGTCATGAAAATGATTCTGATGAAATTGATGAAGAACATAAAAGTTCTGACGCAGAAGACTTTAGCATTAAGTGGAAGGAAAAATTAGGTGAAAAAGCATCATTGGCCTATTTTGAAAGACAAAAGACTTCAAAAAATATCATGAAACTTGTTTATGGAGAATTTGAAATAGGAAATAAGAGAAAAGAAGATGAAGATGGGAATCGTGATGAGGAGAGTGACGAAGAAGAAATAGGAGGTCTCTTCAAGAAAGTTACAAGTTCACAGAAAAGAAAGCAGGCGGACAAGGAACAGTTGGATCTCGAGGAATGTTCGTATTTCTATAACAGAAATAAACCACATCTAAGAGATTGGACCACGGAAGAAAATAAAAAACTGATTATTAATTGTTTTGTGACTGGAAAGAGGGGCGCCGACGAGGACGCCGAGGAACTCTTGAAATTAGACGACGCCAGTGACGGGGAGGATGAAGTCTATGGAGACTTCGAAGATCTAGAGACGGGTGAAAAGCATATTAGTAAACAGCAACCGGAAGATAAGACAGAAGAAGTAGGTTCAAAGCGCAAAGCGGAGCCGACCAAATCGGATATTTTAGACAAAAAATTAAAATTAAAAGCTAAGTTTGATGCTGAGTATGACAACCCAGACGACCACAGAATCAAGGGGGATCATTCTTATTATGAAAGTTTAAAAGCTGAAGCGCTCAAGCAGTCTCAACTAAACAAGTCAGTTTTCGAAAATTTAGACAACGGTTTGCGAGTCGAAGTCGAAGGCTTTCGACCCGGTCTGTACGTTCGTATGTTATTCAAGAATATGCCCTCGGAGTTCGTCACAAACTTCGATTCAAGCTATCCTCTCCTCATAGGCGCTTTAAATATGGCTGAGCAAAACATCGGCTTCGTCTCGTGTAAGGTTAAAAAACATAGATGGTACAAAAAGATATTGAAAACCAACGATCCTTTGATTATTTCCTTAGGGTGGCGGCGGTTCCAGACGTTGCCGATCTATTCCAAAATAGAGGATGATTTAAAATGTAGGTATTTGAAATATACCCCGGAACACGTCACTTGTAACATGCACTTCTATGGTCCGATAACGCCACAGAACACAGGGTTTCTGGCGTTACAGACGGTCAATAACAATTCAAACGAAATTAAACAGCTAGGGTTCAGAATCGCCGCCACCGGCAGCGTCAATGAGATTAACAAGTCGACTCAAATCATGAAGAAACTGAAATTGGTCGGGACTCCGTTAAAAATTTACAAGAAGACTGCCTTCATCAAGGACATGTTTACGAGCACTCTCGAAGTTGCTAAGTTCGAAGGAGCGAAAGTTAAAACGGTGTCCGGAATCAGAGGTCAAATCAAAAAAGCACTGAACAAACCCGAAGGTGCGTTCCGAGCGACGTTCGAGGACAAGATCCTCATGAGCGATGTGATCTTCTGCAGGACATGGTTCAAAGTGGACGTATCAAAATTCTACGCACCCGTCGTGAATCTTCTTCTTCCTATCGGAGCGAAGAACGCCTGGCAAGGCATGAAGACGAAAGGCCAATTGAAGAGAGAACGGAACATAAAGGTCGAAGCGAACACGGATTCCATGTACACCGATATTGTGAGAGAACCGAAAGTGTTCAAGCCACTAGTTATACCCAAGGAACTACAAAAAGGTTTGCCTTACAAGTTAAAACCCAAAGAGAAGACGAGCACGCTGACCAAGAAGAACATTAAAGAAAAATTGTCCAATAGAGTCGCCGTCATAAAGAGTCCGCACGAACAGAAAGTAGCGAATGTCATGAAGATGTTGAAGACCAACTTTGAAAAGAAGAGGGAAGTGCAGAAGGCCTCGACCGCCGAGAGGCTGAAGAAGTTCAAGAAGCAGCAAGAGGAAGAGGAGTGGAGGAAGATCAAGAGGCAGAAGGAACTCAAGAAGAAAATCTGCAGGCATCTGAGCAAAATATCCAACAAGAAACAGACACAGATGTGA

Protein sequence:

>DPOGS209957-PA
MADEAIFDKKKSHRAKHAGRKAEKKKKKNQVDQSNLSARQRNPKAFAINSAVRAERQFRRREDVISKKQHIPLVDKTPLEPPPIVVAVVGPPRVGKTTVINNLIKSFVKTNVTSTNGPITIVTSKKRRLTLIECNNDVNSMIDIAKCADLVLLLCDASFGFEMEIFEFLNICQVHGMPKIMGVLTHLDMIKNAKKLKMTKKTLKHRFWTEVYPGAKLFYLSGIIHGEYLRNEIKNLSRFISVMKFRPLSWRMTHAYILADRLEDITSQDSIRKDPKINRDVVLYGYVRGVPLMKDSMVHLAGVGDMKISELSYLPDPCPLPSSEKKRHLMERERQIYAPFSGVGGIVYDKDAVYIELKGSHSHKQEDEETNEKQALLKSVVETTETVDEQMQESGFRLFSGGTVIYPDMVKDDKDLQNRESQNNESSSDESDSGDENDSDNDSGIDQSEKDTTGKSSKLPWDNNSESDEDDEDDEDDGNDEAKNLQVDKLYSHENDSDEIDEEHKSSDAEDFSIKWKEKLGEKASLAYFERQKTSKNIMKLVYGEFEIGNKRKEDEDGNRDEESDEEEIGGLFKKVTSSQKRKQADKEQLDLEECSYFYNRNKPHLRDWTTEENKKLIINCFVTGKRGADEDAEELLKLDDASDGEDEVYGDFEDLETGEKHISKQQPEDKTEEVGSKRKAEPTKSDILDKKLKLKAKFDAEYDNPDDHRIKGDHSYYESLKAEALKQSQLNKSVFENLDNGLRVEVEGFRPGLYVRMLFKNMPSEFVTNFDSSYPLLIGALNMAEQNIGFVSCKVKKHRWYKKILKTNDPLIISLGWRRFQTLPIYSKIEDDLKCRYLKYTPEHVTCNMHFYGPITPQNTGFLALQTVNNNSNEIKQLGFRIAATGSVNEINKSTQIMKKLKLVGTPLKIYKKTAFIKDMFTSTLEVAKFEGAKVKTVSGIRGQIKKALNKPEGAFRATFEDKILMSDVIFCRTWFKVDVSKFYAPVVNLLLPIGAKNAWQGMKTKGQLKRERNIKVEANTDSMYTDIVREPKVFKPLVIPKELQKGLPYKLKPKEKTSTLTKKNIKEKLSNRVAVIKSPHEQKVANVMKMLKTNFEKKREVQKASTAERLKKFKKQQEEEEWRKIKRQKELKKKICRHLSKISNKKQTQM-