Monarch geneset OGS2.0

DPOGS210051
Transcript: DPOGS210051-TA  2361 bp
Protein: DPOGS210051-PA  786 aa
Genomic position: DPSCF300017 - 1093960-1097583
RNAseq coverage: 688x (Rank: top 19%)
Annotation
Heliconius: HMEL010421  0.0  72.92%
Bombyx: BGIBMGA012690-TA  0.0  80.92%
Drosophila: CG7338-PA  0.0  54.73%
EBI UniRef50: UniRef50_Q9VP47  0.0  54.73%  Pre-rRNA-processing protein TSR1 homolog n=14 Tax=Diptera RepID=TSR1_DROME
NCBI RefSeq: XP_972503.1  0.0  57.58%  PREDICTED: similar to ribosome biogenesis protein TSR1 [Tribolium castaneum]
NCBI nr blastp: gi|270016043  0.0  57.65%  hypothetical protein TcasGA2_TC012891 [Tribolium castaneum]
NCBI nr blastx: gi|270016043  0.0  57.65%  hypothetical protein TcasGA2_TC012891 [Tribolium castaneum]
Group
Gene Ontology: GO:0042254  5.7e-33  ribosome biogenesis
    GO:0005634  5.7e-33  nucleus
KEGG pathway: tet:TTHERM_00784570  1e-53
 K00162 (PDHB, pdhB) maps-> Citrate cycle (TCA cycle)
    Glycolysis / Gluconeogenesis
    Valine, leucine and isoleucine biosynthesis
    Butanoate metabolism
    Pyruvate metabolism
InterPro domain: [479-766]  IPR007034  3.7e-112  Ribosome biogenesis protein BMS1/TSR1, C-terminal
    [227-307]  IPR012948  5.7e-33  AARP2CN
Orthology group: MCL14136  Single-copy universal gene

Nucleotide sequence:

>DPOGS210051-TA
ATGCAGCAGGCTCATCGATCAGGAAATCTTAAACAAAGTAACAAGGCTCATAAGTCTCGACATAGATCAAAACGGGGAATTTCGGCCGCTGTGAAGGGTAAAGTAAATGTAAAGGAATTTGTGCGTCGAAACCGGCACATCCTTAAGAAAGAGGAGCGCCGTCATCAAGCCCTACAGATTCGTAAAAACAAACGTGAAGAAGTTTTGTCAAAGAAACGCGCACTCGGCGGTAATCGCAATCCTCCGTTTCTTGTGTGTGTGGTGCCGCTTAATGCCCAGCTCGATGTTCAGTCTGCGCTGGTTATACTGAAGACCTGTTCTGAGGGCGCTATTGTTAGTCAAAGTGAGAGTGGAGTCTTACATATCCGGCTACCACATTTTAAACAGAGATTTTCATTTGTTTGTCCTGAGGTTGGCAATGACTTTGCACTGCTAGATGCACTCAAGATATCTGATACAGTTTTGTTTGTGAGTTCAGCTCTGGAGGAGCCAGTTGATGAATGGGGTGAGAAAGCTTTAGCATTGTCCATGGCCCAGGGAATGCCTACACCTGTGGTGGCAGCTATGGATATTGAGGGTGTCCATCCTAAAAAAAGAACAAATGAAAAGCAAAATGTACAAAAACTCATTTCTAAATGGTTGCCGGAAGAGAAAGTTATGCAGCTGGACAAGAGTTCTGATGGATTAAACCTCTTGCGTAGAATCGGTAACCAAAAACGGAATATATTGCATCACAGGGAAAAGAGGCCATATCTTCTCGCGGAGGAAGTTGAGTATGTGCCAGATACTGAAGGGGATCATGGTACTTTAAAAATCAGTGGTTATCTCAGAGGCATGCCCTTAAATGTTAATGGACTCATTCACATTACTGGTCTAGGTGATTATCAAATGTCAAGAATTGATTGCTTGGACGATCCCCATCCTTTACAAACAGGCAAAGAACATGCAAAACAAGATATGATGGATGCTGAAAACAACAAAGTGTCAATACTGCAAGTTGCCAATCCTGACAAACAAGAATCTTTAGAGAGAGAAAATATACCCGATCACATGGATGCTGAACAAACTTGGCCAACTGAAGAAGAAATTATGGAATCAAATATGGAGACTAAGAAGAAAATTAAGAAAGTTCCCAAAGGTTGGTCAGATTATCAGGCAGCTTGGATTGTGGAGTCGGATGCTGAAGGTGATAACTCTGAAGAAGCCAGTGATGAAGATGATGATGATGATGATAATGAAAATGATGAATTTATGTCATGCGAGGAAGATGAGTCTGATAAAGAAGTTAATGAGGAGATCAATGACTTTGAATCTGTCATGGAATCAGAAGTAGGACCCACGGATGAAAGATATGATGCAACTATTGATGCCCATGAAGAGCATGAGATGTTGAAAAAGTTGGCAGCAGCTAAGGAAGATCAACAATTTCCAGATGAAGTAGATACTCCACAAGATGTACCAGCAAGGGAGAGATTTGTCAGATATAGAGGTCTTGAATCATTTAGGACATCTCCATGGGACCCTAAGGAAAATCTTCCACAGGATTATGCTAGGATCTTCCAATTTGAAAACTATGACAGAACAAGAAGAAGGGTATTTAAAGAACTTGAAGATAGCCTCGAAAATATGTATGGATTCTACATAACAATTCATGTGAAAGGTGTTAGGCAAGATTTATGGAAGGCATTCCACGAATCCAATGGCAACACTCCACTATCAGTATTTGGCCTGCTGCCGCATGAACATAAGATGTCATTAATGAATGTTGTTCTGAAGCGCACCGGTGTCAGCGAGGATCCTATTAAAAGCAAAGAGAGGCTCATCTTCCAAGTCGGGTACAGGAGATTCATTGTCAATCCAATATTTAGCCAACACACGAACGGGTCCAAACACAAATACGAGAGATTCTTCCAACCAGGTTCGACGTGTGTCGCTACATTTTTCGCCCCTATTCAGTTTAGTCCATCAACAGTTTTGTGTTTCAAGGAAAAGAAGAACAC
AAAGTTGCAGCTGGTGGCATCTGGAGTGCTGCTGTCCTGTAATCCAGACAGACTGGTTATTAAGAGGATAGTGCTCTCTGGTCATCCTTACAAGGTTAACAAGAAGTCAGCAGTCATAAGGTTTATGTTCTTTAATAGAGATGACGTCATTTACTTCAAACCCTGCAAATTAAGAACTAAATACGGCAGAACCGGACACATTAAGGAACCATTAGGGACCCACGGTCACATGAAGTGTGTGTTCGACGGACAGCTCAAGTCACAGGACACGGTACTCCTTAACCTCTACAAGAGAATGTTCCCCAAATGGACCTACGAGGACTGTATCGTTACTGATAGAAATGAAGATCTTATGGAATAA

Protein sequence:

>DPOGS210051-PA
MQQAHRSGNLKQSNKAHKSRHRSKRGISAAVKGKVNVKEFVRRNRHILKKEERRHQALQIRKNKREEVLSKKRALGGNRNPPFLVCVVPLNAQLDVQSALVILKTCSEGAIVSQSESGVLHIRLPHFKQRFSFVCPEVGNDFALLDALKISDTVLFVSSALEEPVDEWGEKALALSMAQGMPTPVVAAMDIEGVHPKKRTNEKQNVQKLISKWLPEEKVMQLDKSSDGLNLLRRIGNQKRNILHHREKRPYLLAEEVEYVPDTEGDHGTLKISGYLRGMPLNVNGLIHITGLGDYQMSRIDCLDDPHPLQTGKEHAKQDMMDAENNKVSILQVANPDKQESLERENIPDHMDAEQTWPTEEEIMESNMETKKKIKKVPKGWSDYQAAWIVESDAEGDNSEEASDEDDDDDDNENDEFMSCEEDESDKEVNEEINDFESVMESEVGPTDERYDATIDAHEEHEMLKKLAAAKEDQQFPDEVDTPQDVPARERFVRYRGLESFRTSPWDPKENLPQDYARIFQFENYDRTRRRVFKELEDSLENMYGFYITIHVKGVRQDLWKAFHESNGNTPLSVFGLLPHEHKMSLMNVVLKRTGVSEDPIKSKERLIFQVGYRRFIVNPIFSQHTNGSKHKYERFFQPGSTCVATFFAPIQFSPSTVLCFKEKKNTKLQLVASGVLLSCNPDRLVIKRIVLSGHPYKVNKKSAVIRFMFFNRDDVIYFKPCKLRTKYGRTGHIKEPLGTHGHMKCVFDGQLKSQDTVLLNLYKRMFPKWTYEDCIVTDRNEDLME-
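
The transcript and protein lengths above are consistent with each other: 786 codons plus one stop codon give 3 x 786 + 3 = 2361 bp. The following is a minimal sketch of how that relationship, and the correspondence between the two sequences, could be checked. It assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein record marks the stop codon; neither is stated in the record itself. The names CODON_TABLE, translate_cds and cds_length are hypothetical helpers introduced for illustration.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional TCAG codon ordering. Assumption: this record does not
# state which table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate_cds(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" by default, to match
    the convention that the protein record above appears to use).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def cds_length(protein_length):
    """Expected CDS length in bp: three bases per residue plus one stop codon."""
    return 3 * protein_length + 3


# First seven codons and last six codons of DPOGS210051-TA, copied from the record.
print(translate_cds("ATGCAGCAGGCTCATCGATCA"))  # MQQAHRS
print(translate_cds("GAAGATCTTATGGAATAA"))     # EDLME-
print(cds_length(786))                         # 2361
```

To check the full record, the same function can be applied to the complete nucleotide sequence after joining its wrapped lines into a single string.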