Monarch geneset OGS2.0

DPOGS209862
TranscriptDPOGS209862-TA1377 bp
ProteinDPOGS209862-PA458 aa
Genomic positionDPSCF300510 - 13536-22003
RNAseq coverage618x (Rank: top 21%)
Annotation
HeliconiusHMEL0226080.093.95% 
BombyxBGIBMGA001683-TA0.091.67% 
DrosophilabetaTub60D-PA0.081.42% 
EBI UniRef50UniRef50_Q135090.082.47%Tubulin beta-3 chain n=160 Tax=Eukaryota RepID=TBB3_HUMAN
NCBI RefSeqXP_967267.10.084.38%PREDICTED: similar to beta1-tubulin [Tribolium castaneum]
NCBI nr blastpgi|910860930.084.38%PREDICTED: similar to beta1-tubulin [Tribolium castaneum]
NCBI nr blastxgi|3132256110.085.62%unnamed protein product [Oikopleura dioica]
Group
Gene OntologyGO:00512586.5e-138protein polymerization
GO:00432346.5e-138protein complex
GO:00070181.1e-108microtubule-based movement
GO:00058741.1e-108microtubule
GO:00051981.1e-108structural molecule activity
GO:00055251.1e-108GTP binding
GO:00070172.4e-101microtubule-based process
GO:00061841.4e-80GTP catabolic process
GO:00039241.4e-80GTPase activity
KEGG pathwaytca:6556140.0 
 K07375 (TUBB)maps-> Pathogenic Escherichia coli infection
    Gap junction
    Phagosome
InterPro domain[1-443] IPR0002170Tubulin
[1-266] IPR0030086.5e-138Tubulin/FtsZ, GTPase domain
[41-58] IPR0024531.1e-108Beta tubulin
[244-430] IPR0082801.4e-80Tubulin/FtsZ, C-terminal
[261-382] IPR0183161e-47Tubulin/FtsZ, 2-layer sandwich domain
[374-429] IPR0231231.9e-30Tubulin, C-terminal
Orthology groupMCL10017 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209862-TA
ATGCGTGAAATAGTACATGTGCAAGCGGGCCAGTGCGGTAATCAGATTGGTTCAAAGTTCTGGGAGGTGATATCGGACGAACATGGCATAGATCCGAATGGTTACTACAAGGGAGTGAGCGACCTGCAAGCCGAACGCCTCAACGTGTACTATAATGAAGCCGCTGAAGGTAAATTTGTTCCTCGGGCTGTTCTCGTGGATCTGGAGCCAGGGACCATGGACTCAGTGCGTTCAGGTCCCTATGGGCAACTGTTCCGACCTGATAACTTCGTATTCGGACAGAGCGGCGCTGGTAACAACTGGGCCAAGGGTCACTACACAGAGGGCGCTGAGCTCGTTGATGCTGTGTTGGATGTTATCCGGAAGGAGGCTGAAGGTTGCGACTGTCTGCAAGGTTTCCAAATGACGCATTCTCTTGGCGGAGGCACTGGTTCAGGCATGGGGACACTGTTGTTGAGCAAGATCAGAGAGGAGTACCCTGATCGTATTATGAATACCTTCAGTGTTGTGCCTTCACCCAAGGTCAGCGAAGTTGTCCTGGAGCCGTATAATGCGACTCTGTCGGTGCACCAACTGGTCGAGAACACGGATATGTCCTATTGCATCGACAACGAAGCCCTTTATAACATTTGCTTCAGAACGTTGCGATTGAGCAGCCCAAGTTACGGCGATCTGAATCATTTGATATCAATGACAATGTCCGGTGTCACGACTTGCCTCCGTTTCCCTGGACAATTGAATGCTGATCTCAGGAAGCTGGCCGTCAACATGGTGCCCTTCCCGAGACTGCACTTTTTTATGCCCGGCTTCGCTCCTCTGACAGCAAGAAACTCGTTCAACTACCGCCCTCAGACCGTTCCGGAGCTTATGAGCCAAATGTTCAACCCTGGGAACATGATGACGGCTTGCGACCCCCGTCACGGCCGCTACCTCACAGTGGCCACCGTGTTCAGAGGTCACATGTCCATGAGAGAGGTCGACGACCAAGTGTTGGCGGTCCAGAACAAGAACTCGAGCTACTTCGTGGAATGGATCCCCAACAACCTGAAGGTGGCCGTCTGCGACGTCCCGCCGCGCGGCCTCAAGATGTCCGCCACGTTCATCGGCAACTCCACCGCCATCCAGGAGATATTCAAACGCATTTCAGAACAGTTCACCGCCATGTTCAGGAGAAGAGCGTTCCTCCACTGGTACACGGGCGAGGGTATGGACGAGATGGAGTTCACGGAGGCGGCCAGCAACATGGCCGACCTCATATCAGAGTACCAACAGTACCAGGAGGCTAACGTGGATGATGAAGAGGTAGGCTTCGACGAGGAAGAGGAAGAAGACGATCAAAATTACGACCACAAGGAGTCGGTGCACGTCCCGCTATAG

Protein sequence:

>DPOGS209862-PA
MREIVHVQAGQCGNQIGSKFWEVISDEHGIDPNGYYKGVSDLQAERLNVYYNEAAEGKFVPRAVLVDLEPGTMDSVRSGPYGQLFRPDNFVFGQSGAGNNWAKGHYTEGAELVDAVLDVIRKEAEGCDCLQGFQMTHSLGGGTGSGMGTLLLSKIREEYPDRIMNTFSVVPSPKVSEVVLEPYNATLSVHQLVENTDMSYCIDNEALYNICFRTLRLSSPSYGDLNHLISMTMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTARNSFNYRPQTVPELMSQMFNPGNMMTACDPRHGRYLTVATVFRGHMSMREVDDQVLAVQNKNSSYFVEWIPNNLKVAVCDVPPRGLKMSATFIGNSTAIQEIFKRISEQFTAMFRRRAFLHWYTGEGMDEMEFTEAASNMADLISEYQQYQEANVDDEEVGFDEEEEEDDQNYDHKESVHVPL-