Monarch geneset OGS2.0

DPOGS213386
Transcript        DPOGS213386-TA    1344 bp
Protein           DPOGS213386-PA    447 aa
Genomic position  DPSCF300109 + 322389-326088
RNAseq coverage   12601x (Rank: top 1%)
Annotation
Database        Hit               E-value  Identity  Description
Heliconius      HMEL016368        0.0      98.66%
Bombyx          BGIBMGA009132-TA  0.0      98.43%
Drosophila      betaTub85D-PA     0.0      94.65%
EBI UniRef50    UniRef50_Q13509   0.0      91.01%    Tubulin beta-3 chain n=160 Tax=Eukaryota RepID=TBB3_HUMAN
NCBI RefSeq     NP_001036964.1    0.0      98.43%    beta-tubulin [Bombyx mori]
NCBI nr blastp  gi|112983318      0.0      98.43%    beta-tubulin [Bombyx mori]
NCBI nr blastx  gi|112983318      0.0      98.43%    beta-tubulin [Bombyx mori]
Group
Gene Ontology
GO term     E-value   Description
GO:0005874  0         microtubule
GO:0005525  0         GTP binding
GO:0007017  0         microtubule-based process
GO:0051258  1.8e-140  protein polymerization
GO:0043234  1.8e-140  protein complex
GO:0007018  3.8e-121  microtubule-based movement
GO:0005198  3.8e-121  structural molecule activity
GO:0006184  6e-84     GTP catabolic process
GO:0003924  6e-84     GTPase activity
KEGG pathway  tca:655614  0.0
 K07375 (TUBB) maps-> Pathogenic Escherichia coli infection
    Gap junction
    Phagosome
InterPro domain
Position   Entry      E-value   Description
[1-441]    IPR000217  0         Tubulin
[1-266]    IPR003008  1.8e-140  Tubulin/FtsZ, GTPase domain
[41-58]    IPR002453  3.8e-121  Beta tubulin
[244-430]  IPR008280  6e-84     Tubulin/FtsZ, C-terminal
[246-383]  IPR018316  2.9e-51   Tubulin/FtsZ, 2-layer sandwich domain
[374-429]  IPR023123  2.5e-31   Tubulin, C-terminal
Orthology group   MCL10017    Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213386-TA
ATGAGGGAAATCGTTCATATCCAGGCCGGCCAATGCGGAAACCAGATCGGTGCTAAGTTCTGGGAGATCATCTCCGACGAGCACGGCATCGACCCCACTGGTGCCTACCATGGAGACTCCGACTTGCAGCTGGAGCGCATCAATGTGTACTACAATGAGGCCTCCGGTGGCAAGTACGTCCCCCGCGCCATCCTCGTGGACTTGGAGCCCGGCACCATGGACTCTGTCCGCTCTGGACCTTTCGGACAGATCTTCCGCCCTGACAACTTCGTCTTCGGTCAATCCGGAGCCGGCAACAACTGGGCCAAGGGTCACTACACAGAGGGAGCTGAACTAGTCGATTCAGTTTTAGACGTCGTGCGTAAAGAGGCAGAGTCGTGCGACTGTCTTCAAGGTTTCCAACTGACACACTCACTGGGAGGCGGCACCGGGTCCGGCATGGGCACGTTACTCATCTCCAAGATCAGAGAGGAATATCCCGACAGAATCATGAACACATACTCAGTAGTCCCCTCGCCCAAAGTATCAGACACCGTCGTAGAACCTTACAACGCGGTGCTCTCCATCCATCAGCTAGTCGAAAACACAGACGAAACATATTGCATAGATAACGAGGCCCTATACGATATCTGCTACAGAACACTCAAAGTGCCCAACCCGACTTACGGCGACCTCAACCATCTCGTGTCGCTCACCATGTCCGGTGTCACCACCTGCCTCAGGTTCCCCGGTCAGTTAAACGCCGACCTCCGCAAGTTGGCCGTCAACATGGTTCCCTTCCCCCGTCTCCACTTCTTCATGCCCGGATTCGCACCCCTGACATCTCGCGGCAGCCAGCAGTACCGCGCCCTGACCGTTCCCGAGCTCACTCAGCAGATGTTCGACGCCAAGAACATGATGGCAGCCTGCGACCCTCGCCACGGCCGCTACCTCACCGTGGCCGCCATCTTCCGTGGTCGCATGTCCATGAAGGAGGTGGACGAGCAGATGCTCAACATCCAGAACAAGAACTCGTCGTACTTCGTGGAATGGATCCCCAACAACGTGAAGACCGCCGTGTGCGACATCCCTCCCCGCGGTCTCAAGATGGCCGCCACCTTCATCGGCAACTCCACCGCCATCCAAGAGCTGTTCAAGCGCATCTCGGAGCAGTTCACCGCCATGTTCAGACGCAAGGCTTTCCTCCACTGGTACACCGGCGAGGGAATGGACGAGATGGAGTTCACGGAGGCGGAGAGCAACATGAACGACCTGGTGTCCGAGTACCAGCAGTACCAGGAAGCCACCGCCGACGAGGACGCGGAGTTCGACGAGGAACAGGAGCAGGAGATCGAGGAGAACTAG

Protein sequence:

>DPOGS213386-PA
MREIVHIQAGQCGNQIGAKFWEIISDEHGIDPTGAYHGDSDLQLERINVYYNEASGGKYVPRAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDVVRKEAESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTYSVVPSPKVSDTVVEPYNAVLSIHQLVENTDETYCIDNEALYDICYRTLKVPNPTYGDLNHLVSLTMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQMFDAKNMMAACDPRHGRYLTVAAIFRGRMSMKEVDEQMLNIQNKNSSYFVEWIPNNVKTAVCDIPPRGLKMAATFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQEATADEDAEFDEEQEQEIEEN-
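
The lengths in this record agree with each other: 1344 bp is 448 codons, i.e. 447 amino acids plus one stop codon, which the protein entry appears to write as a trailing "-". The Python sketch below illustrates that relationship under the assumption that the standard genetic code (NCBI translation table 1) applies. The `translate` helper and its `stop_symbol` parameter are hypothetical names introduced here for illustration, not part of the database's own pipeline. The example translates only the first seven codons of DPOGS213386-TA followed by the transcript's TAG stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (base order T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is an assumption chosen
    to match the trailing character of the protein entry in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons of DPOGS213386-TA, followed by its TAG stop codon.
fragment = "ATGAGGGAAATCGTTCATATC" + "TAG"
print(translate(fragment))

# Length bookkeeping for the full record: codons minus the stop codon.
print(1344 // 3 - 1)
```

Passing the full nucleotide sequence from the record to `translate` instead of the fragment would allow a direct comparison against the DPOGS213386-PA entry.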