Monarch geneset OGS2.0

DPOGS215440
Transcript        DPOGS215440-TA  1344 bp
Protein           DPOGS215440-PA  447 aa
Genomic position  DPSCF300298 + 43802-45445
RNAseq coverage   1300x (Rank: top 10%)
Annotation
Heliconius      HMEL015844        0.0  100.00%
Bombyx          BGIBMGA004603-TA  0.0  99.78%
Drosophila      betaTub85D-PA     0.0  96.05%
EBI UniRef50    UniRef50_Q13509   0.0  91.96%  Tubulin beta-3 chain n=160 Tax=Eukaryota RepID=TBB3_HUMAN
NCBI RefSeq     XP_967267.1       0.0  96.15%  PREDICTED: similar to beta1-tubulin [Tribolium castaneum]
NCBI nr blastp  gi|118404276      0.0  97.91%  tubulin, beta 4B class IVb [Xenopus (Silurana) tropicalis]
NCBI nr blastx  gi|291242746      0.0  96.87%  PREDICTED: Tubulin beta chain-like [Saccoglossus kowalevskii]
Group
Gene Ontology  GO:0005874  0         microtubule
               GO:0007017  0         microtubule-based process
               GO:0051258  2.8e-140  protein polymerization
               GO:0043234  2.8e-140  protein complex
               GO:0007018  4.2e-123  microtubule-based movement
               GO:0005198  4.2e-123  structural molecule activity
               GO:0006184  2.3e-83   GTP catabolic process
               GO:0003924  2.3e-83   GTPase activity
               GO:0005525  1.7e-50   GTP binding
KEGG pathway  tca:655614  0.0
              K07375 (TUBB)  maps-> Pathogenic Escherichia coli infection
                                    Gap junction
                                    Phagosome
InterPro domain  [1-441]    IPR000217  0         Tubulin
                 [1-266]    IPR003008  2.8e-140  Tubulin/FtsZ, GTPase domain
                 [41-58]    IPR002453  4.2e-123  Beta tubulin
                 [244-430]  IPR008280  2.3e-83   Tubulin/FtsZ, C-terminal
                 [261-382]  IPR018316  1.7e-50   Tubulin/FtsZ, 2-layer sandwich domain
                 [374-429]  IPR023123  1.3e-31   Tubulin, C-terminal
Orthology group  MCL10017  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
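
The transcript length, protein length, and genomic position listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on assumptions the record does not state: that the 1344 bp transcript is coding sequence including the stop codon, and that the genomic coordinates are 1-based and inclusive. The function name `codon_count` is hypothetical.

```python
# Values copied from the record; everything else here is an assumption.
TRANSCRIPT_BP = 1344
PROTEIN_AA = 447
GENOMIC_START, GENOMIC_END = 43802, 45445


def codon_count(cds_length_bp):
    """Return the number of codons in a coding sequence of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_length_bp // 3


codons = codon_count(TRANSCRIPT_BP)
# One codon is the stop codon, so the protein is one residue shorter.
expected_protein_aa = codons - 1

# Assumes 1-based, inclusive coordinates on the scaffold.
genomic_span_bp = GENOMIC_END - GENOMIC_START + 1
# Genomic bases not present in the transcript (possibly intronic sequence).
extra_genomic_bp = genomic_span_bp - TRANSCRIPT_BP

print(codons, expected_protein_aa, genomic_span_bp, extra_genomic_bp)
# → 448 447 1644 300
```

Under these assumptions the lengths are consistent: 448 codons give 447 residues plus a stop, and the genomic span exceeds the transcript by 300 bp, which would fit one or more introns.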

Nucleotide sequence:

>DPOGS215440-TA
ATGAGGGAGATCGTGCATATACAAGCAGGCCAGTGTGGAAATCAAATTGGAGCTAAGTTCTGGGAAGTTATATCAGACGAACATGGCATCGATCCAACTGGCACATACCACGGTGACTCCGACCTTCAATTGGAGCGTATAAATGTATACTACAACGAAGCAACTGGAGGCAAATATGTACCGAGAGCCATTTTGGTTGATCTCGAGCCGGGCACTATGGATTCTGTTCGTTCAGGACCTTTTGGCCAAATTTTCCGTCCCGATAATTTTGTCTTTGGACAATCAGGTGCGGGTAATAATTGGGCAAAAGGTCATTACACTGAAGGAGCAGAACTAGTGGATTCTGTTCTTGATGTGGTTAGGAAAGAAGCAGAGGGCTGTGATTGTCTTCAGGGTTTCCAATTGACGCATTCTCTTGGAGGTGGTACCGGTGCTGGGTTAGGAACGTTGCTTATATCAAAAATAAGGGAAGAATACCCTGATCGTATCATGAATACCTTTAGCGTCGTTCCTTCTCCCAAGGTGTCTGATACGGTTGTAGAACCATACAATGCCACGTTATCTGTTCACCAGCTTGTTGAAAATACTGATGAATCGTATTGTATCGATAATGAGGCTTTGTACGATATTTGTTTCCGCACACTTAAGCTGACTACACCGACCTACGGAGATCTAAACCACCTTGTATCAGCAACTATGTCGGGAGTTACTACCTGTCTAAGGTTCCCTGGTCAGTTAAATGCTGACTTGAGAAAGTTGGCGGTGAATATGGTACCGTTTCCTCGTCTTCACTTCTTTATTCCTGGATTCGCACCACTAACTTCTCGCGGCAGCCAGCAATACCGAGCCCTGACTGTTCCAGAACTGACCCAACAAATGTTTGATGCGAAAAATATGATGGCTGCTTGCGATCCACGTCACGGTCGCTATTTAACAGTAGCTGCAGTTTTCCGAGGCCGAATGTCTATGAAAGAAGTTGACGAACAAATGATGAACATACAGAACAAGAATTCCTCGTATTTTGTTGAATGGATACCCAACAATGTTAAGACAGCCGTATGCGATATTCCGCCACGAGGTCTGAAGATGTCCGCCACGTTCATAGGAAATTCTACCGCGATTCAAGAGCTGTTTAAACGAATTTCTGAACAATTTACAGCCATGTTCAGACGCAAGGCATTCTTGCATTGGTATACCGGCGAGGGAATGGATGAAATGGAATTTACAGAGGCTGAGAGTAACATGAACGATTTGGTATCGGAATACCAACAATATCAGGATGCTACCGCTGAAGAAGAGGGAGAATTTGATGAGGAAGAAGAGGGTGGTGATGAAGGAGACTAG

Protein sequence:

>DPOGS215440-PA
MREIVHIQAGQCGNQIGAKFWEVISDEHGIDPTGTYHGDSDLQLERINVYYNEATGGKYVPRAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDVVRKEAEGCDCLQGFQLTHSLGGGTGAGLGTLLISKIREEYPDRIMNTFSVVPSPKVSDTVVEPYNATLSVHQLVENTDESYCIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFIPGFAPLTSRGSQQYRALTVPELTQQMFDAKNMMAACDPRHGRYLTVAAVFRGRMSMKEVDEQMMNIQNKNSSYFVEWIPNNVKTAVCDIPPRGLKMSATFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATAEEEGEFDEEEEGGDEGD-
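
The relationship between the two FASTA records above can be checked by translating the nucleotide sequence and comparing it with the protein sequence. The following is a minimal sketch, not the pipeline used to build the geneset: it assumes the standard genetic code, reading frame 1, and the record's convention of writing the stop codon as `-`. The helper names `parse_fasta` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def parse_fasta(text):
    """Parse FASTA text into a dict mapping record ID to sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The ID is the first whitespace-delimited token after ">".
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {rec_id: "".join(parts) for rec_id, parts in records.items()}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stops become stop_symbol.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Excerpt: the first 48 bases of DPOGS215440-TA.
example = """>excerpt-TA
ATGAGGGAGATCGTGCATATACAAGCAGGCCAGTGTGGAAATCAAATT
"""
records = parse_fasta(example)
print(translate(records["excerpt-TA"]))
# → MREIVHIQAGQCGNQI
```

To check the full record, pass both FASTA entries through `parse_fasta` and compare `translate(records["DPOGS215440-TA"])` with `records["DPOGS215440-PA"]`.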