Monarch geneset OGS2.0

DPOGS213388
Transcript        DPOGS213388-TA  1341 bp
Protein           DPOGS213388-PA  446 aa
Genomic position  DPSCF300109 + 352318-355328
RNAseq coverage   2223x (Rank: top 5%)
Annotation
Heliconius        HMEL016365        0.0  99.09%
Bombyx            BGIBMGA009133-TA  0.0  98.42%
Drosophila        betaTub60D-PA     0.0  91.88%
EBI UniRef50      UniRef50_Q13509   0.0  87.91%  Tubulin beta-3 chain n=160 Tax=Eukaryota RepID=TBB3_HUMAN
NCBI RefSeq       NP_001036888.1    0.0  97.72%  beta-tubulin [Bombyx mori]
NCBI nr blastp    gi|112983456      0.0  97.72%  beta-tubulin [Bombyx mori]
NCBI nr blastx    gi|112983456      0.0  97.72%  beta-tubulin [Bombyx mori]
Group
Gene Ontology     GO:0005874  0         microtubule
                  GO:0005525  0         GTP binding
                  GO:0007017  0         microtubule-based process
                  GO:0051258  7.5e-122  protein polymerization
                  GO:0043234  7.5e-122  protein complex
                  GO:0007018  1.8e-115  microtubule-based movement
                  GO:0005198  1.8e-115  structural molecule activity
                  GO:0006184  2.9e-82   GTP catabolic process
                  GO:0003924  2.9e-82   GTPase activity
KEGG pathway      cqu:CpipJ_CPIJ003263  0.0
                  K07375 (TUBB) maps-> Pathogenic Escherichia coli infection
                                       Gap junction
                                       Phagosome
InterPro domain   [9-438]    IPR000217  0         Tubulin
                  [7-261]    IPR003008  7.5e-122  Tubulin/FtsZ, GTPase domain
                  [30-47]    IPR002453  1.8e-115  Beta tubulin
                  [239-425]  IPR008280  2.9e-82   Tubulin/FtsZ, C-terminal
                  [256-377]  IPR018316  7.6e-50   Tubulin/FtsZ, 2-layer sandwich domain
                  [369-424]  IPR023123  2.1e-30   Tubulin, C-terminal
Orthology group   MCL10017  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213388-TA
ATGTATAATTTAAAAGAAACAACTTTCTGGGAGATAATATCCGAGGAGCACGGCATCGACCCCACGGGAGTGTACAGAGGAACCAGCGACCTGCAGCTCGAACGGATCTCAGTCTACTATAATGAGGCCTCTGTTGCGACGGCGGAGAACGGGGGCAAGTACGTCCCCCGCGCCATCCTGCTCGACCTCGAGCCCGGGACTATGGACGCCGTGCGCTCCGGGGGCTACGGCCAGCTGTTCCGCCCGGACAACTTCGTCTTCGGTCAGTCCGGCGCAGGGAACAACTGGGCCAAGGGTCACTACACAGAGGGAGCTGAACTCGTCGACGCCGTACTCGACGTAGTGCGCAAGGAGTGCGAAAACTGCGACTGTCTCCAGGGCTTCCAACTGACACACTCACTGGGAGGCGGCACCGGGTCCGGCATGGGCACGTTACTCATCTCCAAGATCAGAGAGGAATATCCCGACAGAATCATGAACACATACTCAGTTGTCCCCTCGCCCAAAGTATCAGACACCGTCGTAGAACCTTACAACGCGGTGCTCTCCATCCATCAGCTAGTCGAAAACACAGACGAAACATATTGCATAGATAACGAGGCCCTATACGATATCTGCTACAGAACACTCAAAGTGCCCAACCCGACTTACGGTGACCTCAACCATCTCGTGTCGCTCACCATGTCCGGTGTCACCACCTGCCTCAGGTTCCCCGGTCAGTTAAACGCCGACCTCCGCAAGTTGGCCGTCAACATGGTTCCCTTCCCCCGTCTCCACTTCTTCATGCCCGGATTCGCACCCCTGACATCTCGCGGCAGCCAGCAGTACCGCGCCCTGACCGTTCCCGAGCTCACTCAGCAGATGTTCGACGCCAAGAACATGATGGCAGCCTGCGCCCCTCGCCACGGCCGCTACCTCACCGTGGCCGCCATCTTCCGTGGTCGCATGTCCATGAAGGAGGTGGACGAGCAGATGCTCTCTATTCAAAACAAAAACAGCAGCTTCTTCGTAGAATGGATCCCCAACAACGTGAAGACTGCCGTGTGCGACATTCCTCCCAAGGGTCTCAAGATGTCTTCCACGTTCATCGGAAACACGACAGCCATCCAAGAGCTGTTCAAGCGCATCTCGGAGCAGTTCACCGCCATGTTCAGACGTAAGGCTTTCCTCCACTGGTACACCGGCGAGGGGATGGACGAGATGGAGTTCAACGAAGCAGAGAGCAACGTCAACGACCTGGTCTCCGAGTACCAGCAGTACCAGGAGGCGACGGCGGAAGACGACACGGAGTTCGACCAGGAGGAGATGGAGGAGCTGGCGCAGGACGACCACCACGACTGA

Protein sequence:

>DPOGS213388-PA
MYNLKETTFWEIISEEHGIDPTGVYRGTSDLQLERISVYYNEASVATAENGGKYVPRAILLDLEPGTMDAVRSGGYGQLFRPDNFVFGQSGAGNNWAKGHYTEGAELVDAVLDVVRKECENCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTYSVVPSPKVSDTVVEPYNAVLSIHQLVENTDETYCIDNEALYDICYRTLKVPNPTYGDLNHLVSLTMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQMFDAKNMMAACAPRHGRYLTVAAIFRGRMSMKEVDEQMLSIQNKNSSFFVEWIPNNVKTAVCDIPPKGLKMSSTFIGNTTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFNEAESNVNDLVSEYQQYQEATAEDDTEFDQEEMEELAQDDHHD-
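
Consistency check (illustrative):

The transcript and protein lengths listed above agree with each other if the 1341 bp transcript is read as 447 codons: 446 amino acids plus one stop codon. The sketch below shows one way to verify this and to re-derive the protein from the nucleotide sequence. It assumes the standard nuclear genetic code and follows this record's convention of writing the stop codon as "-". The names `CODON_TABLE`, `translate`, and `lengths_consistent` are hypothetical helpers for illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (assumed), listed in T, C, A, G order.
# Stop codons are written "-" to match the protein record above.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY--CC-W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, keeping the stop as '-'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # A codon containing an ambiguous base (e.g. N) raises KeyError here.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First nine and last four codons of DPOGS213388-TA, copied from the record.
    print(translate("ATGTATAATTTAAAAGAAACAACTTTC"))
    print(translate("CACCACGACTGA"))
    print(lengths_consistent(1341, 446))
```

Passing the full DPOGS213388-TA sequence to `translate` and comparing the result with DPOGS213388-PA extends the same check to the whole record.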