Monarch geneset OGS2.0

DPOGS202403
TranscriptDPOGS202403-TA1329 bp
ProteinDPOGS202403-PA442 aa
Genomic positionDPSCF300233 - 184096-185511
RNAseq coverage1911x (Rank: top 6%)
Annotation
HeliconiusHMEL0163710.095.46% 
BombyxBGIBMGA003442-TA0.098.56% 
DrosophilabetaTub85D-PA0.096.04% 
EBI UniRef50UniRef50_Q135090.092.76%Tubulin beta-3 chain n=160 Tax=Eukaryota RepID=TBB3_HUMAN
NCBI RefSeqXP_967267.10.096.60%PREDICTED: similar to beta1-tubulin [Tribolium castaneum]
NCBI nr blastpgi|910860930.096.60%PREDICTED: similar to beta1-tubulin [Tribolium castaneum]
NCBI nr blastxgi|1129833420.097.96%beta-tubulin [Bombyx mori]
Group
Gene OntologyGO:00058740microtubule
GO:00070170microtubule-based process
GO:00512581.7e-141protein polymerization
GO:00432341.7e-141protein complex
GO:00070185e-122microtubule-based movement
GO:00051985e-122structural molecule activity
GO:00061843.2e-84GTP catabolic process
GO:00039243.2e-84GTPase activity
GO:00055252.5e-51GTP binding
KEGG pathwaytca:6556140.0 
 K07375 (TUBB)maps-> Pathogenic Escherichia coli infection
    Gap junction
    Phagosome
InterPro domain[1-442] IPR0002170Tubulin
[1-266] IPR0030081.7e-141Tubulin/FtsZ, GTPase domain
[41-58] IPR0024535e-122Beta tubulin
[244-430] IPR0082803.2e-84Tubulin/FtsZ, C-terminal
[261-382] IPR0183162.5e-51Tubulin/FtsZ, 2-layer sandwich domain
[374-429] IPR0231232.1e-31Tubulin, C-terminal
Orthology groupMCL10017 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202403-TA
ATGAGGGAAATCGTCCACATCCAAGCCGGGCAATGCGGCAACCAAATTGGGGCTAAGTTTTGGGAAGTAATATCAGACGAACACGGTATTGACCCTACAGGAGCTTACCAGGGTGACTCCGACCTGCAATTGGAGCGCATTAATGTATATTACAATGAGGCTTCCGGCTGCAAATACGTACCGAGAGCCGTTCTCGTCGACCTTGAGCCTGGCACTATGGATTCTGTAAGGTCCGGACCATTCGGGCAGATATTTCGCCCGGACAATTTTGTTTTCGGGCAATCTGGTGCCGGTAACAATTGGGCGAAAGGACACTATACGGAGGGAGCCGAGTTGGTGGATTCGGTTCTCGATGTAGTCAGAAAGGAAGCAGAGGGTTGTGACTGTCTGCAAGGATTCCAGTTGACGCATTCTCTTGGTGGTGGTACTGGCTCGGGGATGGGTACTTTGTTGATATCAAAAATAAGAGAAGAATACCCCGACCGGATCATGAATACCTTTTCGGTTGTACCCAGTCCAAAAGTGTCGGACACTGTAGTCGAACCTTACAATGCTACTTTATCTGTCCACCAATTAGTGGAAAATACGGATGAAACTTACTGCATCGATAATGAAGCGTTATATGATATTTGTTTCCGCACTCTTAAATTGACTACCCCTACCTACGGTGACTTGAATCACTTGGTGTCAGCGACGATGTCTGGTGTCACAACCTGTCTACGTTTTCCCGGCCAACTAAACGCTGATCTCCGTAAACTTGCTGTCAACATGGTTCCATTTCCGCGATTACATTTTTTCATGCCAGGTTTCGCACCGCTCACATCAAGAGGCAGTCAACAATATAGAGCACTCACTGTACCAGAACTTACTCAGCAAATGTTCGACGCTAAGAACATGATGGCGGCGTGTGATCCTCGCCATGGGCGATATCTGACTGTGGCTGCTGTATTCCGTGGCCGCATGTCCATGAAGGAAGTGGACGAGCAGATGTTAAATATACAAAATAAAAACAGCAGCTACTTCGTTGAATGGATTCCGAATAACGTAAAGACGGCCGTTTGCGATATTCCACCTCGTGGACTCAAGATGTCCGCCACATTTATCGGTAATACAACAGCTATACAGGAGTTGTTCAAGAGAATATCTGAACAATTTTCAGCCATGTTTAGACGAAAAGCTTTTTTGCATTGGTACACTGGCGAAGGAATGGACGAGATGGAGTTCACTGAAGCTGAGAGTAATATGAATGACCTGGTGTCTGAGTACCAGCAGTATCAAGATGCAACGGTCGATGATGAAGGGGAATTTGACGAGGAAGTGGAAGAATAG

Protein sequence:

>DPOGS202403-PA
MREIVHIQAGQCGNQIGAKFWEVISDEHGIDPTGAYQGDSDLQLERINVYYNEASGCKYVPRAVLVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDVVRKEAEGCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTFSVVPSPKVSDTVVEPYNATLSVHQLVENTDETYCIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQMFDAKNMMAACDPRHGRYLTVAAVFRGRMSMKEVDEQMLNIQNKNSSYFVEWIPNNVKTAVCDIPPRGLKMSATFIGNTTAIQELFKRISEQFSAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYQDATVDDEGEFDEEVEE-