Monarch geneset OGS2.0

DPOGS202417
TranscriptDPOGS202417-TA1344 bp
ProteinDPOGS202417-PA447 aa
Genomic positionDPSCF300233 + 181516-182944
RNAseq coverage1085x (Rank: top 12%)
Annotation
HeliconiusHMEL0062790.093.95% 
BombyxBGIBMGA004603-TA0.083.26% 
DrosophilabetaTub85D-PA0.083.02% 
EBI UniRef50UniRef50_Q135090.079.91%Tubulin beta-3 chain n=160 Tax=Eukaryota RepID=TBB3_HUMAN
NCBI RefSeqNP_001093079.10.094.87%beta-tubulin [Bombyx mori]
NCBI nr blastpgi|17695280.096.41%beta-tubulin [Heliothis virescens]
NCBI nr blastxgi|17695280.096.41%beta-tubulin [Heliothis virescens]
Group
Gene OntologyGO:00058740microtubule
GO:00070170microtubule-based process
GO:00512588.2e-130protein polymerization
GO:00432348.2e-130protein complex
GO:00070184e-111microtubule-based movement
GO:00051984e-111structural molecule activity
GO:00061847.9e-81GTP catabolic process
GO:00039247.9e-81GTPase activity
GO:00055252.4e-48GTP binding
KEGG pathwaytca:6556140.0 
 K07375 (TUBB)maps-> Pathogenic Escherichia coli infection
    Gap junction
    Phagosome
InterPro domain[1-441] IPR0002170Tubulin
[1-265] IPR0030088.2e-130Tubulin/FtsZ, GTPase domain
[41-58] IPR0024534e-111Beta tubulin
[244-430] IPR0082807.9e-81Tubulin/FtsZ, C-terminal
[261-382] IPR0183162.4e-48Tubulin/FtsZ, 2-layer sandwich domain
[374-429] IPR0231231.2e-29Tubulin, C-terminal
Orthology groupMCL10017 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202417-TA
ATGAGGGAGATCATTACTCTTCAAGTTGGTTCGTGTGGAAATCAGATTGGAGGAAAGTTCTGGGAGGTAATATCTGACGAGCATGGCATCGACCCCAGCGGCTGCTACCACGGGGATTCGGATCTTCAATTAGAACGCATTAATGTTTACTACAATGAAGCGTCTGGTGGTAAATATGTTCCGAGGTCTATATTGATTGATCTGAAGCCTGCGACGATGGACGCTGTTCGGTCTGGACCGTTTGGTTGTCTCTATCGACCTGACAATTTCGTCTACGGCCAGGGTGGCGGTGCTAACAACTGGGCCAGAGGCCATTACACGGAAGGCGTAGAGATATTAGAATCGGTTCTGGACGTGGTCCGTCGAGAAGCAGAGGGATGCGACTGCTTGCAAGGTTTCCAAATTAGTCATTCGCTTGGAGGCGGGACGGGGTCGGGCATGGGCACGTTGATTATCAACCGGATTAGAGAGGAATACCCCGATCGTATCATACTATCTTTCTCAGTTTTTCCAAGCGCGAGAGTATCAGACTGTGTAGTGGAGCCGTACAATACCACTCTAGCAGTAAATCAGTTGGTCGAAAATACGGATCACACTTTTTGCTTGGACAATGAGGCGCTGTATGATATATGTTTCAGAACACTAAAGCTGACTACTCCTACATATGGCGATTTAAACCATTTGATTTGCGCTACAATGTCGGGTATTACGACCTGTCTTCGTTTTCCCGGTCAGTTGAATGCGGACTTGAGAAAATTGGCTGTGAATATGGTTCCCTTCCCACGCCTCCACTTTTACACACCAGGCTTTGCGCCGCTGACGTCCAGAGGGGCACAACAATACAGAGCTCTTACTGTTCCCGAATTGACACTGCAAATGTTCGACGCGAAGAACATGATGGTCGCCTGTGACCCACGGCACGGGAGATATCTTACAGTAGCTACAATCTTCAGAGGCCGTATGTCAATGAAGGAGGTGGATGAACAAATTATGAATATTCAAAATAAGAATAGTACATATTTTGTTGAATGGATACCGAATAACTGCAAGATCGCAGTATGTGACATACCGCCTCGAGGCTTGAAAATGGCGTCGACATTTATTGGCAATACGACGGCTATACAGAGTATATTTAAAAGGGTGGCGGAACAGTTCGTGGCCATGTTCAGGCGGAAAGCTTTCCTTCACTGGTACACTGGTGAAGGTATGGATGAGATGGAGTTCACGGAAGCGGAAAGCAACATGAATGACCTCATCTCTGAATACCAGCAGTACCAAGACGCGACGGCGGACGATGAAGGAGAATTTGATGAGGAAGCAGAGGGTGAAGGATTGGAGGGTTAA

Protein sequence:

>DPOGS202417-PA
MREIITLQVGSCGNQIGGKFWEVISDEHGIDPSGCYHGDSDLQLERINVYYNEASGGKYVPRSILIDLKPATMDAVRSGPFGCLYRPDNFVYGQGGGANNWARGHYTEGVEILESVLDVVRREAEGCDCLQGFQISHSLGGGTGSGMGTLIINRIREEYPDRIILSFSVFPSARVSDCVVEPYNTTLAVNQLVENTDHTFCLDNEALYDICFRTLKLTTPTYGDLNHLICATMSGITTCLRFPGQLNADLRKLAVNMVPFPRLHFYTPGFAPLTSRGAQQYRALTVPELTLQMFDAKNMMVACDPRHGRYLTVATIFRGRMSMKEVDEQIMNIQNKNSTYFVEWIPNNCKIAVCDIPPRGLKMASTFIGNTTAIQSIFKRVAEQFVAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLISEYQQYQDATADDEGEFDEEAEGEGLEG-