Monarch geneset OGS2.0

DPOGS202402
TranscriptDPOGS202402-TA1353 bp
ProteinDPOGS202402-PA450 aa
Genomic positionDPSCF300233 - 186343-187774
RNAseq coverage1487x (Rank: top 9%)
Annotation
HeliconiusHMEL0074170.089.53% 
BombyxBGIBMGA004603-TA0.086.48% 
DrosophilabetaTub85D-PA0.085.08% 
EBI UniRef50UniRef50_Q135090.082.74%Tubulin beta-3 chain n=160 Tax=Eukaryota RepID=TBB3_HUMAN
NCBI RefSeqXP_967267.10.083.86%PREDICTED: similar to beta1-tubulin [Tribolium castaneum]
NCBI nr blastpgi|24433440.086.48%beta-tubulin [Halocynthia roretzi]
NCBI nr blastxgi|3272639830.085.94%PREDICTED: tubulin beta-4 chain-like [Anolis carolinensis]
Group
Gene OntologyGO:00058740microtubule
GO:00070170microtubule-based process
GO:00512582.8e-134protein polymerization
GO:00432342.8e-134protein complex
GO:00070186.1e-108microtubule-based movement
GO:00051986.1e-108structural molecule activity
GO:00061848.5e-79GTP catabolic process
GO:00039248.5e-79GTPase activity
GO:00055257.3e-49GTP binding
KEGG pathwaytca:6556140.0 
 K07375 (TUBB)maps-> Pathogenic Escherichia coli infection
    Gap junction
    Phagosome
InterPro domain[1-441] IPR0002170Tubulin
[1-266] IPR0030082.8e-134Tubulin/FtsZ, GTPase domain
[41-58] IPR0024536.1e-108Beta tubulin
[244-430] IPR0082808.5e-79Tubulin/FtsZ, C-terminal
[246-383] IPR0183167.3e-49Tubulin/FtsZ, 2-layer sandwich domain
[374-428] IPR0231231.5e-26Tubulin, C-terminal
Orthology groupMCL10017 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202402-TA
ATGAGGGAGATCGTCCACATCCAGGTCGGGCGGTGCGGAAATCAAATAGGATCGAAGTTCTGGGAGGTGATATCGGATGAACATGGCATAGATCCAAGTGGTTGCTACGCCGGAGATTCTGATCTACAACTCGAACGCATCAACGTGTATTACAACGAAGCGGCAGCCGGTAAATACGTGCCGCGAGCTGTTCTTGTTGATCTCGAACCCGGGACGATGGATTCGTTACGCGCTGGGCCATACGGTCAAATATTTCGTCCCGATAACATAGTTTTTGGTGTATCGGGAGCTGGTAATAATTGGGCTAAAGGACATTATACAGAAGGAGCGGACTTGCTCGAGTCGGTCTTGGATGTTATTAGGAAGGAAGCTGAAGGTTGTGACTGTCTTCAAGGTTTCGAACTGATTCACTCGTTGGGAGGCGGTACTGGCTCAGGTTTAGGAACCTTGTTGCTGAATAATTTAAGGGAAGAGTATGCAGATAGAATTATTTTAACGTTTTCCGTCGTCCCGAGCCCTAAAGTTTCTGATACCGTCGTAGAGCCGTATAATGCCACGTTATCATTAAACCAGCTCATAGAAAATTCCGATCAATCATTTTGTATAGACAACGAAGCTTTGTACGATATTTGTTTCCGAACGTTGCGACTGCAAACACCCACATACGGTGACTTGAACCATTTGGTGTCGGCGACGATGTCTGGTGTCACGACGTGCCTGCGGTTTCCCGGACAATTAAATGCGGACCTTCGAAAGCTTGCAGTCAATATGGTGCCGTTCCCGAGGCTACACTTCTTCATGCCGGGATTCGCTCCGCTCACAGCCAGGGGCAGTCAGCAGTACAGAGCATTGACCGTACCTGAGCTCACTCTCCAGATGTTCGACGCCAAGAACATGATGGCAGCATGCGATCCACGTCACGGACGATATCTCACCGTGGCGGCTGTGTTCCGCGGTCGGATGTCAATGAAAGAAGTCGACGAGCAAATGCTTAATATACAGAACAAAAATAAAGACTACTTTGTGAAATGGATACCCAATAACGTTAAGACCGCCGTGTGTGACATCCCACCCCGTGGATTGAAAATGTCTGCAACTTTTATTGGCAATACGACCGCCATACAAGAGATTCTCAAAAGAGTGTCTGAACAATTCGCCTCCATGTTTCGAAGGAAAGCATTCATACACTGGTACACCGGCGAGGGTATGGACGAAACTGACTTCACCGAGGCAGACAATAACCTCAGCGATCTTATATCAGAATATCAACAGTACCAAGATGCGACAACAGAAGAACAAGAATTCGAGGAAGAGGAGGACGAAGCTGCACCAAATGAAGAGAGTGACCAATAA

Protein sequence:

>DPOGS202402-PA
MREIVHIQVGRCGNQIGSKFWEVISDEHGIDPSGCYAGDSDLQLERINVYYNEAAAGKYVPRAVLVDLEPGTMDSLRAGPYGQIFRPDNIVFGVSGAGNNWAKGHYTEGADLLESVLDVIRKEAEGCDCLQGFELIHSLGGGTGSGLGTLLLNNLREEYADRIILTFSVVPSPKVSDTVVEPYNATLSLNQLIENSDQSFCIDNEALYDICFRTLRLQTPTYGDLNHLVSATMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTARGSQQYRALTVPELTLQMFDAKNMMAACDPRHGRYLTVAAVFRGRMSMKEVDEQMLNIQNKNKDYFVKWIPNNVKTAVCDIPPRGLKMSATFIGNTTAIQEILKRVSEQFASMFRRKAFIHWYTGEGMDETDFTEADNNLSDLISEYQQYQDATTEEQEFEEEEDEAAPNEESDQ-