Monarch geneset OGS2.0

DPOGS200512
TranscriptDPOGS200512-TA1350 bp
ProteinDPOGS200512-PA449 aa
Genomic positionDPSCF300450 + 45031-52414
RNAseq coverage320x (Rank: top 36%)
Annotation
HeliconiusHMEL0176240.094.66% 
BombyxBGIBMGA009132-TA0.086.18% 
DrosophilabetaTub60D-PA0.080.89% 
EBI UniRef50UniRef50_Q135090.083.56%Tubulin beta-3 chain n=160 Tax=Eukaryota RepID=TBB3_HUMAN
NCBI RefSeqXP_392313.10.084.56%PREDICTED: similar to -Tubulin at 56D CG9277-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|480955250.084.56%PREDICTED: tubulin beta-1 chain [Apis mellifera]
NCBI nr blastxgi|2904622050.086.04%Tubulin beta chain [Lepeophtheirus salmonis]
Group
Gene OntologyGO:00058740microtubule
GO:00055250GTP binding
GO:00070170microtubule-based process
GO:00512584.4e-135protein polymerization
GO:00432344.4e-135protein complex
GO:00070182.3e-109microtubule-based movement
GO:00051982.3e-109structural molecule activity
GO:00061841.2e-79GTP catabolic process
GO:00039241.2e-79GTPase activity
KEGG pathwayame:4087820.0 
 K07375 (TUBB)maps-> Pathogenic Escherichia coli infection
    Gap junction
    Phagosome
InterPro domain[1-440] IPR0002170Tubulin
[1-267] IPR0030084.4e-135Tubulin/FtsZ, GTPase domain
[41-58] IPR0024532.3e-109Beta tubulin
[245-431] IPR0082801.2e-79Tubulin/FtsZ, C-terminal
[262-383] IPR0183166.5e-49Tubulin/FtsZ, 2-layer sandwich domain
[375-428] IPR0231238.2e-31Tubulin, C-terminal
Orthology groupMCL10017 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200512-TA
ATGCGTGAAATAGTTCATCTCCAGGCGGGACAGTGCGGCAATCAGATCGGCTCTAAGTTCTGGGAGATAATATCTGACGAGCACGGCATCGACCCGACCGGCGTGTACCGCGGCAACAGCGACTTACAGCTGGACAGAATCCAGGTGTATTACAATGAAGCAGCAGATGGTAACCGTTACGTGCCGCGTGCGGTGCTCGTGGACTTGGAGCCGGGCACCATGGACTCTATCAGGGGCTCGAGCCACGGGAGACTGTTCCGACCTGATAACTACGTGTTTGGTCAGAGCGGAGCCGGAAACAACTGGGCGAAGGGCCACTATACGGAGGGCGCGGAATTGATTGACTCTGTCATGGACGTGGTGAGGAAGGAGGCGGAGCCTTGCGACTGTTTACAAGGATTCCAGTTAACCCACTCGCTTGGCGGCGGCACGGGCTCGGGGCTCGGCACGCTGCTGCTCAGCAAACTACGAGAGGAGTACCCGGACAGGATCGTCAACACGTTCAGTGTCACGCCTTCACCTAAGGTGTCTGACACAGTGGTGGAGCCTTACAACGCGACGCTGTCCGTCCATCAGCTCGTGGAGAACACGGACGAGACCTTCTGCATCGACAACGAGGCGCTCTACGACATCTGCTTCCGGACCCTGAGACTTGCAACACCCACGTATGGTGACCTCAACCATTTGGTGTCGTTGACTATGTCCGGAGTGACCACTTGTCTCCGATTCCCGGGACAGTTGAACGCGGACCTCCGCAAGTTGGCCGTCAACATGGTGCCCTTCCCACGACTGCACTTCTTTATGCCCGGGTTCGCTCCCCTCACCGCTCGCAACAGCCAGGAGTACCGAGCGCTTACAGTTGCCGAGCTCACACAGCAGATGTTTTCTCCCGTCAACATGATGGCGGCGTGTGACCCTCGCCGCGGCCGTTACCTCACCGTGGCAGCCATCTTCAGAGGACGAGTCAGCACCAAAGAGGTGGAGGAACAAATGATCAGCGCTCAGGACAAGAACAGCAGCTACTTCGTCGAGTGGATACCAAATAATGTGAAGGTTGCTGTGTGTGACGTCCCTCCACGCGGCCTCAAAATGGCTGCCACCTTCGTTGGGAACACGACCGCTATACAGGAAATATTCAAAAGAATCTCTGAACAGTTCACGTTAATGTTTAGGAGGAAGGCCTTCCTTCATTGGTACACGGGCGAGGGGATGGACGAGATGGAGTTCACCGAGGCGGAAAGCAACATGAACGATCTGGTCTCCGAGTACCAACAGTATGAGGAAGTTGGAGTTGAAGACGAGTTTGATGAACAAGAAGAAACACAGGAAGAGGAATATCCTGACGAATAA

Protein sequence:

>DPOGS200512-PA
MREIVHLQAGQCGNQIGSKFWEIISDEHGIDPTGVYRGNSDLQLDRIQVYYNEAADGNRYVPRAVLVDLEPGTMDSIRGSSHGRLFRPDNYVFGQSGAGNNWAKGHYTEGAELIDSVMDVVRKEAEPCDCLQGFQLTHSLGGGTGSGLGTLLLSKLREEYPDRIVNTFSVTPSPKVSDTVVEPYNATLSVHQLVENTDETFCIDNEALYDICFRTLRLATPTYGDLNHLVSLTMSGVTTCLRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTARNSQEYRALTVAELTQQMFSPVNMMAACDPRRGRYLTVAAIFRGRVSTKEVEEQMISAQDKNSSYFVEWIPNNVKVAVCDVPPRGLKMAATFVGNTTAIQEIFKRISEQFTLMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSEYQQYEEVGVEDEFDEQEETQEEEYPDE-