Monarch geneset OGS2.0

DPOGS203334
Transcript         DPOGS203334-TA   1353 bp
Protein            DPOGS203334-PA   450 aa
Genomic position   DPSCF300003 - 294923-296275
RNAseq coverage    467x (Rank: top 27%)
Annotation
Source             Hit                E-value   Identity   Description
Heliconius         HMEL013498         0.0       99.78%
Bombyx             BGIBMGA002103-TA   0.0       99.78%
Drosophila         alphaTub84B-PA     0.0       97.95%
EBI UniRef50       UniRef50_Q13748    0.0       97.27%     Tubulin alpha-3C/D chain n=49 Tax=Bilateria RepID=TBA3C_HUMAN
NCBI RefSeq        XP_002019473.1     0.0       97.95%     GL12416 [Drosophila persimilis]
NCBI nr blastp     gi|332027696       0.0       96.22%     Tubulin alpha-1 chain [Acromyrmex echinatior]
NCBI nr blastx     gi|91076382        0.0       97.78%     PREDICTED: similar to alpha-tubulin 1 isoform 1 [Tribolium castaneum]
Group
Gene Ontology      GO:0051258   3.9e-129   protein polymerization
                   GO:0043234   3.9e-129   protein complex
                   GO:0007018   1.3e-119   microtubule-based movement
                   GO:0005874   1.3e-119   microtubule
                   GO:0005198   1.3e-119   structural molecule activity
                   GO:0005525   1.3e-119   GTP binding
                   GO:0007017   9.5e-91    microtubule-based process
                   GO:0006184   1e-85      GTP catabolic process
                   GO:0003924   1e-85      GTPase activity
KEGG pathway       cin:100185176   0.0
                   K07374 (TUBA) maps to:
                     Pathogenic Escherichia coli infection
                     Gap junction
                     Phagosome
InterPro domain    [1-451]     IPR000217   0          Tubulin
                   [1-268]     IPR003008   3.9e-129   Tubulin/FtsZ, GTPase domain
                   [18-33]     IPR002452   1.3e-119   Alpha tubulin
                   [246-440]   IPR008280   1e-85      Tubulin/FtsZ, C-terminal
                   [248-393]   IPR018316   2.5e-60    Tubulin/FtsZ, 2-layer sandwich domain
                   [384-439]   IPR023123   1.4e-28    Tubulin, C-terminal
Orthology group    MCL10039   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203334-TA
ATGCGTGAATGCATATCTATTCACGGTGGCCAAGCTGGAGTTCAAATTGGTAACGCTTGCTGGGAGTTATACTGTTTAGAACATGGAATCCAACCGGATGGACAGATGCCATCTGATAAAACCGTCGGCGGTGGTGATGATTCCTTCAACACATTTTTCAGTGAAACGGGTGCTGGAAAACACGTCCCGAGAGCAGTTTTTATAGATTTAGAGCCCACTGTGGTTGACGAAGTCCGCACCGGCACCTATCGGCAACTATTCCACCCAGAACAATTAATAACTGGTAAGGAAGATGCCGCTAACAATTACGCCAGAGGACATTACACCATCGGCAAAGAAATTGTAGACCTTGTTCTAGACAGGGTCCGTAAGTTGGCCGATCAATGCACCGGACTGCAGGGATTTTTGATTTTCCATTCGTTTGGGGGTGGAACTGGTTCTGGATTTGCCTCTTTACTTATGGAACGCTTGTCAGTAGACTACGGCAAAAAATCTAAGCTCGAATTTGCAATATATCCAGCTCCTCAAATATCAACAGCAGTCGTTGAACCATACAATTCTATATTAACTACACACACTACCTTAGAACACTCCGATGCTGCCTTCATGGTGGATAATGAGGCTATATACGATATCTGCAGGAGAAATTTAGATATAGAACGTCCTACATACACTAATTTGAATCGGTTAATTGGACAGATCGTATCTTCAATCACAGCGTCTCTGAGGTTTGATGGTGCATTAAATGTTGATCTTACCGAATTCCAAACAAACTTGGTCCCCTATCCTCGCATTCATTTTCCCTTAGTGACATATGCTCCTGTAATATCTGCAGAGAAGGCCTACCATGAACAACTTTCCGTGGCTGAAATCACGAATGCATGCTTCGAACCGGCAAACCAAATGGTAAAATGCGATCCCAGGCACGGTAAATACATGGCCTGCTGTATGTTGTACCGCGGTGATGTTGTGCCCAAAGACGTCAATGCTGCTATAGGCACTATTAAGACTAAGCGCACTATACAATTTGTCGACTGGTGTCCTACAGGTTTTAAAGTTGGTATCAACTACCAACCACCTACCGTCGTGCCTGGAGGGGACTTGGCGAAGGTGCAGAGAGCTGTCTGTATGCTTTCCAATACCACTGCTATTGCAGAAGCTTGGTCTCGTCTTAACCATAAATTTGATCTAATGTATGCCAAGCGTGCTTTTGTTCATTGGTATGTCGGTGAGGGTATGGAAGAGGGAGAGTTTTCGGAGGCTCGCGAGGATTTGGCTGCTTTAGAGAAGGATTACGAAGAAGTTGGCATGGACTCCGGAGAAGGGGAGGGCGAGGGTGGAGAAGAATATTAA

Protein sequence:

>DPOGS203334-PA
MRECISIHGGQAGVQIGNACWELYCLEHGIQPDGQMPSDKTVGGGDDSFNTFFSETGAGKHVPRAVFIDLEPTVVDEVRTGTYRQLFHPEQLITGKEDAANNYARGHYTIGKEIVDLVLDRVRKLADQCTGLQGFLIFHSFGGGTGSGFASLLMERLSVDYGKKSKLEFAIYPAPQISTAVVEPYNSILTTHTTLEHSDAAFMVDNEAIYDICRRNLDIERPTYTNLNRLIGQIVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLVTYAPVISAEKAYHEQLSVAEITNACFEPANQMVKCDPRHGKYMACCMLYRGDVVPKDVNAAIGTIKTKRTIQFVDWCPTGFKVGINYQPPTVVPGGDLAKVQRAVCMLSNTTAIAEAWSRLNHKFDLMYAKRAFVHWYVGEGMEEGEFSEAREDLAALEKDYEEVGMDSGEGEGEGGEEY-
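
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the short `prefix`/`suffix` fragments (copied from the first 30 and last 18 bases of DPOGS203334-TA) are illustrative choices; in practice the full FASTA sequence would be passed in.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard code applies to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Coordinates and lengths as listed in the record.
span = 296275 - 294923 + 1   # genomic span in bp
codons = span // 3           # includes the stop codon

# First 30 and last 18 bases of the transcript (both in frame).
prefix = "ATGCGTGAATGCATATCTATTCACGGTGGC"
suffix = "GGTGGAGAAGAATATTAA"

print(span, codons - 1)      # 1353 450
print(translate(prefix))     # MRECISIHGG
print(translate(suffix))     # GGEEY-
```

The genomic span equals the transcript length (1353 bp), and 451 codons minus the stop give the listed 450 aa. The translated fragments match the start and end of DPOGS203334-PA.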