Monarch geneset OGS2.0

DPOGS211429
Transcript: DPOGS211429-TA, 1350 bp
Protein: DPOGS211429-PA, 449 aa
Genomic position: DPSCF300115 + 561595-565478
RNAseq coverage: 2092x (Rank: top 6%)
Annotation
Heliconius      HMEL017091        0.0  91.06%
Bombyx          BGIBMGA004681-TA  0.0  96.33%
Drosophila      alphaTub84B-PA    0.0  79.55%
EBI UniRef50    UniRef50_P68363   0.0  80.00%  Tubulin alpha-1B chain n=1161 Tax=root RepID=TBA1B_HUMAN
NCBI RefSeq     NP_001036886.1    0.0  96.45%  alpha-tubulin [Bombyx mori]
NCBI nr blastp  gi|112983479      0.0  96.45%  alpha-tubulin [Bombyx mori]
NCBI nr blastx  gi|112983479      0.0  96.45%  alpha-tubulin [Bombyx mori]
Group
Gene Ontology:
  GO:0051258  3.4e-121  protein polymerization
  GO:0043234  3.4e-121  protein complex
  GO:0007018  5.5e-93   microtubule-based movement
  GO:0005874  5.5e-93   microtubule
  GO:0005198  5.5e-93   structural molecule activity
  GO:0005525  5.5e-93   GTP binding
  GO:0007017  1.1e-79   microtubule-based process
  GO:0006184  7.3e-78   GTP catabolic process
  GO:0003924  7.3e-78   GTPase activity
KEGG pathway:
  xla:493577  0.0
  K07374 (TUBA) maps to:
    Pathogenic Escherichia coli infection
    Gap junction
    Phagosome
InterPro domain:
  [1-440]    IPR000217  0         Tubulin
  [1-267]    IPR003008  3.4e-121  Tubulin/FtsZ, GTPase domain
  [18-33]    IPR002452  5.5e-93   Alpha tubulin
  [245-439]  IPR008280  7.3e-78   Tubulin/FtsZ, C-terminal
  [269-382]  IPR018316  1.9e-49   Tubulin/FtsZ, 2-layer sandwich domain
  [383-436]  IPR023123  1.7e-25   Tubulin, C-terminal
Orthology group: MCL10039 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211429-TA
ATGAGGGAGTGCATCTCGGTGCACGTGGGGCAAGCGGGCGTTCAGATGGGGGTGGCGTGTTGGCAGCTGTATTGCCTCGAGCACGGCATCAGACCTGACGGGACGCTGCCCGGCTGTGACAGCGACGTGGCCGACTCCTGCTTCAACACATTTTTCTCTGAAGCGGACCGAGGCAAGATGGTGCCCAGGGTGGTGATGGTAGATCTAGAAGCTACTGTTATAGACGAGGTCCGCACGGGCGAGTATCGTCAGTTGTATCACCCCGAGCAACTGATCACGGGCAAGGAGGACGCCGCAAACAACTACGCGCGGGGACACTACTCGACCGGCCGCGAGGTGCTCGGACCGGTCATGGAGCGAGTGAGGAAGCTGGCAGACCAGTGCACCGGCCTGCAGGGTTTCTTCGTGTTTCATTCGTTTGGAGGAGGGACGGGTTCAGGCTTTACGTCGCTTCTGATGGAGAAGTTGTCGGACGAATTCGGCAAAAAGAGCAAACTGGAGTTCGCGATATACCCGGCGCCTCAAGTGTCCACGGCGGTAGTTGAACCCTACAACGCGGTGCTGACGACGCACGCTACCATAAGCCACTCGGACTGCGCCTTCATGGTGGACAACGAGGCCATATACGACATCTGCAGGAGAAGGCTGTCCATCGAAAGACCGTCGTACGCGAATCTGAATCGGCTTATATCACAGGTGGTGTCTTCCATCACGGCGTCTCTCCGGTTCGACGGAGCCTTGAACGTGGACTTGACGGAGTTCCAGACGAACCTGGTGCCCTACCCCCGGATACACTTCCCGCTCGCCGCCTACGCACCAGTCGTGTCCGCGGACAAGGCGTACCACGAGGGTATGTCAGTGTCTGAGATCACGGCGGAACTGTTCGAGCCTCAGAACCAGATGGTGAAGTGCGACCCTCGCGAGGGGAAGTACATGGCGTGCTGCCTGCTGTACCGCGGGGACGTGGTGCCCAAGGACGTGAACGCGGCCATCGCGGCCATGAAGGGGCGGGCGGGGATACGCTTCGTGGACTGGTGTCCCACTGGCTTTAAGGTGGGTATCAACTACCAGCCGCCGTCGGTGGTTACGGGCGGAGACCTGGCCCAGGTGAAGCGCGCCGCGTCTATGCTAAGCAACACGACCGCCATCGCGGAGGCGTGGGGGAAGCTTGACCACAAATTCGACCTCATGTACTCGAAGCGGGCCTTTGTCCATTGGTACGTGGGCGAAGGTATGGAGGAGGGGGAATTCACGGACGCCCGGGAGGACCTCGCGGCGCTCGAGAGAGACTATGATGAGGTGGCCATAGAGACGTCGGACATGGCTCCGGGCTGCGAGGACGCCTTATGA

Protein sequence:

>DPOGS211429-PA
MRECISVHVGQAGVQMGVACWQLYCLEHGIRPDGTLPGCDSDVADSCFNTFFSEADRGKMVPRVVMVDLEATVIDEVRTGEYRQLYHPEQLITGKEDAANNYARGHYSTGREVLGPVMERVRKLADQCTGLQGFFVFHSFGGGTGSGFTSLLMEKLSDEFGKKSKLEFAIYPAPQVSTAVVEPYNAVLTTHATISHSDCAFMVDNEAIYDICRRRLSIERPSYANLNRLISQVVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLAAYAPVVSADKAYHEGMSVSEITAELFEPQNQMVKCDPREGKYMACCLLYRGDVVPKDVNAAIAAMKGRAGIRFVDWCPTGFKVGINYQPPSVVTGGDLAQVKRAASMLSNTTAIAEAWGKLDHKFDLMYSKRAFVHWYVGEGMEEGEFTDAREDLAALERDYDEVAIETSDMAPGCEDAL-
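
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `cds_length` and `span_length` are hypothetical, and it assumes that the genomic coordinates are 1-based and inclusive and that the transcript consists of the coding sequence plus one stop codon (the trailing "-" in the protein sequence).

```python
def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding-sequence length in bp for a protein of `protein_aa` residues."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1350
protein_aa = 449

# 449 codons plus one stop codon should account for the whole transcript.
print(cds_length(protein_aa) == transcript_bp)

# Genomic footprint of the gene on scaffold DPSCF300115.
print(span_length(561595, 565478))
```

Under these assumptions the transcript length matches the protein length exactly, and the genomic span is longer than the transcript, which would be consistent with intronic sequence between the annotated coordinates.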