Monarch geneset OGS2.0

DPOGS210217
Transcript         DPOGS210217-TA    1260 bp
Protein            DPOGS210217-PA    419 aa
Genomic position   DPSCF300196 - 607665-619058
RNAseq coverage    376x (Rank: top 32%)

Annotation
  Heliconius       HMEL013932        0.0  92.03%
  Bombyx           BGIBMGA002542-TA  0.0  94.19%
  Drosophila       alphaTub84B-PA    0.0  92.27%
  EBI UniRef50     UniRef50_G6DDQ0   0.0  95.61%  Moesin n=3 Tax=Opisthokonta RepID=G6DDQ0_DANPL
  NCBI RefSeq      XP_002019473.1    0.0  92.25%  GL12416 [Drosophila persimilis]
  NCBI nr blastp   gi|112983501      0.0  94.20%  alpha-tubulin [Bombyx mori]
  NCBI nr blastx   gi|112983501      0.0  94.20%  alpha-tubulin [Bombyx mori]

Group

Gene Ontology
  GO:0051258  1.2e-124  protein polymerization
  GO:0043234  1.2e-124  protein complex
  GO:0007018  4e-95     microtubule-based movement
  GO:0005874  4e-95     microtubule
  GO:0005198  4e-95     structural molecule activity
  GO:0005525  4e-95     GTP binding
  GO:0007017  1.3e-64   microtubule-based process
  GO:0006184  3e-60     GTP catabolic process
  GO:0003924  3e-60     GTPase activity

KEGG pathway
  tca:655491  0.0
  K07374 (TUBA) maps->
    Pathogenic Escherichia coli infection
    Gap junction
    Phagosome

InterPro domain
  [1-410]    IPR000217  0         Tubulin
  [1-288]    IPR003008  1.2e-124  Tubulin/FtsZ, GTPase domain
  [18-33]    IPR002452  4e-95     Alpha tubulin
  [266-414]  IPR008280  3e-60     Tubulin/FtsZ, C-terminal
  [268-413]  IPR018316  3.1e-59   Tubulin/FtsZ, 2-layer sandwich domain

Orthology group    MCL10039  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210217-TA
ATGAGGGAATGCATCTCAGTTCACATCGGCCAAGCCGGAGTTCAGATCGGTAACGCCTGCTGGGAGCTGTACTGCCTCGAGCATGGCATCCAGCCAGACGGACAGATGCCCTCGGACAAGACCCTCGGCGGCGGAGACGACTCCTTCAACACCTTCTTCAGTGAGACCGGCGCCGGCAAGCACGTGCCGAGAGCCGTGTTCGTCGACCTCGAGCCTACAGTCGTTGGTCCAAAACCTAAGCATTCATCGGTCAGTGAACTGAAAAAGTCGAACGCATTAGATGAAGTCCGCACTGGCACCTACCGCCAGCTGTTCCACCCTGAACAACTGATCACTGGCAAGGAAGACGCCGCCAACAACTACGCCAGAGGTCACTACACCATCGGCAAGGAGATCGTCGACGTCGTTCTTGACAGACTGAGGAAACTCGCTGACCAATGTACTGGACTACAGGGTTTCTTGGTGTTCCACTCCTTCGGCGGCGGCACCGGTTCAGGGTTCACGTCTCTGCTCATGGAACGTCTGTCAGTGGACTACGGCAAGAAGTCCAAGCTGCTAGAGTTCTCCATCTACCCCCCCGCGCCCCAGGTATCGACGGCGGTGGTCGAGCCCTACAATTCGATCCTGACCACTCACACAACCCTGGAACACTCGGACTGCGCCTTCATGGTGGACAACGAGGCTATCTACGACATCTGTCGCAGGAACCTCGACATCGAGCGCCCGACCTACACCAACCTGAACAGGTTGATCGGCCAGATAGTATCCTCGATCACGGCATCCCTTCGTTTCGACGGCGCCCTGAACGTGGACCTGACGGAGTTCCAAACGAATTTGGTTCCGTATCCCCGCATCCACTTCCCGCTGGCGACGTACGCGCCCGTCATCTCCGCAGAGAAGGCTTACCACGAACAGCTCACTGTAGCTGAGATCACCAACGCTTGCTTTGAACCAGCCAACCAGATGGTGAAATGCGATCCGAGACACGGCAAGTACATGGCTTGCTGCATGTTGTATAGAGGGGATGTGGTACCGAAGGATGTAAACGCAGCCATCGCCACTATTAAGACTAAGAGGACTATTCAGTTCGTTGACTGGTGTCCAACAGGATTCAAGGTTGGTATCAACTACCAGCCCCCGACGGTCGTCCCAGGAGGTGATCTGGCGAAGGTACAAAGAGCCGTGTGCATGTTGTCCAATACTACCGCCATCGCCGAAGCTTGGGCCAGGTACGGTGCAAAGAGAATTATAAATAAATAA

Protein sequence:

>DPOGS210217-PA
MRECISVHIGQAGVQIGNACWELYCLEHGIQPDGQMPSDKTLGGGDDSFNTFFSETGAGKHVPRAVFVDLEPTVVGPKPKHSSVSELKKSNALDEVRTGTYRQLFHPEQLITGKEDAANNYARGHYTIGKEIVDVVLDRLRKLADQCTGLQGFLVFHSFGGGTGSGFTSLLMERLSVDYGKKSKLLEFSIYPPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLDIERPTYTNLNRLIGQIVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLATYAPVISAEKAYHEQLTVAEITNACFEPANQMVKCDPRHGKYMACCMLYRGDVVPKDVNAAIATIKTKRTIQFVDWCPTGFKVGINYQPPTVVPGGDLAKVQRAVCMLSNTTAIAEAWARYGAKRIINK-
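
The transcript and protein lengths in this record can be cross-checked with simple arithmetic. The nucleotide sequence ends in the stop codon TAA, and the trailing "-" in the protein sequence appears to mark that stop, so the protein should be one residue shorter than the codon count. The function below is a minimal sketch of that check; its name and parameters are hypothetical, not part of any MonarchBase tooling, and it assumes the transcript entry is a complete coding sequence with no untranslated regions.

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the protein length (aa) implied by a coding sequence length (bp).

    Assumes the sequence is a complete CDS in frame 0 (an assumption,
    not something the record states explicitly).
    """
    if transcript_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = transcript_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Lengths taken from the record: 1260 bp transcript, 419 aa protein.
print(expected_protein_length(1260))  # 1260 / 3 = 420 codons, minus stop = 419
```

The result matches the 419 aa listed for DPOGS210217-PA.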