Monarch geneset OGS2.0

DPOGS209136
Transcript: DPOGS209136-TA (2337 bp)
Protein: DPOGS209136-PA (778 aa)
Genomic position: DPSCF300061 - 763725-768415
RNAseq coverage: 198x (Rank: top 47%)
Annotation
Heliconius  HMEL014785  0.0  74.27%
Bombyx  BGIBMGA001309-TA  2e-144  77.36%
Drosophila  alphaTub84B-PA  5e-132  63.01%
EBI UniRef50  UniRef50_P68363  6e-130  63.29%  Tubulin alpha-1B chain n=1161 Tax=root RepID=TBA1B_HUMAN
NCBI RefSeq  XP_974091.1  1e-148  71.76%  PREDICTED: similar to tubulin alpha 6 [Tribolium castaneum]
NCBI nr blastp  gi|91077924  2e-147  71.76%  PREDICTED: similar to tubulin alpha 6 [Tribolium castaneum]
NCBI nr blastx  gi|91077924  2e-149  71.76%  PREDICTED: similar to tubulin alpha 6 [Tribolium castaneum]
Group
Gene Ontology
GO:0051258  3.8e-84  protein polymerization
GO:0043234  3.8e-84  protein complex
GO:0016627  6.9e-84  oxidoreductase activity, acting on the CH-CH group of donors
GO:0016020  6.9e-84  membrane
GO:0055114  6.9e-84  oxidation-reduction process
GO:0006461  6.9e-84  protein complex assembly
GO:0006184  1e-46  GTP catabolic process
GO:0003924  1e-46  GTPase activity
GO:0005525  1e-46  GTP binding
GO:0005874  1.8e-46  microtubule
GO:0007017  1.8e-46  microtubule-based process
GO:0007018  7.6e-46  microtubule-based movement
GO:0005198  7.6e-46  structural molecule activity
GO:0003824  6.4e-10  catalytic activity
KEGG pathway: aga:AgaP_AGAP001744  2e-135
 K02259 (COX15)maps-> Oxidative phosphorylation
    Two-component system
    Porphyrin and chlorophyll metabolism
InterPro domain
[7-229]  IPR003008  3.8e-84  Tubulin/FtsZ, GTPase domain
[434-763]  IPR003780  6.9e-84  Heme A synthase/Protoheme IX farnesyltransferase
[207-353]  IPR008280  1e-46  Tubulin/FtsZ, C-terminal
[14-33]  IPR000217  1.8e-46  Tubulin
[6-19]  IPR002452  7.6e-46  Alpha tubulin
[231-343]  IPR018316  8.6e-41  Tubulin/FtsZ, 2-layer sandwich domain
[433-481]  IPR009003  6.4e-10  Peptidase cysteine/serine, trypsin-like
Orthology group: MCL34791 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209136-TA
ATGTTGGTTTTTTCAGATGATAGAAGCTGTGGAACATTTTTTAGCCATACTGGAGCCGGGAAAATGGTCCCTAGGGTCGTTATGGTTGACTTAGAACCTACACCTATAGATGAGATCAGAACAGGAGCGTATAGGCAACTGTTTCATCCAACATCATTAATTACTGGAAAAGAAGATGCAGCTAGTAATTTTGCACGAGGATATTTTGGTGTGGGTAGAGAGATGATAGATATTGCTCTAAATCGTGTAAGAATAGCGGCGGAAGACTGCAGTTGCCTCCAAGGTTTTATTATCTTCCGATCTTTCGGAGGAGGTACAGGATCTGGATTCACTGCACTATTACTAGATAGTCTCACTAAAGATTATGGTAAACTTTCTAAAATTGAATACGCTATATATCCATCACCAAAAATATCGCCGGTAATAGTAGAGCCGTACAACGCAGTACTGACTGCCCACGCTTGTATGAACACCGAGGACGTATGTTTTATTTTCGACAACGAAGCTCTCTATGATATACTAGCAAGGCTTCTGGATGTACCGAGGCCCACATATACAAATTTAAACAGACTTATCGCACAGGTAGTGTCTTGTATGACGGCGTCATTGAGATTTGAAGGGTCGTTGAACGTGGAACTAGTAGAATTTAGAACAAATTTAATACCCTATCCTAGAATTCATTTCCCTTTAGTGACTTTCGCTCCTTTTGTGCCGCCAACAAAAGCACTTCATGAGACCATGACGACCCAACAGCTAATAATGTCATGCTTCGAACCGTCCAATCAGATGGTTAAATGTGATCCCAGGACGGGAAGTTACATGTCTTGTTGTTTACTGTTCAGGGGCGATGTTAATACTAACGATATTAATTTTGCGATTAATCAAATAAAAAGTATGCGTTCTATTAAATTTGTCTCTTGGTCTCCTACTGGTTTTAAGATTGGTGTAAATAATCAACCACCGACAACCGTCCCTGGGGGCGACTTAGCAGCTCTTCAAAGAGCAGTCGCGATGGTGTCTAATTCTTCAGCTGTTCGTACCGCTTGGGAACGATTAATGTTGGGTATGGCGAATTTATGTCGGTACTCTCAACTTGTAAAAGTTGCTCCGACCAAACTGCTAGGATCAAATTCGGGTGTTAGCCGCTTAGTTTCAAGGCAGCTCATTACACCGATAAGAAACAGCAACCACAAGCACACCATATACAAGGGGTTTCAGATACAGAATATAATAAAATCAAATCCAATAATATTAAGATTCTGTTCATCATCACAACCAAAGAGGTCTAAGCTTGTTGGCTACTGGTTACTGGGATGCAGTGGGATGGTGTTTACTGCTGTTGTTTTAGGCGGAGTGACTCGACTCACTGAGTCTGGGTTATCTATGGTCACATGGAAATTGTTAGGAGAGAAGTTACCAAGAACTGATGAGGAGTGGGAGACGGAGTTCAAGAAATATCAGCAGTACCCGGAGTATATATATAAGAATCATTCACTGACACTGTCCGAGTTCAAATGGATCTGGTATATGGAGTATGCTCATAGGACGTGGGGTCGACTCATAGGGGCCTCTGTCTTCATCCCGGCCGCTGTGTTCTGGGCTAAGGGCTGGTTCGACAAGGCTATGAAGATAAGGGTGTCCGCATACTGCGCGCTCGTTGCTGCACAGGGTCTTATGGGTTGGTACATGGTGAAGTCAGGTCTTGAAGACAGATTTCAAGGGCCGTCGGACGTTCCGCGCGTGTCCCAGTACCGCCTGGCCGCTCATCTCAGTCTCGCCTTCATTCTGTACTCGGGGCTACTGGCCGGAGCCCTGCGGGTGCTCCGCCCCTTCCCTAAGGGAGCTCTCGTGAGGATCAAAGAGCTGGCCGCCGTCACCGGACTCGCGCATGCCGTTAAAGCTATGGCGTTCTTCACGGCTGTTTCAGGAGCGTTCGTGGCCGGTCTAGACGCGGGATTGGTCTACAATTCATTCCCGAAGATGGGTGACAACTGGATCCC
GGACGACATCCTGTCCTTCGCCCCCACCATCAAGAACTTCACGGAGAACCCCACGACAGTTCAATTCGACCATCGGGTCCTTGGCACCAGCACATTGATAGCGGCCACCACACTGTGGCTGATGGCGAGGGGCAGGCCACTGTCCCCGGTGGCGAGGAGGGTGGTCAATGGAGTGGGAGCCATGGCCTGGCTACAGGTGTGCCTGGGTATCATGACGTTGGTCCACTACGTGCCCACTCCGCTGGGCGCGTCTCACCAGGCCGGTTCCCTCGTCCTACTGTCGCTGGCAATCTGGCTCACTCACGAGATCAAGCTACTCAAGTACATACCAAAGTGA

Protein sequence:

>DPOGS209136-PA
MLVFSDDRSCGTFFSHTGAGKMVPRVVMVDLEPTPIDEIRTGAYRQLFHPTSLITGKEDAASNFARGYFGVGREMIDIALNRVRIAAEDCSCLQGFIIFRSFGGGTGSGFTALLLDSLTKDYGKLSKIEYAIYPSPKISPVIVEPYNAVLTAHACMNTEDVCFIFDNEALYDILARLLDVPRPTYTNLNRLIAQVVSCMTASLRFEGSLNVELVEFRTNLIPYPRIHFPLVTFAPFVPPTKALHETMTTQQLIMSCFEPSNQMVKCDPRTGSYMSCCLLFRGDVNTNDINFAINQIKSMRSIKFVSWSPTGFKIGVNNQPPTTVPGGDLAALQRAVAMVSNSSAVRTAWERLMLGMANLCRYSQLVKVAPTKLLGSNSGVSRLVSRQLITPIRNSNHKHTIYKGFQIQNIIKSNPIILRFCSSSQPKRSKLVGYWLLGCSGMVFTAVVLGGVTRLTESGLSMVTWKLLGEKLPRTDEEWETEFKKYQQYPEYIYKNHSLTLSEFKWIWYMEYAHRTWGRLIGASVFIPAAVFWAKGWFDKAMKIRVSAYCALVAAQGLMGWYMVKSGLEDRFQGPSDVPRVSQYRLAAHLSLAFILYSGLLAGALRVLRPFPKGALVRIKELAAVTGLAHAVKAMAFFTAVSGAFVAGLDAGLVYNSFPKMGDNWIPDDILSFAPTIKNFTENPTTVQFDHRVLGTSTLIAATTLWLMARGRPLSPVARRVVNGVGAMAWLQVCLGIMTLVHYVPTPLGASHQAGSLVLLSLAIWLTHEIKLLKYIPK-
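
The record lists a 2337 bp transcript and a 778 aa protein, which is consistent with 779 codons where the final codon is a stop (written as "-" at the end of the protein sequence). The following is a minimal Python sketch for checking that relationship on the sequences above. It assumes the standard genetic code (NCBI translation table 1); the helper names read_fasta and translate and the record name "excerpt" are illustrative only and are not part of the geneset release or of any library.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text):
    """Return {header: sequence}, joining sequence lines that were wrapped."""
    records, header = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}


def translate(nt, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stops become stop_symbol."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(nt), 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")  # X marks an unknown codon
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 27 nt of DPOGS209136-TA, wrapped over two lines as in the listing.
excerpt = """>excerpt
ATGTTGGTTTTTTCA
GATGATAGAAGC
"""
print(translate(read_fasta(excerpt)["excerpt"]))  # MLVFSDDRS
print(translate("ATACCAAAGTGA"))                  # IPK- (last 12 nt)
```

Running translate on the full DPOGS209136-TA sequence (after read_fasta has joined its two lines) and comparing the result with DPOGS209136-PA is a quick way to confirm that the transcript and protein entries agree.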