Monarch geneset OGS2.0

DPOGS203266
Transcript: DPOGS203266-TA, 1632 bp
Protein: DPOGS203266-PA, 543 aa
Genomic position: DPSCF300229 + 42554-44301
RNAseq coverage: 115x (Rank: top 58%)
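
The lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive and that the 1632 bp transcript is coding sequence including the stop codon; the record states neither, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the record above.
transcript_bp = 1632
protein_aa = 543
genomic_start, genomic_end = 42554, 44301

# Assumption: coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1

# Assumption: the transcript is coding sequence plus one stop codon.
expected_cds_bp = (protein_aa + 1) * 3

# Bases inside the genomic span that are absent from the transcript.
# This would be consistent with intronic sequence, but the record does not say.
extra_bp = genomic_span - transcript_bp

print(genomic_span, expected_cds_bp, extra_bp)
```

Under these assumptions the transcript length matches the protein length exactly, and the genomic span is somewhat longer than the transcript.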
Annotation
Heliconius: HMEL015363; E-value 0.0; 78.27%
Bombyx: BGIBMGA000446-TA; E-value 0.0; 67.16%
Drosophila: mst-PA; E-value 4e-96; 34.78%
EBI UniRef50: UniRef50_D2A4U9; E-value 4e-126; 43.59%; Putative uncharacterized protein GLEAN_15425 n=1 Tax=Tribolium castaneum RepID=D2A4U9_TRICA
NCBI RefSeq: XP_972672.1; E-value 8e-127; 43.59%; PREDICTED: similar to misato CG1424-PA [Tribolium castaneum]
NCBI nr blastp: gi|332374418; E-value 8e-139; 43.12%; unknown [Dendroctonus ponderosae]
NCBI nr blastx: gi|332374418; E-value 1e-135; 43.12%; unknown [Dendroctonus ponderosae]
Group
Gene Ontology: GO:0051258; E-value 5.2e-35; protein polymerization
Gene Ontology: GO:0043234; E-value 5.2e-35; protein complex
KEGG pathway 
InterPro domain: [3-323] IPR003008; E-value 5.2e-35; Tubulin/FtsZ, GTPase domain
InterPro domain: [4-95] IPR019605; E-value 1.4e-29; Misato Segment II, myosin-like
Orthology group: MCL15380; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203266-TA
ATGTGTACTCGAGAGATTCTGACTTTACAGTTCGGTCATTATACAAATTATGTAGGCAGTCATTTTTGGAATGTCCAGGAATTAAGTTTTGATTACACGGGTACTGTTAAGACTGAATGTAATCACGATATATTGTATCGCGAAGGACAGTCATCTTCTGGTGAAGTTACTTATACTCCCCGTTTGTTATTGGCTGACTTGAAAGGATCTCTAAAAACTTTACCAGCTACTGGAGGTTTAGAGGAGACAAATTTAGAAGGTGATTTTTCATGGGATGAAGTTGAAACAATTAAGGAACCAGAACCTGACAAGAATGAGTTCCTAAAAGATATAAATGCCGACACAACAAATTTACATGATAAAGAATATAAATTAGAGCAGGATATAAATACATGGACCGATTATTTATATCCTCGATTCCATTCTCGCACAGTAAATATTATAAAGGAATATCAACATAATTCAGAAAATGAAAGTTTTGATATCTTCACATCCGGCTGTACTCTATGGAAATCAGATTATGGAGAAACATTTGCAGATAATATCAGGAAATATGTTGAAGAATGTGATAGCTTGCAAGGATTCCAAGTCAACTTTGATTGTACTGATGGATTTTCGGGACTTGCTCTTGGCTGCATAGAACATATATCTGATGAATACTCTAAGACTATATTGTCATATCCGATCATAGCATCACATTTCTCAGACAACAGTCCATCAACAGAAGAAGAAAGAGAGAAGGCAACACTTAAAGATTCATTTAGATTAGTTAATATTGCATTATCTATTGAAGCACTATCCCAGCACGTTAAACTATTTGTTCCTTTATGTACTGGTGAAAAAGGATGGAGAAAGCCGGGAAATCCTCGGCTATTTGACAATATTCATTATAAACCAGAATGCTATTATCATTCCTCAGCATTGTTAGCTTCGGCTATGGACAGTTTAGGCCAGAAATACAGACATAAAAGCAATGTCTACACAATGTCTGATGTTTGTGCCGATATGACAGGTTATGGAAGAAAGATGGCAGCTGCGTCACTTGGAATACCATTAACCTTCAGTGAATCTCAATATCTCATAGAATATTTAAACAGCACCACCAGACCTATTTATCAATCCATCACACCAAGTTGTAAGATTGCTACTGACAAACTTTTCCAGTTAATCACTGTGAGGGGTATTCCAGAAACATACTTGAAAGCACCAATGAAAGAAGCTAAAGAACAGTTGAATATTCCTGCATATAGATGTAACAGTGTAAAGGAAATGTTTGAATTGTATTTCCAAGCAAATAATTTCCTATCTGCAACTAATGTTACTGTATGTGAAAAACCGCTCGAATTAAAAACCCCGTATCCGCGCTTCTTCTCAGATAGTCTTAATAAATATGGTTTTATTAAAAGTGGACCGGAACCAGACAAGGTTGAATCTTGTCCTGTCATAGCAGGACTCCACAATGGCAACTTCATGGCGGATATGATTGAAAAATTACATAGAGATGTAAGCAGAATTAAATTCTCTAAATTGCACAAATTTGGTAACGAAGGTCTCGAATATGGCGATTATAAAGAGAGTTTGGACAGGATAGCTGAATTCAAAGACAATTATGAAGACAATTTCGAGCTTTAA

Protein sequence:

>DPOGS203266-PA
MCTREILTLQFGHYTNYVGSHFWNVQELSFDYTGTVKTECNHDILYREGQSSSGEVTYTPRLLLADLKGSLKTLPATGGLEETNLEGDFSWDEVETIKEPEPDKNEFLKDINADTTNLHDKEYKLEQDINTWTDYLYPRFHSRTVNIIKEYQHNSENESFDIFTSGCTLWKSDYGETFADNIRKYVEECDSLQGFQVNFDCTDGFSGLALGCIEHISDEYSKTILSYPIIASHFSDNSPSTEEEREKATLKDSFRLVNIALSIEALSQHVKLFVPLCTGEKGWRKPGNPRLFDNIHYKPECYYHSSALLASAMDSLGQKYRHKSNVYTMSDVCADMTGYGRKMAAASLGIPLTFSESQYLIEYLNSTTRPIYQSITPSCKIATDKLFQLITVRGIPETYLKAPMKEAKEQLNIPAYRCNSVKEMFELYFQANNFLSATNVTVCEKPLELKTPYPRFFSDSLNKYGFIKSGPEPDKVESCPVIAGLHNGNFMADMIEKLHRDVSRIKFSKLHKFGNEGLEYGDYKESLDRIAEFKDNYEDNFEL-
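
The nucleotide and protein records above can be checked against each other by translating the transcript. The following is a minimal sketch, assuming the standard genetic code and that the trailing "-" in the protein record marks the stop codon; the `translate` helper is a hypothetical name and not part of any tool referenced by this page.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First and last codons of DPOGS203266-TA, copied from the record above.
print(translate("ATGTGTACTCGAGAGATTCTGACT"))
print(translate("GAGCTTTAA"))
```

Running `translate` on the full DPOGS203266-TA sequence and comparing the result with DPOGS203266-PA is a quick way to confirm that the two records describe the same gene model.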