Monarch geneset OGS2.0

DPOGS210672
Transcript        DPOGS210672-TA   1278 bp
Protein           DPOGS210672-PA   425 aa
Genomic position  DPSCF300013 - 1165939-1171779
RNAseq coverage   246x (Rank: top 42%)
Annotation
Database        Hit                E-value  Identity  Description
Heliconius      HMEL007693         0.0      94.01%
Bombyx          BGIBMGA006290-TA   9e-174   88.92%
Drosophila      CG3085-PA          4e-108   46.85%
EBI UniRef50    UniRef50_Q9W1V2    5e-106   46.85%    CG3085 n=29 Tax=Endopterygota RepID=Q9W1V2_DROME
NCBI RefSeq     XP_967107.1        6e-127   55.08%    PREDICTED: similar to GA15915-PA [Tribolium castaneum]
NCBI nr blastp  gi|91092970        1e-125   55.08%    PREDICTED: similar to GA15915-PA [Tribolium castaneum]
NCBI nr blastx  gi|91092970        2e-127   55.08%    PREDICTED: similar to GA15915-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0000226            3.4e-96  microtubule cytoskeleton organization
                 GO:0005874            3.4e-96  microtubule
KEGG pathway
InterPro domain  [19-399] IPR000435    3.4e-96  Tektin
Orthology group  MCL13633              Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210672-TA
ATGCAATCTGTAGTTACATTCGAAAAACCTCTACCCCATCTAAGTCTACCGGATTGGGATGCCAGACTCTACGGCTTGCAGGTGACGGCAGACACTAGACGTGCCGATGCCTTCGACCTTCGACACGGCGCACATCAACTCCGCAATGAAACTAGAATTAAGACAGAATGGGATAGCTACCATAATAATAACCGATTGAGAGCTAGAGTATATGAAATAGAACAATGGAAATCAACCCTTCAAGAGCTTTTAGATCGCTTGGATAGAGAAATGAGTGCTCTGAAAGAAGAAAAAGCATCCACAGAGAGGGAGTTGGAGCAATTGAATATGCCGCTACTCGTTTGCTCCGAATGTCTATCAAACAGAGACGGAAGGAGGAGCAGTGAGCTTACTTATGACCTCGCTGACACTGAATTAAAAAAGGAACTATGTGTAACGGAAAGTAATAAGAAAATGCTGATAGATAGATGTCAATCGGCGTGGGAGAAAATTAATAAGTTGGAAGTAGTGAAATTTAAATTACAACTAGATCTCAATGACAAAAATGAGGCTTTACAGATCGACAAAGACATGCTTAGTTTGGAGAAGAACAGCGCCAACATAACTTACAAAACCGATTCACTTAAGAATCCAAATAGAATGATAACATACGAACAATGGCTGGATAAATGCGAAGCAACAAAGAAGATGGCGGTCGACGAGCTTCAAGATACTCTGAGGTTAAGAGAGTCGCTGTTCGTGGCACGTGGTCGGGCAAGGAACGCTCTGCGAGCTCAAACGGACGTTACCAACTACATGATGAGAAGACGGATATATGACACCCAGAGAGCCAGAAATGAACTACAGTGGCAGAAGCTGAAGATGGAAGAAAACATGGACAAACTGGCAACAGAACTAAAGACTATAGGCGAACAGTTCGCAGACAAAGTTAATGCATTAAAAGTAGCCGAAACTCGTTTAGAAACGCGTGGTTACCGACCGGGAGTGGAACTTGCTGCAGATGAGGCGGATATCGGCTTAAAAGAAGAAGTACGTAATCTGAGAGAGACGATCCGTCAGCTGCAGGAAAAACAAGACTGCGCTAAGGCCACATACAACGCTCTAGAGGCTGCCTCGATAAAGATTGCCATAGATCTTAGCGATAAGGAGCAGTCATTGGAAACTGACACTCAGGCCCTTGAAATGAGGGAAGCCTTAGAGCCAAAGAAACCACAGGGCACTGACAAAAATTTGATACTCGCTAGCATACAAGACGAACTGCCGAAAGTTGAGGCTTAA

Protein sequence:

>DPOGS210672-PA
MQSVVTFEKPLPHLSLPDWDARLYGLQVTADTRRADAFDLRHGAHQLRNETRIKTEWDSYHNNNRLRARVYEIEQWKSTLQELLDRLDREMSALKEEKASTERELEQLNMPLLVCSECLSNRDGRRSSELTYDLADTELKKELCVTESNKKMLIDRCQSAWEKINKLEVVKFKLQLDLNDKNEALQIDKDMLSLEKNSANITYKTDSLKNPNRMITYEQWLDKCEATKKMAVDELQDTLRLRESLFVARGRARNALRAQTDVTNYMMRRRIYDTQRARNELQWQKLKMEENMDKLATELKTIGEQFADKVNALKVAETRLETRGYRPGVELAADEADIGLKEEVRNLRETIRQLQEKQDCAKATYNALEAASIKIAIDLSDKEQSLETDTQALEMREALEPKKPQGTDKNLILASIQDELPKVEA-
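
Consistency check (illustrative sketch):

The transcript and protein records above should agree: 1278 bp corresponds to 425 codons plus one stop codon, and the protein record writes the stop as a trailing "-". The following minimal Python sketch shows one way such a check could be done. It assumes the standard genetic code applies to this transcript, and the helper name `translate` is hypothetical (it is not part of the OGS2.0 release). Only the first and last few codons of DPOGS210672-TA are used in the usage lines.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard (nuclear) code applies to this transcript.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Hypothetical helper for illustration. Stop codons are written as
    `stop_symbol`, matching the trailing "-" in the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


# Length check: 425 residues plus one stop codon, three bases each.
lengths_agree = (1278 == 3 * (425 + 1))

# First seven codons and last four codons of DPOGS210672-TA.
start = translate("ATGCAATCTGTAGTTACATTC")
end = translate("GTTGAGGCTTAA")
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS210672-PA record would extend the same check to the whole gene.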