Monarch geneset OGS2.0

DPOGS202793
Transcript: DPOGS202793-TA (2256 bp)
Protein: DPOGS202793-PA (751 aa)
Genomic position: DPSCF300018 - 719573-723023
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: HMEL009279; E-value 0.0; identity 86.08%
Bombyx: BGIBMGA010473-TA; E-value 0.0; identity 69.69%
Drosophila: CG9068-PA; E-value 1e-117; identity 32.09%
EBI UniRef50: UniRef50_E2C2D9; E-value 4e-163; identity 41.76%; Dynein heavy chain 2, axonemal n=12 Tax=Formicidae RepID=E2C2D9_HARSA
NCBI RefSeq: XP_967358.2; E-value 0.0; identity 44.56%; PREDICTED: similar to 1-beta dynein [Tribolium castaneum]
NCBI nr blastp: gi|189240969; E-value 0.0; identity 44.56%; PREDICTED: similar to 1-beta dynein [Tribolium castaneum]
NCBI nr blastx: gi|189240969; E-value 2e-180; identity 44.56%; PREDICTED: similar to 1-beta dynein [Tribolium castaneum]

Group
KEGG pathway: tca:655704; E-value 0.0
    K10408 (DNAH) maps to Huntington's disease
InterPro domain: [217-699] IPR013594; E-value 2.8e-69; Dynein heavy chain, N-terminal domain-1
Orthology group: MCL17146; Insect specific

Nucleotide sequence:

>DPOGS202793-TA
ATGTCTGACTTAGATGTGAGGAAAAGAGTGAAAGTAACTTGGAGTGATGACTTGAGTCATAACTCAGAAGAAGACAGAGAGCGGGAGCGACAAGCACAGCTTGAACGAGAATTAGCTGAGCAGCCCGTGAAACCTGTCTACGAGCCCGATGAACTGCAAAAGCTGATTACCTACATCATGAAAATGACTACCTTGTATGACCTAAGAGACGAAGACTGGAACGACGAGGCGAAGAAAGGCATTGAAGAATGGCTGACTGAACCGAAAGCATTAATTCTTTGTGTTTATTTTAAAGCCGATAAACTTAAAGCCTCTAGTGATGTACCAATGTCTCCTGTCTACGATTTGACGTATTTTTTGCGACAACCGGACTATGTGTTTAAAGCTGAGACATTTCATGACGATATTGTGTTTGGCACTTTCGTGGACTCTGTTGAGGCAAATTTGATACAGATATTGGAACTGATGTACGCTCCGTATTTTTTTGCCATTACCACATGGCCAGACAGTGTGAAAAGTGAATTTTGTTCTCAATTGCATACGTTTTTGGCAAAACTAACAGATATGTATTATAAAATGCTTGGCCTTACTGTTCTATACATTCCACGAGAAGGCCAACAACTGTCTTTCGAAAAGGCGAGCACTAATAGAGAGTTAGTGAAGAGATTGGAAGGTGTGGTAGTTTACTGGACCCACCAGATCAAATCTTGCATTGAAGATCAGTCTTCGGTTGCCTCCCAGAAAGAACTTCTATGTCCTAGTGATGAATATGAGTTTTGGGTATACAGACATGAGAACCTGAATGCTCTAGCCCATCAACTGCAAAATCCTGCAGTGAAGCATATAACCAAAATACTAGTCACGACCCATTCAACATTTATTCACCAGTTTCAATCTCTTTGTGAAGAGATCATGCAAAAAATTAAGGAAGCTACGTCTAACATCGAATACTTGCAAGTGATAAAACAACCGTGCGCTATTTTAGAGTGCGTGGTTGATCCGGACGAAATTTCAAATCACATTCCCACTATAATAAACCTCTTTAGATTTATTTGGATGGAGTCGCCTTTCTACAACTCTGAAACGCGCATTACAAACCTCTTCAAAGCACTTAGCAACCAAATAATTATTCTGTGCAAGAACTATATCAAGTTAGATGAATTGTTTGATGGACAAACTAAGAAGGCACTGGGGGAATTCACCAAATGCATTGACTGTTGTAAGAAATATAGAGAAATTTATGACCTAATGGCAGAAGCCCATAGCGAGAAAAACCCTGGGACTTGGGAGTTAGATACCGGCTCTATATTTAACTATATCGACTCTTTCGTACAGAGATGTTTCGATATGCTCGATGTATGCAACTGTATGATTATTTTTGCTAGAATTGATGAGCTAGAAGTTATCAGCAAGCCTATGTTTGGGGGAGCCCATGGCGACCAATTCGAAGCTAAATGCGATCAGATAGAGCACATGTTTCATGATGCACTGGATAATGTTAAGGCAGTTGCCACCACTATTCTGGACGTTCAGGCTCCATCCTGGTATGATGATATACTTCAGTTTCGAACGGTTATTAAGGATATCGAGATAATTATCGAGAACTTGGTGGAAACCGTATTCGAAGGTGTCAATCACGTCGAGGAGGCTGTCATTGCGCTATTCTCCTTACATAATTACTCTAAGAGAAAGAATCTGAAACGCATATTCAAAAGAAAGACTGCAGAGTTAAATGTGTGGGCAATGTTCAGTGATGAAGTCCAAGAAGCTAAAAAGGAGACTGTGACGTCTCGCGGGACGTACGTCGCAGACCTTCCGTCTTTCGCTGGTCGAGCAGCGTTGCTTCGAGTCCGCAGGAATAGACTAGCGTATCTCAAAAAGGTTCTTATTGACGCGTCCGTGTGGATGATGCCCTGCAGCAACTCTGAGGACGTGGTAATGCATGTCAATAGATTAATGGGAGCCATGGATGTAGCCATCAGAGAGTTGTGGATTTCTTG
GACTCACAATTTGGATGAAAAGTGTTCGGCTGGTTTGAATAAAACGTTGATGCGTAAGAGCGCGGAAAACCCTGGATTGCTGGAATGCAACATTGACGATTGTAATATGATGCAGGTTTATATGAAGAGCAAGACAGTTTTCTATGTCTATGAAAGTGTCCTGGCTGTCGTGAAGGGCTACAATAAGATCCTAGATTCCCTTTCAGAAGAGGAAAGGCTGCTGTTCAAGCCATTGACACTGGTAAGATGTAACTAA

Protein sequence:

>DPOGS202793-PA
MSDLDVRKRVKVTWSDDLSHNSEEDRERERQAQLERELAEQPVKPVYEPDELQKLITYIMKMTTLYDLRDEDWNDEAKKGIEEWLTEPKALILCVYFKADKLKASSDVPMSPVYDLTYFLRQPDYVFKAETFHDDIVFGTFVDSVEANLIQILELMYAPYFFAITTWPDSVKSEFCSQLHTFLAKLTDMYYKMLGLTVLYIPREGQQLSFEKASTNRELVKRLEGVVVYWTHQIKSCIEDQSSVASQKELLCPSDEYEFWVYRHENLNALAHQLQNPAVKHITKILVTTHSTFIHQFQSLCEEIMQKIKEATSNIEYLQVIKQPCAILECVVDPDEISNHIPTIINLFRFIWMESPFYNSETRITNLFKALSNQIIILCKNYIKLDELFDGQTKKALGEFTKCIDCCKKYREIYDLMAEAHSEKNPGTWELDTGSIFNYIDSFVQRCFDMLDVCNCMIIFARIDELEVISKPMFGGAHGDQFEAKCDQIEHMFHDALDNVKAVATTILDVQAPSWYDDILQFRTVIKDIEIIIENLVETVFEGVNHVEEAVIALFSLHNYSKRKNLKRIFKRKTAELNVWAMFSDEVQEAKKETVTSRGTYVADLPSFAGRAALLRVRRNRLAYLKKVLIDASVWMMPCSNSEDVVMHVNRLMGAMDVAIRELWISWTHNLDEKCSAGLNKTLMRKSAENPGLLECNIDDCNMMQVYMKSKTVFYVYESVLAVVKGYNKILDSLSEEERLLFKPLTLVRCN-
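
The transcript and protein records above can be checked against each other: 751 residues plus one stop codon give 752 codons, or 2256 bp, and the protein ends in "-" where the transcript ends in the stop codon TAA. The following is a minimal Python sketch of that check, assuming the standard genetic code applies. The helper names (translate, expected_cds_length) are hypothetical and not part of any MonarchBase tool, and only short fragments of the sequences above are used here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order: TTT, TTC, TTA, TTG, TCT, ... GGG. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol; "-" mirrors the trailing
    character of the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length, has_stop=True):
    """Nucleotide length implied by a protein length (hypothetical helper)."""
    return 3 * (protein_length + (1 if has_stop else 0))


# First ten codons and last three codons of DPOGS202793-TA, copied from above.
start_fragment = "ATGTCTGACTTAGATGTGAGGAAAAGAGTG"
end_fragment = "TGTAACTAA"

print(translate(start_fragment))   # start of the protein record
print(translate(end_fragment))     # end of the protein record, including stop
print(expected_cds_length(751))    # transcript length implied by 751 aa
```

To check the whole record, pass the complete nucleotide sequence (with line breaks removed) to translate and compare the result with the protein sequence.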