Monarch geneset OGS2.0

DPOGS211202
TranscriptDPOGS211202-TA3342 bp
ProteinDPOGS211202-PA1113 aa
Genomic positionDPSCF300007 + 860121-867032
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0124550.085.16% 
BombyxBGIBMGA001869-TA0.079.91% 
DrosophilaDhc62B-PC0.037.78% 
EBI UniRef50UniRef50_E0VN310.042.65%Dynein beta chain, ciliary, putative n=13 Tax=Metazoa RepID=E0VN31_PEDHC
NCBI RefSeqXP_002427535.10.042.65%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420136900.042.65%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420136900.042.65%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
Group
KEGG pathwaynve:NEMVE_v1g2463692e-111 
 K10408 (DNAH)maps-> Huntington's disease
InterPro domain[656-1092] IPR0136023.2e-112Dynein heavy chain, N-terminal domain-2
Orthology groupMCL10001 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211202-TA
ATGGAAGAGAGCGAATTGAGAAAACTGTTGCAGACCAAAAATGGTTGTAGAATTCCTGTCATGCTTCCGATACCGGATGTGCATAAAGTAGACAAATTGCCTTTTATGCCACTGCCGCCTTACAACAGAATTAAGGACAAGAAACAAAAATTTCGTCAATTACTGGAGGAAAAAGCTACAAAGAGAAAAGTGAACATCGCTAGATCTTCGTTTGAGTATAGTGAGTGTGATGCTTTAGAAATATCTCAAGAAAGACATATAGCTGTCCTACGTCAATGTGCGGAAAAAGTGCAACCACCTCCAATGTTAAAATCTTGGGAGAGAAAAATATTCAGTTTAATACCACCTAAATTAAGAAATGCCTATCCCACAATAGCGGAGGATCTTTTAAAAGAGAGTAAAAATGAATGGAACAGAAATCTCCATGACTTGGCCGTAAAAACTGTAATCCGTGATGTTCCTGGAGTACCTAGAAAAAGATATGAGGAGCCACACTTTAAATTTCATGGTGTTACACCGAATTATGAGAAAATGGTCAAGTTTAGAAAGAAGTTACAATCTGGTTCATTACTCCTACATCCATTTATAAGACTTGTCTTAGAATCATCCGAAAAGACGTTTCCAGAATTTATAATTAATTTGTCAAAATACAGAGCGAAGGGACCATTTCATTTGGATGATTTTCATACTAAAGTATTGGAAGAGATAAAAAAAGCTGATTACTTGGTATCCAGTACTTGGTATTCAATTTTAGCGAATTGGTTAAAAAATCCAAGATGCTTGAAAGGCATGAAACCTAAAAGAATACCCGATTTTGTATCATGTGCTACTAAAATTATATCGATGCAAATACAAGAACTCATGCGTCGATCTATTGATGCAATAATAACTTCGCTGGGGAACCCCGAGTGCGTTCCAATCATAAACATAGATTTAGATTTCAATGGCGAATTTATCTATGATCCATCTCTTGAAGAGGTTTTTAATGTTTTCCATAATATAGCCGATGCTATTGCACACATAGCACAAAGGCTAATGCCAATTGAACAATACCTAAAAATACCATATAACTATGATGCTCTGCCTGTTAAGTACAATGATTGGTTAAATAAAGATAGTCACGATAGATTACAGCAGCAGTTGAATATAGTATTTGAACCCCTCGTTCAATATTTAGTTGATCTAAGAGTAGCTTATAGTATGCTGTATGGGGCGCCAGCAAAAACACAGCTTTCTAAATTTATTAACCAGGCCAAGGAGTTTGAAGAGTCGCGAGACAAAATAAAATATTTTCAAGAAATAGATTCTGATATTACAGCTGTTCTGGAAAATGAATATTTTAATTGCGCTATTGTTTGTCAACGGAGAATGATAAACGGGTTGAAAGCTAGAGCTCTAGAATTCATAAATGACATAATTGCTGGCATTGTAAAAACTCATATGGGTGAAAATGATAGCATTTGCAAAGAATTTGAAATTATAGCAGCAAAAGCACTGAAAGAACCTGAAAATGCTGCGGAATTGATTGAACAAGGTGTTTACATTTTACATGCTAAGACCGTCTTGGTAGAGGTTTTAAAAGAGAGAATATTGAGACAAATAAACATAATCTCCAATTTGCTAGAAATGACTTCTCTTTCTCCGGAGCATGTGGCTTCTAATACTCGCACTGTAAATTGGCTCACCGATATTAAACCAATTTTTGAGAGAAATGCTACAGCATATGAAACATTTAAAGCTGAAATGGAGGAAAATCTTTTAACAAAAATTGCTTATTTAAACAAAGAGGTTTCAGAAATAACACCGTATTTAGAGCTTTTAGATAATATGGATGACATAGATCATACTCTTGAATACTTGGAACACCTGAGAAAGTTAGTGCATCGCCTGGATGATTGTGACAAATTAGTCAGTTGGATAAATAACGAGGAAGTAACATTTAAGTTTCCAGTTTCCCACTATTCTGATCTCGAAGAACTAAAAGATTTTATTAAACCTTTCCATAATCTAACGTATTTAGTTCATAAATGGAAACGAAGTTTTTACACTTGGATGGATGGTCCGTTCGAGTATTTAGATCATGAGAAAATTGAACAAGATCATGATTTTTACTACAAAGAATTTCTAAAATTGTCTAAAGCTTACAGAACTAAAATAAAACAACAAATATCGGAAGGCGTAGAAAAGAGATTTCAAGGTTTAGTCGACGATCCCGATATTAATAATTTACCAGCGCCAATGAAACTGTGTGCACAAGCTATAGCCGAAATAAAAAGCTGGCGCCCAAATGTTCAAATGGCACACATAATGTGTAACCCAGCATTAGTACAAAGACATTGGGATGAGATGTCAAATATAGCAGGATTCGATTTGACCCCAAACGCTGGAACATCTTTAAGGAAAATCATTGAATATAATCTTTGGGAAGATATTGATCAATATGAAATAATAAGCGTAGCGGCAACAAAGGAACTGGCTCTTATAACAAATTTAAATAAAATGATGGCTGAATGGACTGATATATGCTTTAAAACAAGTCCATACAAAGATACAGGAATTTACATATTATCTGGCTTAGATGATATACAAAGTGTTTTAGACGATCATATTGTAAAAACTATCGGCATGAGGGGCTCCGCGTTCGTAAAACCTTTTGAAGCTCAAGTTAGAAATTGGTATGAGAAAATAACACGTGTCAACGCTACAATTGACGAATGGGGCAAAGTACAGAGTCAATGGCTATACTTGTTACCTATATTTTCGTCTAAAGATATTGTTGCTCAAATGCAGGAAGAAGGAGTAATGTTCGTTGAGGTTAATAATATTTATCGTCGCTATATGGGTTCAGTAGACAAAGATCCTCACGTTTTAGAGATAGCTGGTGGAATGGGTGTTTTAGAATCATTTAAGATTGCGTCCGGTATGCTTGAAAAAATTAATGACGGTATAAACAACTATCTCGAAAGGAAACGCTTATATTTTCCACGTTTCTTCTTTTTGTCGAACGACGAAATGTTGGAAATCTTGTCAGAAACGAAGAATCCTCTCAAAGTTCAACCGCATCTCAAAAAATGTTTCGAAGGTATAAATCGACTTGTCTTTGATCCAGAGTTCAATATATCAGCAATGATTTCTATGGAAGGAGAACAAGTAGAATTTTTGGAAACTATTAGTGTCGCAGCAGCGAGGGGATCCGTTGAAAAGTGGCTAGTGCAAGTTGAAGATCAGATGTTAAAAGCTGTGAAATCTGAGACTGAATTATCTTATTATGACTACCCTAATTTAAGTCGTGTAGATTGGATATTGTCTTGGGAAGACGGTTATTCCGACATTAGAAACTGA

Protein sequence:

>DPOGS211202-PA
MEESELRKLLQTKNGCRIPVMLPIPDVHKVDKLPFMPLPPYNRIKDKKQKFRQLLEEKATKRKVNIARSSFEYSECDALEISQERHIAVLRQCAEKVQPPPMLKSWERKIFSLIPPKLRNAYPTIAEDLLKESKNEWNRNLHDLAVKTVIRDVPGVPRKRYEEPHFKFHGVTPNYEKMVKFRKKLQSGSLLLHPFIRLVLESSEKTFPEFIINLSKYRAKGPFHLDDFHTKVLEEIKKADYLVSSTWYSILANWLKNPRCLKGMKPKRIPDFVSCATKIISMQIQELMRRSIDAIITSLGNPECVPIINIDLDFNGEFIYDPSLEEVFNVFHNIADAIAHIAQRLMPIEQYLKIPYNYDALPVKYNDWLNKDSHDRLQQQLNIVFEPLVQYLVDLRVAYSMLYGAPAKTQLSKFINQAKEFEESRDKIKYFQEIDSDITAVLENEYFNCAIVCQRRMINGLKARALEFINDIIAGIVKTHMGENDSICKEFEIIAAKALKEPENAAELIEQGVYILHAKTVLVEVLKERILRQINIISNLLEMTSLSPEHVASNTRTVNWLTDIKPIFERNATAYETFKAEMEENLLTKIAYLNKEVSEITPYLELLDNMDDIDHTLEYLEHLRKLVHRLDDCDKLVSWINNEEVTFKFPVSHYSDLEELKDFIKPFHNLTYLVHKWKRSFYTWMDGPFEYLDHEKIEQDHDFYYKEFLKLSKAYRTKIKQQISEGVEKRFQGLVDDPDINNLPAPMKLCAQAIAEIKSWRPNVQMAHIMCNPALVQRHWDEMSNIAGFDLTPNAGTSLRKIIEYNLWEDIDQYEIISVAATKELALITNLNKMMAEWTDICFKTSPYKDTGIYILSGLDDIQSVLDDHIVKTIGMRGSAFVKPFEAQVRNWYEKITRVNATIDEWGKVQSQWLYLLPIFSSKDIVAQMQEEGVMFVEVNNIYRRYMGSVDKDPHVLEIAGGMGVLESFKIASGMLEKINDGINNYLERKRLYFPRFFFLSNDEMLEILSETKNPLKVQPHLKKCFEGINRLVFDPEFNISAMISMEGEQVEFLETISVAAARGSVEKWLVQVEDQMLKAVKSETELSYYDYPNLSRVDWILSWEDGYSDIRN-