Monarch geneset OGS2.0

DPOGS211203
TranscriptDPOGS211203-TA3273 bp
ProteinDPOGS211203-PA1090 aa
Genomic positionDPSCF300007 + 867631-871243
RNAseq coverage11x (Rank: top 84%)
Annotation
HeliconiusHMEL0124550.092.61% 
BombyxBGIBMGA001869-TA0.090.04% 
DrosophilaDhc62B-PC0.057.09% 
EBI UniRef50UniRef50_E0VN310.063.22%Dynein beta chain, ciliary, putative n=13 Tax=Metazoa RepID=E0VN31_PEDHC
NCBI RefSeqXP_002427535.10.063.22%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420136900.063.22%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420136900.063.22%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
Group
KEGG pathwaymdo:1000264760.0 
 K10408 (DNAH)maps-> Huntington's disease
Orthology groupMCL10001 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211203-TA
ATGAACATATATGACATGGCTGCAGAAGGTCTACGGCCTACTCCAGCCAAATCTCATTATATTTTTAATCTACGAGACTTCTCAAGAGTTATTCAAGGATGTGCGTTGTTAAGAAGAGAATCTGCAGACAATAAAAAAACTTTTACTAGAGTATGGGTCCATGAAATTTTGCGTGTTTTTTATGATAGACTTGTTGATGAAATAGATCGTTCTTGGTTCTATAATCTTTTAAGGAAATCAACACAAGAGTTCATGAGGGACACATTTGAATCAGCATTAGACACATATCAAAATGATAAAGGTGAGGTAACTCAAGAAAATATTAAAAAAATGATGTTTGGTTGTTATTTAGACACGGACAGTGTCGAAGGGGAGAGAAGGTATGAAGAAATTCCTACGAAAGAAACATTTTTAAATGTTGCTATCGCTATGTTAACAGAGTATAATGCAATGCATAAGGCAAAAATGACAATTGTATTATTCGATTATGCCTTAGAACATTTATCAAAAATATGTAGATTGCTTTCTATGCCATCGGGTAACGCCTTATTGGTTGGAGTAGGAGGTTCTGGTCGCCAGTCACTCACTAGGTTGGCTAGTACTATTTTAGGTCAACAGGTATATCAACCTGAAATCACCAAGTCATATAGTGTTAAAGATTGGCATGATGATATTAAACTAGTTCTAAGAGAATCCGGTGGTCTTAATAAAGATACAACTTTTCTATTCACGGAAAATCAAATTAAAGAAGAAGTCTTTATTCAAAATTTAGACAGTTTGCTTAATTCCGGTGAAGTTCCTAATTTATACGGCTTAGATGAAATGCAAGAAATATTAGAATTAGTTCGCCTTGCAGCCCAAGGTGGTAATAGAAATTTAGATATTAGTCCATTGCAGATTATGTCATTTTTCGTTGGAAGATGTAAAGCTAAATTACATATTGTTTTGTGTTTCAGTCCTATAGGAAGTTCTTTTAGAACAAGATTAAGGCTGTATCCTTCTCTTGTAAACTGCTGCACTATTGATTGGTATGATAGTTGGCCTGAAGATGCACTAGAAATGGTGGCACATTACTATATGGTTAAAGTAAATGTTAGTGATAAAATTAAAGCAGCTGCAGTTATAGCGTGTAAACAATTTCACGTGGATGCGCGAAAGGTATCAATTGATTTCTTTAATCAATTTGGAAGAAAGACATATATTACATCAGCATCATATCTTAATTTGATCAAATCTTTTACAATATTAACAAATCGAAAACAAAGAGAGTTAAGAGCTGCAAAATTGCGTTATACGAATGGTTTAGATAAACTTAGTCAGGCAGCAGAAGCCGTGTCAATTATGCAACGTGACCTAAATATTTTGAAACCTCAATTAATTGTTATGGCTGCTAAGTCTACAAAGATGATGGAAGAAATCGCAGTAGAAACCGCCACTGCAGATAAAGCTGCAGCACAGGTGCGGGAAGATCAAAAAGTAGCAAATGTTCAAGCAGCAGCTGCGCAAGAGCTAAAAAAGGATTGTGAAGCGGATTTAGCTTTAGCTTTACCTATTTTAGAGGATGCGATTGCTGCATTGAATACTTTGAAACCCGCAGATATTACAATTGTAAAATCAATGAAAAATCCTCCAGCGACAGTAAAGTTAGTGATGGCAGCGGTATGTGTAATGAAGGGGATCCCACCTGATAAAATTCCAGATCCTGATAATCCGGGTAAGAAAATGTTAGATTATTGGGGTCCCAGTAAGAGAATATTAGGAGACATGAGCTTCTTGGATTCGTTGCGCAACTTTGACAAGGATAACATCCCAGTGGCAACAATGCAAAAAATAAGAAAAGAATATCTTTCCCATAAAGATTTTAAGCCACACATTATTGCCAAAGCTTCCACAGCTGCGGAAGGGTTGTGTAAGTGGATAATTGCAATGGATATGTATGATGCAGTAGCAAAAGTTGTTGCTCCGAAAAAAGCGAAGCTGGAAGCGGCTGAAAAAGAATTTGCGGAAACGATGGCGATATTAGAAGAAAAGAAAGCTACCGTAGCTAGATTAGAGGCTAGATTAGCTGAGTTAAATGAAGCCTTAGAAGAAGCAAATATTAAGAAAAAGGCTTTAGAAGATGAAGTTCAACTTTGTATTGATAAATTATATCGTGCTGAGAAGTTAATTGGTGGTCTTGGGGGAGAAAAAGTGAGATGGACAGCAGCAGCTGAAAATTTGCAAACTCTATTTGACAATTTAGCTGGAGATATTCTTGTTTCATCGGGTATCATAGCATATTTATCGCCTTATACTTTACCTATAAGAATAGAGATGATTTCCAAGTGGCGTGATTTAGTTATTGGTCTTGATATGCCACATTCTGAACATTTTGTGTTTAAAGACATTTTAGGTACTGACATTAAAATCCAAAATTGGTGTATAGCTGGTTTACCATGGGATTCGTTTTCTATAGATAATGGTGTTATACAAGATAGTTCTCTTCGTTGGTCTCTACTTGTCGATCCACAAGGACAAGCAAACAAATGGATAAAAACAATGGAAAAATCTAATGATTTGCAAGTCCTTAAGTTTACTGATGGTAATTATATGAAAGTAATAGAAACTTGTTTAGAATACGGAAAACCAGCATTGATTGATTGTATTTTAGAAGACGTTGAACCACCTTTGGATCCAGTTTTATTAAAGCATACTTATGTACAAGGTGGAAAAGAATTTATTGCTTTGGGTGAGAATGTAATTGAATATCATCCTAATTTTAGATTGTACATGACTACGAAACTCAGAAATCCTCATTATTTGCCTGAAGTGTTTAACAAAGTTACATTAATCAATTTTGCTCTTACAAAGGATGGACTGGAGGATCAATTGTTGGGTATTGTTGTGGCCAAAGAAAGACCTGATTTGCAAGAAAAGCGTGAAAAATTGATTGTGCAAGGTGCGGCTAATCGTGCTGCACTTAAACAAGTAGAAGATGACATATTACGAACTCTTCAAGAATCTAAGGGTGATATTCTAGAAGATGAATCTGCTATAGAAGTTTTAGATTCATCAAAACTATTAGCAATTGATATAACTAAAAAACAAGAAGCATCGGTAGAAACAGAAATTATTATAGAAAAATTTAGACTTGGATACAGGCCTATAGCATCACATTCTGCTATAAAGGTTGGGATGAAATATGCAGACTGGATGACCTACCAGCCTATAAGGAGATTAGAAATAGTTTTACAACCCATCAAAAGGGATGGAAAGAAGTCTATGACGATTTAG

Protein sequence:

>DPOGS211203-PA
MNIYDMAAEGLRPTPAKSHYIFNLRDFSRVIQGCALLRRESADNKKTFTRVWVHEILRVFYDRLVDEIDRSWFYNLLRKSTQEFMRDTFESALDTYQNDKGEVTQENIKKMMFGCYLDTDSVEGERRYEEIPTKETFLNVAIAMLTEYNAMHKAKMTIVLFDYALEHLSKICRLLSMPSGNALLVGVGGSGRQSLTRLASTILGQQVYQPEITKSYSVKDWHDDIKLVLRESGGLNKDTTFLFTENQIKEEVFIQNLDSLLNSGEVPNLYGLDEMQEILELVRLAAQGGNRNLDISPLQIMSFFVGRCKAKLHIVLCFSPIGSSFRTRLRLYPSLVNCCTIDWYDSWPEDALEMVAHYYMVKVNVSDKIKAAAVIACKQFHVDARKVSIDFFNQFGRKTYITSASYLNLIKSFTILTNRKQRELRAAKLRYTNGLDKLSQAAEAVSIMQRDLNILKPQLIVMAAKSTKMMEEIAVETATADKAAAQVREDQKVANVQAAAAQELKKDCEADLALALPILEDAIAALNTLKPADITIVKSMKNPPATVKLVMAAVCVMKGIPPDKIPDPDNPGKKMLDYWGPSKRILGDMSFLDSLRNFDKDNIPVATMQKIRKEYLSHKDFKPHIIAKASTAAEGLCKWIIAMDMYDAVAKVVAPKKAKLEAAEKEFAETMAILEEKKATVARLEARLAELNEALEEANIKKKALEDEVQLCIDKLYRAEKLIGGLGGEKVRWTAAAENLQTLFDNLAGDILVSSGIIAYLSPYTLPIRIEMISKWRDLVIGLDMPHSEHFVFKDILGTDIKIQNWCIAGLPWDSFSIDNGVIQDSSLRWSLLVDPQGQANKWIKTMEKSNDLQVLKFTDGNYMKVIETCLEYGKPALIDCILEDVEPPLDPVLLKHTYVQGGKEFIALGENVIEYHPNFRLYMTTKLRNPHYLPEVFNKVTLINFALTKDGLEDQLLGIVVAKERPDLQEKREKLIVQGAANRAALKQVEDDILRTLQESKGDILEDESAIEVLDSSKLLAIDITKKQEASVETEIIIEKFRLGYRPIASHSAIKVGMKYADWMTYQPIRRLEIVLQPIKRDGKKSMTI-