Monarch geneset OGS2.0

DPOGS211204
TranscriptDPOGS211204-TA1728 bp
ProteinDPOGS211204-PA575 aa
Genomic positionDPSCF300007 + 871510-873237
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0124550.089.22% 
BombyxBGIBMGA001869-TA0.088.17% 
DrosophilaDhc62B-PC0.056.25% 
EBI UniRef50UniRef50_E0VN310.064.81%Dynein beta chain, ciliary, putative n=13 Tax=Metazoa RepID=E0VN31_PEDHC
NCBI RefSeqXP_002427535.10.064.81%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420136900.064.81%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420136900.064.81%dynein beta chain, ciliary, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00070184.2e-183microtubule-based movement
GO:00302864.2e-183dynein complex
GO:00037774.2e-183microtubule motor activity
KEGG pathwaytgu:1002308391e-174 
 K10408 (DNAH)maps-> Huntington's disease
InterPro domain[5-572] IPR0042734.2e-183Dynein heavy chain
Orthology groupMCL10001 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211204-TA
ATGGGTTACAGTCACAGATTTAATTCAATATCATTAGGACAAGGGCAGGGTCCAATTGCAAAGGCAATGATAGAAAAAGCTCAATTAGAAGGGGGTTGGGTGTGCTTACAAAACTGTCATTTAGCAGTTTCTTGGTTACCCACTTTAGAAAAGCTAATCGAGGGATTTGATTTGACAAACACTGATCTTAGTTTTAGACTCTGGCTTACGAGTTACCCTTCTGATAAATTTCCGCAGTCAGTTTTGCAAGTTGGTGTAAAAATGACCAATGAACCCCCTACTGGACTTCAACATAATTTAAATAGATCTTATCTTTCGGAACCTCTGAAAGAACCTGAATTTTTCGAAGGCTGTCCTGGTAAAGATAAAGCATTTAGTAAGCTTTTGTACGGAATAAGTTTTTTTCATGCCGTGGTACAGGAAAGGAAAAAATTTGGACCACTAGGTTGGAATATTCAGTATGGTTTTAATGACTCAGACTTTCATATTTCAGTTATGCAATTACAGATGTTTTTGAACCAATATGAAGAGATACAATATGTTGCTATAAAGTATTTAACTGGTGAGTGTAATTATGGAGGAAGAGTGACGGATGATTGGGATAGGAGATTAATTGTAACTATTTTAGACAATTATGTCAACGCTAATGTAGTTAATGATCCCAACAACTTATTTTGTGACTTGGGTCCTCAATATGGCCTACCACGGAGATGTGAGTATCAAGACTATTTAAAGCATATTGAATCAGTTCCTGTAAATCCTCCTCCTGAGGTCTATGGGCTACACATGAATGCAGGAATAACGCGTGACTATTCAATATCTATGGCCCTGACGACTTCTTTGGTCCTAGTTGAAGGTGCAGCTGGGGGTGGTGAAGGTGGTAATACTGAAGTAATATTAACTCAGATGGCAACAGAAATTTTGTCAAAGCTACCTGAGTCATTTGATATTGAAACAGCACAGCAAAAGTATCCAGTGGATTATAATGAATCCATGAATACAGTATTGATTCAAGAAATGCAACGATTTAACAAATTATTGAATGAAATTAGAACTTCCCTGATCGATTTACAAAAAGCTGTTAAAGGCGTGATTGTCATGTCACCGGCACTTGATTTGCAGTCTAATTCAATGTTGCTTGGTAAAATTCCAGATAATTGGTCAAAAGTTTCCTATCCTAGTTTAAAACCATTACCAAGTTACGTAGCTGACTTTATTGATCGTCTAGCTATGTTAGAAGATTGGAACCAAAATGGTAAACCGCCAACATTTTGGCTGTCCGGATTTTTTTTCACACAAGCTTTCCTTACTGGTTCCGTTCAAAACTATGCCCGAGCTAAAAAAATACCAATAGATCTACTTATTTTTGATTTCGAAGTATTGCGTGTTGATTATGAACATACTCCTCCAGAATTCGGAGTTTATGTCCAGGGACTTTTCGTAGATGGAGCTAGATGGGATCGGGATAAGTATGCAATTGGCGAACAATTCCCAAAAATATTAAATGATAATATGCCAGCTGTGTGGCTTTTCCCGAAATTGAAAAAAGAGTTCTTAGAAGGTACAAGATACAAATGCCCATTATATAAGACATTAGAAAGAAAAGGTGTTTTGGCGACAACAGGACATTCTTCCAATTTTGTCTTGGCATTTTATTTACCTTCGGATAAACCTTCTGCACACTGGATAAAGCGAAGTGTTGCCCTCATATTACAATTAGACAATTAG

Protein sequence:

>DPOGS211204-PA
MGYSHRFNSISLGQGQGPIAKAMIEKAQLEGGWVCLQNCHLAVSWLPTLEKLIEGFDLTNTDLSFRLWLTSYPSDKFPQSVLQVGVKMTNEPPTGLQHNLNRSYLSEPLKEPEFFEGCPGKDKAFSKLLYGISFFHAVVQERKKFGPLGWNIQYGFNDSDFHISVMQLQMFLNQYEEIQYVAIKYLTGECNYGGRVTDDWDRRLIVTILDNYVNANVVNDPNNLFCDLGPQYGLPRRCEYQDYLKHIESVPVNPPPEVYGLHMNAGITRDYSISMALTTSLVLVEGAAGGGEGGNTEVILTQMATEILSKLPESFDIETAQQKYPVDYNESMNTVLIQEMQRFNKLLNEIRTSLIDLQKAVKGVIVMSPALDLQSNSMLLGKIPDNWSKVSYPSLKPLPSYVADFIDRLAMLEDWNQNGKPPTFWLSGFFFTQAFLTGSVQNYARAKKIPIDLLIFDFEVLRVDYEHTPPEFGVYVQGLFVDGARWDRDKYAIGEQFPKILNDNMPAVWLFPKLKKEFLEGTRYKCPLYKTLERKGVLATTGHSSNFVLAFYLPSDKPSAHWIKRSVALILQLDN-