Monarch geneset OGS2.0

DPOGS203998
TranscriptDPOGS203998-TA3141 bp
ProteinDPOGS203998-PA1046 aa
Genomic positionDPSCF300005 + 1471103-1480308
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0138630.078.34% 
BombyxBGIBMGA002144-TA0.080.72% 
DrosophilaCG9492-PA0.055.52% 
EBI UniRef50UniRef50_Q9VH970.055.52%CG9492 n=24 Tax=cellular organisms RepID=Q9VH97_DROME
NCBI RefSeqXP_002431951.10.060.33%dynein heavy chain, cytosolic, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420230540.060.33%dynein heavy chain, cytosolic, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420230540.060.33%dynein heavy chain, cytosolic, putative [Pediculus humanus corporis]
Group
KEGG pathwaycre:CHLREDRAFT_1372215e-38 
 K10408 (DNAH)maps-> Huntington's disease
InterPro domain[251-814] IPR0135945.7e-123Dynein heavy chain, N-terminal domain-1
Orthology groupMCL24850 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203998-TA
ATGTTCGTGTGTGGTATTAAACCTGGTTCTAGTGAGGAGGAAGAAGAAGATGTAATGGATATTCTTCAGAAAAGACGCGAAGAATGCGAAAGAAAAGCAAGAAGGGGGGAAATGGACCCCAGGTTGGAGTTCACTTTCCAACTTCTTATCGATGGTACAGGCCTACCACGGCATGCCATCATGGATCACGTTTTCGAAGGAAATATGCTAGATGATATAAATCAATTGTTTCTTCCACACATGAGAAATAAGCTGCTGTGGTATTACCAAGATGTAGAAGAAGTGGAGCAACGACCTGTAGTAGAAGGAAATAAGTCTAGACAGCAGAGCCAAAGCAAAGTTCCACCTCCTCAAATAACAATGAAGAAAAAGCTGTTTTTGTCCGATGGGTGGGATGTTCGATTTACAGGAGTCTGTATATACATGTTTCGTATTAATGTAAGCAAGCAATTACCCGAGGAAGGCTTTCATAAAGATTTATTTTGTGGCATTCTTAACGCTGAAAAAGTGGGTTTGGTGACAGCAATAGAACGTGTGATGGAGTACGTGTATATGAGCGCCCTGGCACATCCTAGCAGTGATGGCGATGAAGACGAAACACGCTTCCCCATTGTCAAGAATCAACTTTTGCCTGGCTTACGGTCCTTTTTCTGTGAAGATGTCTGCAATCAAGTGAATCTATTCGACGATGGCAAAGCCCTAATGGCAAATCTGAAGGATCAAACCGAGTTGAAAGAAATGGTTAAGAACTCTAGCAAGCTTATACTTCTCGAGGAAAGAGTTAATGAGTGGATTAAAAAAATTATGGAGATATTAAGTGAAAGTGAGCAGCTTCGTCGTGAAGTGGATTCGAGTGGACCCCAAGATGAGCTAGAATATTGGAAGAAAAGAGGAGCACAATTTTCTCAACTTGTTTCTTATTTACAGGACAGTGAAGTGCAGCTTACTTTAACGTGTCTTCAGCTGGCTAATTCTAAAGTTATTAAACTTTGGCGAGAAACAGATCATAAGATTACATTCTGTTACAATGAAGCTAAAGACAATGCCAAGTTTATTCAGGCTATGGAGAAGTGTTGCCACTCTTTGTATTTGGACGATCCGGTTAAAATCAAGGACTCCATATTAAGTTTACTCCAAACTGTCAGATTAATTCACAGTGTATCACAATTCTATAACACTTCCGAAAGAATATCATCTCTGATGGTTAAGATAACGAACCAAATGATTGAAACTTGTAAACAATACATAACCTGCCGTTCCAAAGAAACTATTTGGTCTCAAGATCGTGATTCTGTGAGGGATAAACTTAAACATTGCATAAACCTCAATAAAACCTATAGAGACACTTATATCTTTGTTAAGAATCAAACATTTTTACCAAATACTGAACAGTTTAGTTTCTCTGAGAACTATGTATTCGGAAAATTTGATACGTTTTGCAAGAGGTTAAATAAAATTTTAACTATGTTCATACTGATGGATGATTATAATCATTTGTTTGAAAAACGGATGGAAGGACTACTCTTAGGCGAAGACCTCGAAGATGCAATGCACTCTTTCAATGAGGCAAAAAAGGCGGTAACGTCATGTCAATATGACTACCTAGATTACCGAAATAACGATTTCGATAAAGATTATCAGGCCTTTGAAGATAAAACTCATACCTTGCGTGAGTCTATTGGTCATACAATTGAAGTAAACTTTGCTAGTGTTTGGGAAACTCCACAGGGAATTAAGTTCCTCACAAGATTTGAGAAGGTCAGTCAAAAAATTCAAATAACAAAACTGAGTGAAAAATATGATAGAGTACTGAAATATAGCGAAAAAGAGGTTGATAAAATAATGAAAATGTTTAAGAGACAAAAGGATGATCCACCGTTGCCCAGAAATTATTCTCCTGTCGCGGGTCGTATAAAATGGGCCCGATGTCTCATGTACAACATGACTGAAACTGTAGAAAGCGTATGCTCGCACGCTGCACTGAACTCGCTACCGACAGCCGCGGATATGATGCGCAAATACGCTACTACACGCACTCTAATACATAACTACGAGGAAAATATGCGAGCTGTTTGGATGAACCAGAATCTCTGGGATGTAGACGACAGTTTGAATAATACAATACTTAAAATAGATGACTCTGGAAGGATCGTTGTCAATTTGGACCATACTATAAAATTACTAATAAGGGAATCGGATTGCTTAGTAAAAATGGGCTTAGAATTGCCTATCGTTTGCCACTCATTATATTCAAAGAAGAAATATTTTACACTAGTAAATGACTCATTACAATTCCTTTTGGAAGACTATATGCGTAGTGTTCGTCAAGTGAAACTAGAAGTAAGACCATTGTTGTTGCCGCAAGTTGTCCGTCTATCTTCACTACTTCTTCCTGGAATACGTTCTGTTTCTTGGACTTGTGAGGAATGGAAAGAATTTGTCGATCGTGCAAATTTTGCTATAAAAAGCTTTGATGTCCTCGTAACCAGAGTTCATGATATTTATAGCAACCGAATCATTTATATGCTGTCCGGTATGCAAGAGGTATCGTTATTAACTTTACCAGATGAAATGCCTTGGTCGGTTGAAGAATTTATTGAATGCGTCGAGACAGGATGCCGATCGGCTTGTGTAGAGCTAAATCGAAAAAGTTTGATGGTGGAAGAAGCAGTAGAAGAAGTATTAGATCTAGTGAAAAAAGCAGCTCAGCAAGTAAAACCTACCGAAATCAACCCAGACTTTGAATTTCTTATCGCTGACGATGATACTCAGTTGATGAGCGGTGCTGCGTCAACGATGAATGAGTCAACCGCCAGCGGTCAGCAGGACTGGTCAGCTGTCTGGGAGTGTTTCGAAAGCCCCCACAGACTACTCTCTGTCCCCGCTGGTGGACTTTCTAAAAGTATGCAGGAGATGGTAAAGAATGCAGTTAATGAAATGCGGCGTTACTATAGTCGTAAAGTTGTCGACGTACTTATAAAGGTCACAAGACGAGCTTTGGATTTGATCATCAAGCAATTCTCTTGCGACTCAGAAGTCATAGTTAAAAACAATCTCAATGAGCTAACCGCGTCAAAGTTTTTGAAGACGGAAGAGATTAAACCTTTTCTTATGACGTTCTATGTGATTTTAGATATTTAA

Protein sequence:

>DPOGS203998-PA
MFVCGIKPGSSEEEEEDVMDILQKRREECERKARRGEMDPRLEFTFQLLIDGTGLPRHAIMDHVFEGNMLDDINQLFLPHMRNKLLWYYQDVEEVEQRPVVEGNKSRQQSQSKVPPPQITMKKKLFLSDGWDVRFTGVCIYMFRINVSKQLPEEGFHKDLFCGILNAEKVGLVTAIERVMEYVYMSALAHPSSDGDEDETRFPIVKNQLLPGLRSFFCEDVCNQVNLFDDGKALMANLKDQTELKEMVKNSSKLILLEERVNEWIKKIMEILSESEQLRREVDSSGPQDELEYWKKRGAQFSQLVSYLQDSEVQLTLTCLQLANSKVIKLWRETDHKITFCYNEAKDNAKFIQAMEKCCHSLYLDDPVKIKDSILSLLQTVRLIHSVSQFYNTSERISSLMVKITNQMIETCKQYITCRSKETIWSQDRDSVRDKLKHCINLNKTYRDTYIFVKNQTFLPNTEQFSFSENYVFGKFDTFCKRLNKILTMFILMDDYNHLFEKRMEGLLLGEDLEDAMHSFNEAKKAVTSCQYDYLDYRNNDFDKDYQAFEDKTHTLRESIGHTIEVNFASVWETPQGIKFLTRFEKVSQKIQITKLSEKYDRVLKYSEKEVDKIMKMFKRQKDDPPLPRNYSPVAGRIKWARCLMYNMTETVESVCSHAALNSLPTAADMMRKYATTRTLIHNYEENMRAVWMNQNLWDVDDSLNNTILKIDDSGRIVVNLDHTIKLLIRESDCLVKMGLELPIVCHSLYSKKKYFTLVNDSLQFLLEDYMRSVRQVKLEVRPLLLPQVVRLSSLLLPGIRSVSWTCEEWKEFVDRANFAIKSFDVLVTRVHDIYSNRIIYMLSGMQEVSLLTLPDEMPWSVEEFIECVETGCRSACVELNRKSLMVEEAVEEVLDLVKKAAQQVKPTEINPDFEFLIADDDTQLMSGAASTMNESTASGQQDWSAVWECFESPHRLLSVPAGGLSKSMQEMVKNAVNEMRRYYSRKVVDVLIKVTRRALDLIIKQFSCDSEVIVKNNLNELTASKFLKTEEIKPFLMTFYVILDI-