Monarch geneset OGS2.0

DPOGS211961
Transcript: DPOGS211961-TA (2583 bp)
Protein: DPOGS211961-PA (860 aa)
Genomic position: DPSCF300011 + 1084059-1094991
RNAseq coverage: 11x (Rank: top 84%)
Annotation
Heliconius: (no entry)
Bombyx: BGIBMGA000909-TA | 1e-171 | 51.02%
Drosophila: btv-PD | 6e-51 | 25.43%
EBI UniRef50: UniRef50_E0VKZ1 | 5e-65 | 28.12% | Dynein beta chain, ciliary, putative n=2 Tax=Coelomata RepID=E0VKZ1_PEDHC
NCBI RefSeq: XP_002426785.1 | 9e-66 | 28.12% | dynein beta chain, ciliary, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242012113 | 2e-64 | 28.12% | dynein beta chain, ciliary, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|242012113 | 2e-64 | 28.12% | dynein beta chain, ciliary, putative [Pediculus humanus corporis]
Group
KEGG pathway: phu:Phum_PHUM275890 | 3e-65
    K10414 (DYNC2H, DNCH2) maps -> Phagosome
    Vasopressin-regulated water reabsorption
InterPro domain: [351-532] IPR013602 | 6.8e-23 | Dynein heavy chain, N-terminal domain-2
Orthology group: (no entry)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211961-TA
ATGAATTCTTTATATTCTCACTCACTGCTCAGAAGTGGTCGCGTGCGCTCTCAACCCTCCGAGGAAGAGCTTCGCGCCCGTCTGTACTCGTTGCTGAGACGCCTAGTGACTTTACCGGCAGCTCTGCCCGGCCTCGCCAGCACCACGGACACAGATACAAACACGACTAGTGTTTTCTCGAGCATCGTTGAAAAACACAGTTGGCTCGGGAACAAGGCAGTGTCCCAACTGGAGGGCACTCTGGCAAACCTGGAGCGAGCCTGCGAGCGATGGACGAGGCGGGCGGCGTTAGGCTGTGTGCGACTGAAGGACCTGTGTGATCAGCTCACTTCGCCTGAACAGTGGGAGATCAACTTCAGGGCGTGCAAGGCGTACGGGCAGTCGGTCGCCAAGATGACATTTGAAGATGAGAAAATAGAATGGATCACGGTAGGTACTGTAACGCTTCGCCGTGAGTTTGAGGCCCAAGCGAGGAACCTGTGGGCCTGTTTGATGACGTCACTGACAGCCAGCTGTCGGGGTGACTCTACTGCGCTGGATTCTTTCATGACCGCGGCCTCCGTTATGTTGGAGGATAAAACGCTGCCCAAAAACACTAAGGAACTTGCCGAAATCAGCGCCAAGCAACAAGCGCTGCAAAAGAAAATGCCCGAGATGGAGAAAATGGTTGAGGACCTGAAACGCAAGAGCCATTTGGCGAGGACTTGGGGCGGAGACACGACTCTGGATGGAGTCACGCGCGAGTGGAGCAAACTGAAAGAGACGATGACAGCTCATCAGACGATGTTCGAACACCAGGCCGAGATAGTGAAGACGACCCTGAGCGGTGACTGCGAGAACCTCCGCTCGGCGCTGGAGGCGTGGTCGGGTCGGTGGTCACGACACGACGTGCGCGAGACGACGTACGCCGGACTGGTCGCCGCCAGCCACGACGTGAAACACGCGCTGGACGGATGGAACACGCTCGCCGACCAATCGGAATGTGAGAAGTTTAACATTGACCCGGACATGTCGGAGGCTTGGAGCGAAGCCGAACTAATTAAGACGGGTCTCACGACTCTTTGGACTCCTTTCGAACAATATAACGAAGAGTACGAGTCTCTCGCCGGACAGAAGTGGATCGTGGTCCAGAGGAAGCTTCACCAGCTCGATGACTTCGTGTCCAAGTGGGAAGGAGTGTTGGAACCGTTCACCTCCGTCACTTTAGTCATCAAGAGAGAACTCGACAAGTACTCGGAACTAACGATTGCTTTGAAATATCTGCGTGGATCTGACTTCACGGAGAAACATTGGCGAGAGGTGTTCGGCCTGCTGGGGATGGAGTACGTTCCGGTCGAGGAGCTCACCCTGGGACGACTGCTGGCGGCGGGGCAGGACATCAAGAAACAGATGAAGACGCTCCAGAAAATAAATTCATCGGCGTCGTCGGAGGCTGCCATCCGAAGCGCCCTCGTCGAGTTAGAGTTGTGGTACGAGGGGGCGAGACTCACCATAGAGTATTACACCGACAGGGCCAAGAAAGACACGCCGCTTGTCAGAGACTACAAGGAACTAATAGAAAAGGTAGAGGAGCAGCAGTGGGTGGTGGCGGGGCTGGGGGGCCGGGGGCAGGCAGCCGGCTGGGAGGGGACGCTGGCAGACGCGAGGGGGCTCCTCGAAGCGGCGCGGAGGGCCCAGCGCAGATCGGAATGTGAGAAGTTTAACATTGACCCGGAGATGTCGGAGGCTTGGAGAGAAGCCGAACTAATTAAGACGGGTCTCACGAATCTTTGGACTCCCTTCGAACAATATAACGAAGAGTACGAGTCTCTCGCCGGACAGAAGTGGATCGTGGTCCAGAGGAAGCTTCACCAGCTCGATGACTTCGTGTCCAAGTGGGAAGGAGTGTTGGAACCGTTCACCTCCGTCACTTTAGTCATCAAGAGAGAACTCGACAAGTACTCGGAACTAACGATTGCTTTGAAATATCTGCGTGGATCTGACTTCACGGAGAAACATTG
GCGAGAGGTGTTCGGCCTGCTGGGGATGGAGTACGTTGGCCAGCTCCTCGAGTGGGACCCACGACTGCTGGCGGCGGGGCAGGACATCAAGAAACAGATGAAGACGCTCCAGAAAATAAATTCATCGGCGTCGTCGGAGGCTGCCATCCGAAGCGCCCTCGTCGAGTTAGAGTTGTGGTACGAGGGGGCGAGACTCACCATAGAGTATTACACCGACAGGGCCAAGAAAGACACGCCGCTTGTCAGAGACTACAAGGAACTAATAGAAAAGGTAGAGGAGCAGCAGTGGGTGGTGGCGGGGCTGGGGGGCCGGGGGCAGGCAGCCGGCTGGGAGGGGACGCTGGCAGACGCGAGGGGGCTCCTCGAAGCGGCGCGGAGGGCCCAGCGCAGGCCCGATTCGGAGAAAGACGGGAGCGGTCGCGCGGGCGGAGAGGTCGGCAGCTTGCTCAGGGACACATTTAGATACCTGACTCGTTCAAAACTCAGAGATGACTTTCAACGACCTGTTTTGCAAAAAGCTTCCCTCATAATAAAACAAGATAACCGAGTCCGTCTCTGTGTTCGTATGAGGCACGAGAGGTAG

Protein sequence:

>DPOGS211961-PA
MNSLYSHSLLRSGRVRSQPSEEELRARLYSLLRRLVTLPAALPGLASTTDTDTNTTSVFSSIVEKHSWLGNKAVSQLEGTLANLERACERWTRRAALGCVRLKDLCDQLTSPEQWEINFRACKAYGQSVAKMTFEDEKIEWITVGTVTLRREFEAQARNLWACLMTSLTASCRGDSTALDSFMTAASVMLEDKTLPKNTKELAEISAKQQALQKKMPEMEKMVEDLKRKSHLARTWGGDTTLDGVTREWSKLKETMTAHQTMFEHQAEIVKTTLSGDCENLRSALEAWSGRWSRHDVRETTYAGLVAASHDVKHALDGWNTLADQSECEKFNIDPDMSEAWSEAELIKTGLTTLWTPFEQYNEEYESLAGQKWIVVQRKLHQLDDFVSKWEGVLEPFTSVTLVIKRELDKYSELTIALKYLRGSDFTEKHWREVFGLLGMEYVPVEELTLGRLLAAGQDIKKQMKTLQKINSSASSEAAIRSALVELELWYEGARLTIEYYTDRAKKDTPLVRDYKELIEKVEEQQWVVAGLGGRGQAAGWEGTLADARGLLEAARRAQRRSECEKFNIDPEMSEAWREAELIKTGLTNLWTPFEQYNEEYESLAGQKWIVVQRKLHQLDDFVSKWEGVLEPFTSVTLVIKRELDKYSELTIALKYLRGSDFTEKHWREVFGLLGMEYVGQLLEWDPRLLAAGQDIKKQMKTLQKINSSASSEAAIRSALVELELWYEGARLTIEYYTDRAKKDTPLVRDYKELIEKVEEQQWVVAGLGGRGQAAGWEGTLADARGLLEAARRAQRRPDSEKDGSGRAGGEVGSLLRDTFRYLTRSKLRDDFQRPVLQKASLIIKQDNRVRLCVRMRHER-
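
The transcript and protein lengths listed above agree with each other: 2583 bp is 3 x (860 + 1), that is 860 codons plus one stop codon. The following is a minimal sketch of how the two FASTA records could be cross-checked, assuming the standard genetic code and assuming the trailing "-" in the protein record marks the stop codon. The function names `read_fasta` and `translate` and the record name `example-TA` are hypothetical and not part of the database; the demo uses only the first six codons of the transcript followed by its stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta(text):
    """Return {header: sequence}, joining sequence lines that were wrapped."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line.upper()
    return records


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stops are written as stop_symbol.

    The "-" default mirrors the convention assumed for this record.
    Codons containing ambiguous bases are translated as "X".
    """
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Hypothetical mini-record: first six codons of the transcript plus TAG.
demo = """>example-TA
ATGAATTCTTTATAT
TCTTAG
"""
records = read_fasta(demo)
print(translate(records["example-TA"]))  # MNSLYS-
```

To check the full record, the text of both FASTA blocks can be passed to `read_fasta`, and `translate(records["DPOGS211961-TA"])` compared with the sequence stored under `DPOGS211961-PA`.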