Monarch geneset OGS2.0

DPOGS214653
TranscriptDPOGS214653-TA1344 bp
ProteinDPOGS214653-PA447 aa
Genomic positionDPSCF301123 - 457-3965
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0040820.071.02% 
BombyxBGIBMGA005048-TA0.094.63% 
DrosophilaDhc93AB-PB0.069.73% 
EBI UniRef50UniRef50_Q9U3Y50.071.75%Dynein heavy chain n=75 Tax=Bilateria RepID=Q9U3Y5_DROME
NCBI RefSeqXP_002427058.10.073.77%ciliary dynein heavy chain, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420126850.073.77%ciliary dynein heavy chain, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420126850.073.77%ciliary dynein heavy chain, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00070181.6e-187microtubule-based movement
GO:00302861.6e-187dynein complex
GO:00037771.6e-187microtubule motor activity
KEGG pathway 
InterPro domain[1-445] IPR0042731.6e-187Dynein heavy chain
Orthology groupMCL10001 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214653-TA
ATGTGCAGCAAAGAGGCTGAATTTAAATCTGTTTTGTTCGCGCTCTGCTACTTCCACGCTGTTGTGGCTGAGAGACGGAAGTTTGGCCCTCAAGGATGGAACAGAGTATATCCGTTCAACTTCGGCGATCTCACGATATGTGTGTACGTGCTGTATAACTACCTGGAGGCGAACCCTCGCGTGCCCTGGGAGGATCTGAGATACCTTTTTGGAGAGATTATGTATGGAGGGCATATCACCGACGACTGGGATCGGAGGCTCTGCCGAACCTTCCTCTTGGAGTACATGCAGCCTGAATTGGTCGATGGAGAGCTCACCCTGGCGCCTGGCTTCATATCCCCTCCGAATTCTGACTACGCCGGCTACCACCAGTACATAGACGACTTCCTTCCTGATGAGACACCGTATCTATACGGATTACATCCTAATGCTGAAATCGGATATTTGACTACAGTATCTCAAAGATTGTTTAAGGTGGTATTTGAAATGCAACCCCGAGACGCCGGAGCGCAGGCTGGAGGCGGTGCTAGTAAAGAAGAAATAGTCAGATATATCCTTGACGATATAATGGACAGGGTGCCGGAGCCGTTCAATTTGGTGGAACTGATGGGGAAGGTGGAGGAACTGACTCCATACACGATCGTAGCACTACAGGAGTGTGAGAGGATGAACAGGCTCATGGGCGAGATACGACGGTCGCTTAAAGAGTTGGAGTTGGGACTTAAGGGAGAGCTGACCATAAGCAGTGATATGGAGAAGTTGATGGAGTCTCTGTTCATGGACCACGTGCCGGCCTCGTGGTCCAACCTCGCCTACCCCAGCCTCTTAGGTCTGGCCGCTTGGTTCTCAGACCTCTGTCTTAGACTCACCGAGTTGGAGAATTGGTCGGGCGACTTTAATTTGCCGCCCGCTGTGTGGCTCGCCGGCTTCTTCAACCCGCAGTCGTTCCTGACAGCCATCATGCAGCAGACGGCTCGTAAGAACGAGTGGCCCCTCGACAAGATGTGCCTCAACTGTGACGTCACCAAAAAGAACAGGGGAGACTTCAACGCTCCGCCCCGCGAGGGTGCCAACATCCACGGTCTGTATATGGAGGGCGCCCGGTGGGACACGGCCACCGGCGGGATCGTGGAGTCCAACATGATGGACCTGTTCCCCATGATGCCCGTCATCTACATCAAGGCCGTCACCCAGGACAAGCAGGACACCAAGAACGTGTACGAGTGCCCCGTCTATAAGATACGTATGCGTGGCCCCACATTCGTGTGGACGTTCAATCTGAAGACGAAGTACAAACCAACAAGATGGACCCTCGCCGGCGTCGCTCTGCTGCTGGGGGTATAA

Protein sequence:

>DPOGS214653-PA
MCSKEAEFKSVLFALCYFHAVVAERRKFGPQGWNRVYPFNFGDLTICVYVLYNYLEANPRVPWEDLRYLFGEIMYGGHITDDWDRRLCRTFLLEYMQPELVDGELTLAPGFISPPNSDYAGYHQYIDDFLPDETPYLYGLHPNAEIGYLTTVSQRLFKVVFEMQPRDAGAQAGGGASKEEIVRYILDDIMDRVPEPFNLVELMGKVEELTPYTIVALQECERMNRLMGEIRRSLKELELGLKGELTISSDMEKLMESLFMDHVPASWSNLAYPSLLGLAAWFSDLCLRLTELENWSGDFNLPPAVWLAGFFNPQSFLTAIMQQTARKNEWPLDKMCLNCDVTKKNRGDFNAPPREGANIHGLYMEGARWDTATGGIVESNMMDLFPMMPVIYIKAVTQDKQDTKNVYECPVYKIRMRGPTFVWTFNLKTKYKPTRWTLAGVALLLGV-