Monarch geneset OGS2.0

DPOGS207366
Transcript: DPOGS207366-TA, 3288 bp
Protein: DPOGS207366-PA, 1095 aa
Genomic position: DPSCF300562 - 7796-25963
RNAseq coverage: 78x (Rank: top 65%)
Annotation
Heliconius: HMEL013227, 3e-121, 61.83%
Bombyx: BGIBMGA003806-TA, 0.0, 70.49%
Drosophila: Gl-PA, 0.0, 45.72%
EBI UniRef50: UniRef50_B0W933, 0.0, 46.80%, 150 kDa dynein-associated polypeptide n=5 Tax=Culicimorpha RepID=B0W933_CULQU
NCBI RefSeq: XP_001657515.1, 0.0, 47.55%, dynactin [Aedes aegypti]
NCBI nr blastp: gi|157112397, 0.0, 47.55%, dynactin [Aedes aegypti]
NCBI nr blastx: gi|157112397, 0.0, 47.55%, dynactin [Aedes aegypti]
Group
KEGG pathway: aag:AaeL_AAEL006145, 0.0
    K04648 (DCTN1) maps-> Huntington's disease
    Vasopressin-regulated water reabsorption
InterPro domain: [459-707] IPR022157, 2.7e-44, Dynein associated protein
Orthology group: MCL11853 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207366-TA
ATGTATCCGTATGTGTGTTCTGGAAATCTAAAAAGGGTGCGGCGAAAGGCGTCACCGGCGCAGACGAAGACTGGAAGCCTGTCCATGTCGAGAACCTCCCTGGCCAGCAGCCGTCAGTCGTTGACGTCATTCGTGTCCCCGACGACCGAGAGAGGGACTTCCCCCGATCTCACAAAACGTGCTTCCTTTGTTGAGACTGGTTTCGTGGAGACTTTAACACCTCAATACACTCCGGGTCAGAGTTTGACCTCGCCATCAACAGCCTCTGAGGATAAACTGGCGAACATACAGGCGCAACAGGAGATTGTGAACCTAAAAGCTGAGGTGGAAGATTTGAAGGAGAAGCTGGAAACTCTGAAAGTCAGACGGGCCGAGGACAGGGAGAAGCTTCGAGAGCTGGAGAGAATGAGGTTACAGCTGGACCAGGCGAATGAGTTCAAGGCAAAGATCATGGAGTCACAGGCACAGCTGCAAAGGGACCTGCAGAGGGCCAAACAAGAGCTGCGTGAAGCTCAAGAAGCCCTGGACCAGCACAACGACGAGACAGCTGACCTGCAAGAGGCAGCTGAAATGGCGGCTCTTGATAAAGAAATGGCGGAGGAGAGGGCGGAGGCTTTACAGCTGGAGCTGGAACAGGCGAGGGAGAAGCTGGAAGAGGCGACGCTAGACCTGCAACTCATGAGGGCTGAGATGGAAGCTGGCGGGAATATACAACACCCGTATGCAGCGGGCGACAGTGGCGCCACCGGTTACGAGGTGAGGCAACTACAGCAACAGAACGTCCGTCTGAGGGACACGCTAGTCCGCCTCCGAGACCTCTCCGCCCACGATAAGCATGCAATGCAGAAAATGATGAAGGATTTGGAGCAATACAAATCGGAGATAGCTGAACTGTCGAGGACTAAGGAGAAGCTGTCAGCGAGGGTTGAGGAGTTGGAGGCTCAGGTCGCTGATCTCAGAGAACAGGTGGACGCCGCTCTAGGCGCTGAAGAGATGGTGGAACAGCTGGCTGAGAAGAAGATGGCTTTGGAAGATCAGGTGGAACAGCTGAAGCAGGACGTATCAGAGCTGGAGGCGCTGCAGGAGGTTCACGAACAGCTGGTGGAGTCCAACCGGGAGCTGGAAATGGATCTGCGCGAAGAGCTGGAAATGGCGCACGCTGCTACCCGGGAGGCGGCCCGTGAGCGTGAAGCGGCCTTGGAGACGATCATGGATAGAGATGCGACCATCATCAAGTTCAGGGAGCTGGTGCAGAAGATGACGGAACAGCAGAACGAGCTCAAGAGCCAGGTTGAGAATAAACAGGGTGACCACGAGCCGTCTCCGGAGGGCGAAGCGCCCGAGGCTGCGCCCCGCGAGCTCGGAGCCCTGGTGCTCCAATCCAGGGCTGCCACCCGCTCTGTAGACCTGCAGTTGAGGGCTCTCGAGCTGGAACAGGCTCGGGCCAGGGCTGATAGATTGGCAGCGTGTCTACCTGATCATTTCATGGCACCCAACGGTGATCACGACGCCATCATGTTCATTCTGCTTCTACAGCGGTTGGACACCAAGTCCGAGATCATACTCGGACAGATCAGGGAGAAGTTCCCACCTGTGAACGTCTGGGATAAGGAATCGGTTATGAGAACCCACACAGCTGTCCAGTACAGCTTCAGATGCCAGCTGGAATACCAGCTGCAAATGATACAGTGCATGACATCTATGTGGTCTGGTGCGCTGGAGCGCTGCAGTCCCGAACTACTACTGCGAGCTGCTTCAGCGCTGCCGGATGCTGCAGCACAGGAGAGAGCACTAGATGCTGCGACCAGTCTGTTGAAGAACAATGAATTAGATGAGAACAGCTCTTTAGATGGCATGGAGCGCTGTTGGTCCTATCTAAGCGCTATGTGGTCCGCTCTGAACATGTCGTCGGTGGAAGGCGCGTCTTGTACACGGGATGTGTTGCTACACTCGTGTTTCGCCCTGGACGCGCTCGCGAGGGCCCTAGCAGCTGATGGGGC
GGCGCTACAGCATGTTATGCTGCCGTCCGATCATCAGCAAGAGCTGGGACAGCTGCATGAGGCCATCCAGTCCAGCTGCTCGTCCCTCCAGCAGCAGCTGAAGAGCGTGAGGCGCAGGCTCCAGCCTGGAGTCAAGCCCTCCACTCTGCCTATAGACGCTCAGCTGGTGGATCGTCTCCGAGGGTCCACAGCGGCGTCTCTGAGCAAGTGCGCCCGCGCCACCTCCCTCGCCGCCCGGGCTGCTAGCGCCTGCGCCGACACGGCCGGGGAGAGGGGCGAAGGCGCTCCGCTAGCACACGCAGCCATACAAGCGGAAGAGCAGGGCGTTGTCAAAACTGTGAAGAACGCGCTGTCACAGACGGCCAAGGACGTAGACGCGCTGGCGACCTTCGTGAGGGACCGCGAGTACGACCTGATGTCCAGCACCAACGGAGCTGATGATACGCCGACTCCGCCTATAGTGCTCCGAGCGCAGCTGGTGAAGAAACAGCTGGAAGAGACGAAGACCTTGACAATAAGGCTTGAGAATAAGGAGGCTGATATTAAAGAACTGAAGAAGGCGCTGAAGGCCAAGCAAGAAGAATTGTCTGAGATGCAAATAAGGCGGGAGCTGGGCGAGAGGAAACTGGTCGCTGCGGCGAGGGACGCCGAGCTGAAGTCGCAGCAGCTGCAGCGGCGGCTGGACGACGCGCAGAACCAGTTCAAGAGGGCGGTGGAACGAGAGCGTGCGGCGCGGGTAGCCGCCGTTGGTAGGGCGGAGCGCGCGGCGCTCAGGGCCCTCCGCCCCCTACACCACCCAGCGGCGCCGGAGCACGCCAAGAGGAGGGCGGCTGCTGCTGCCTTGGAGACGGAACTGTCCAAACTACAGGCGGAGTGGACTCTGTTCGTGGCCAGATCGGGTCTGGTGAAGTTCCCCTCGGAGCCCGGCCAGTACGCGCGGGCCTTAGAACAGCACAAGGAGAAACAGAGACGAGTGAGGAAACAGCTGGAGGATAAGCTGATCCGGCTGCAGGTGGAGGCTCGCTTGCTGTTGCTGACTCACCGTCCTTGGCGCGTCTCGCTCGCAGACCTCGCGTGCTTCCCAGCACCGGACCTGGCCGCGGCCCTGGACCCCAAGACGGTGGAGGTTGGCACCATCACGTACCCCGCGGGCGAGGGGCTCAGCGACGACACGATATACGTCACGCCCACCCAGCTGGCCAAGTTGCGCGAGATAGTCACCGAGCTCCAGTCGGACGAGGTTCAGCTCGACCTCAAGCCGCTGGACAGCACCGTGTGCGCAGCTTGA

Protein sequence:

>DPOGS207366-PA
MYPYVCSGNLKRVRRKASPAQTKTGSLSMSRTSLASSRQSLTSFVSPTTERGTSPDLTKRASFVETGFVETLTPQYTPGQSLTSPSTASEDKLANIQAQQEIVNLKAEVEDLKEKLETLKVRRAEDREKLRELERMRLQLDQANEFKAKIMESQAQLQRDLQRAKQELREAQEALDQHNDETADLQEAAEMAALDKEMAEERAEALQLELEQAREKLEEATLDLQLMRAEMEAGGNIQHPYAAGDSGATGYEVRQLQQQNVRLRDTLVRLRDLSAHDKHAMQKMMKDLEQYKSEIAELSRTKEKLSARVEELEAQVADLREQVDAALGAEEMVEQLAEKKMALEDQVEQLKQDVSELEALQEVHEQLVESNRELEMDLREELEMAHAATREAAREREAALETIMDRDATIIKFRELVQKMTEQQNELKSQVENKQGDHEPSPEGEAPEAAPRELGALVLQSRAATRSVDLQLRALELEQARARADRLAACLPDHFMAPNGDHDAIMFILLLQRLDTKSEIILGQIREKFPPVNVWDKESVMRTHTAVQYSFRCQLEYQLQMIQCMTSMWSGALERCSPELLLRAASALPDAAAQERALDAATSLLKNNELDENSSLDGMERCWSYLSAMWSALNMSSVEGASCTRDVLLHSCFALDALARALAADGAALQHVMLPSDHQQELGQLHEAIQSSCSSLQQQLKSVRRRLQPGVKPSTLPIDAQLVDRLRGSTAASLSKCARATSLAARAASACADTAGERGEGAPLAHAAIQAEEQGVVKTVKNALSQTAKDVDALATFVRDREYDLMSSTNGADDTPTPPIVLRAQLVKKQLEETKTLTIRLENKEADIKELKKALKAKQEELSEMQIRRELGERKLVAAARDAELKSQQLQRRLDDAQNQFKRAVERERAARVAAVGRAERAALRALRPLHHPAAPEHAKRRAAAAALETELSKLQAEWTLFVARSGLVKFPSEPGQYARALEQHKEKQRRVRKQLEDKLIRLQVEARLLLLTHRPWRVSLADLACFPAPDLAAALDPKTVEVGTITYPAGEGLSDDTIYVTPTQLAKLREIVTELQSDEVQLDLKPLDSTVCAA-
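
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic: a 1095 aa protein plus one stop codon should account for 3 x 1096 = 3288 nucleotides. The sketch below is only an illustrative check, not part of the database record. The helper names `cds_length` and `translate_prefix` are hypothetical, and the codon table is deliberately partial, covering just the codons needed for the first six residues of the sequence above.

```python
# Illustrative consistency check for this record; helper names are hypothetical.

# Partial codon table: only the codons needed for the first six residues.
CODONS = {"ATG": "M", "TAT": "Y", "CCG": "P", "GTG": "V", "TGT": "C"}


def cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues plus an optional stop codon."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def translate_prefix(nt: str) -> str:
    """Translate an in-frame nucleotide prefix using the partial table above."""
    usable = len(nt) - len(nt) % 3  # ignore any trailing incomplete codon
    return "".join(CODONS[nt[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Lengths as listed in the record: transcript 3288 bp, protein 1095 aa.
    print(cds_length(1095) == 3288)
    # First 18 nucleotides of DPOGS207366-TA.
    print(translate_prefix("ATGTATCCGTATGTGTGT"))
```

Both checks agree with the record: the coding length matches the listed transcript length, and the first six codons translate to the opening residues of DPOGS207366-PA.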