Monarch geneset OGS2.0

DPOGS211500
Transcript: DPOGS211500-TA, 3147 bp
Protein: DPOGS211500-PA, 1048 aa
Genomic position: DPSCF300354 - 217972-235849
RNAseq coverage: 89x (Rank: top 63%)
Annotation
Heliconius: HMEL013227; 1e-121; 61.83%
Bombyx: BGIBMGA003806-TA; 0.0; 69.66%
Drosophila: Gl-PA; 0.0; 44.06%
EBI UniRef50: UniRef50_B0W933; 0.0; 45.78%; 150 kDa dynein-associated polypeptide n=5 Tax=Culicimorpha RepID=B0W933_CULQU
NCBI RefSeq: XP_001657515.1; 0.0; 47.80%; dynactin [Aedes aegypti]
NCBI nr blastp: gi|157112397; 0.0; 47.80%; dynactin [Aedes aegypti]
NCBI nr blastx: gi|157112397; 0.0; 47.70%; dynactin [Aedes aegypti]
Group
KEGG pathway: aag:AaeL_AAEL006145; 0.0
  K04648 (DCTN1) maps to:
    Huntington's disease
    Vasopressin-regulated water reabsorption
InterPro domain:
  [551-799] IPR022157; 2.5e-44; Dynein associated protein
  [4-137] IPR000938; 1.9e-26; Cytoskeleton-associated protein, Gly-rich domain
Orthology group: MCL11853; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211500-TA
ATGTCGGATAAGAATCTGACGCTGGGTCAGCGTGTGATGGTGATAGGGAAGGAAGTAAAAGGGTCCATTGCATACGTCGGCAACCCAACATTCGCGTCCGGGAAATGGATTGGTATCATTTTGGATGAGCCCAAAGGCAAAAATAATGGTACACTGCGCGGACATGCATACTTCAGCTGCGAAGAGAAGTACGGCGTGTTTGTGAGACAGACCCAGATACAACTCTTGGATTCAGAAGACAACCCCATGGACACCTCCATGACGGCTTCCACCGAGGAAACCAAACCGACCAGACGTCTCAGCAGGGTGCGGCGAAAGGCGTCACCGGCGCAGACGAAGACTGGAAGCCTGTCCATGTCGAGTTCCAGAACCTCCCTGGCCAGCAGCCGTCAGTCGTTGACGTCATTCGTGTCACCGACGACCGAGAGAGGGACTTCCCCCGATCTCACAAAACGTGCTTCCTTTGTTGAGACTGGTTTCGTGGAGACTTTAACACCTCAATACACTCCGGGTCAGAGTTTGACCTCGCCATCAACAGCCTCTGAGGATAAACTGGCGAACATACAGGCGCAACAGGAGATTGTGAACCTAAAAGCTGAGGTGGAAGATTTGAAGGAGAAGCTGGAAACTCTGAAAGTCAGACGGGCCGAGGACAGGGAGAAGCTTCGAGAGCTGGAGAGAATGAGGTTACAGCTGGACCAGGCGAATGAGTTCAAGGCAAAGATCATGGAGTCACAGGCACAGCTGCAAAGGGACCTGCAGAGGGCCAAACAAGAGCTGCGTGAAGCCCAAGAAGCCCTGGACCAGCACAACGACGAGACAGCTGACCTGCAAGAGGCAGCTGAAATGGCGGCTCTTGATAAAGAAATGGCGGAGGAGAGGGCGGAGGCTCTACAGCTGGAGCTGGAACAGGCGAGGGAGAAGCTGGAAGAGGCGACGCTAGACCTGCAACTCATGAGGGCTGAGATGGAAGCTGGCGGGAATATACAACACCCGTATGCAGCGGGCGACAGTGGCGCCACCGGTTACGAGGTGAGGCAACTACAGCAACAGAACGTCCGTCTGAGGGACACGCTGGTCCGCCTCCGAGACCTCTCCGCCCACGATAAGCATGCAATGCAGAAAATGATGAAGGATTTGGAGCAATACAAGTCGGAGATAGCTGAACTGTCGAGGACTAAGGAGAAGCTGTCAGCGAGGGTTGAGGAGTTGGAGGCTCAGGTCGCTGATCTCAGAGAACAGGTGGACGCCGCTCTAGGCGCTGAAGAGATGGTGGAACAGCTGGCTGAGAAGAAGATGGCTTTGGAAGATCAGGTGGAACAGCTGAAGCAGGACGTATCAGAGCTGGAGGCGCTGCAGGAGGTTCACGAACAGCTGGTGGAGTCCAACCGGGAGCTGGAAATGGATCTGCGCGAAGAGCTGGAAATGGCGCACGCTGCTACCCGGGAGGCGGCCCGTGAGCGTGAAGCGGCCTTGGAGACGATCATGGATAGAGATGCGACCATCATCAAGTTCAGGGAGCTGGTGCAGAAGATGACGGAACAGCAGAACGAGCTCAAGAGCCAGGTTGAGAACAAACAGGGTGACCACGAGCCGTCTCCGGAGGGCGAAGCGCCAGAGGCTGCGCCCCGCGAGCTCGGAGCCCTGGTGCTCCAATCCAGGGCTGCCACCCGCTCTGTAGACCTGCAGTTGAGGGCTCTCGAGCTGGAACAGGCTCGGGCCAGGGCTGATAGATTGGCGGCGTGTCTACCTGATCATTTCATGGCACCCAACGGTGATCACGACGCCATCATGTTCATTCTGCTTCTACAGCGGTTGGACACCAAGTCCGAGATCATACTCGGACAGATCAGGGAGAAGTTCCCACCTGTGAACGTCTGGGATAAGGAATCGGTTATGAGAACCCACACAGCTGTCCAGTACAGCTTCAGATGCCAGCTGGAATACCAGCTGCAAATGATACAGTGCATGACATCTATGTGGTCTGGTGCGCTGGAGCG
CTGCAGTCCTGAACTACTACTTCGAGCTGCTTCAGCGCTGCCGGATGCTGCAGCACAGGAGAGAGCACTAGATGCTGCGACCAGCCTGCTAAAGAACAATGAATTAGATGAGAACAGCTCTTTAGATGGCATGGAGCGGTGTTGGTCCTATCTAAGCGCTATGTGGTCCGCTCTGAACATGTCGTCGGTGGAAGGCGCGTCTTGTACTAGGGATGTGTTGCTACACTCGTGTTTCGCCCTGGACGCGCTCGCGAGAGCCCTAGCAGCTGATGGGGCGGCGTTACAGCATGTTATGCTGCCGTCCGATCATCAGCAAGAGCTGGGACAGCTGCATGAGGCCATCCAGTCCAGCTGCTCGTCCCTCCAGCAGCAGCTGAAGAGCGTGAGGCGCAGGCTCCAGCCTGGAGTCAAGCCCTCCACTCTGCCTATAGACGCTCAGCTGGTGGATCGTCTCCGGGGGTCCACAGCGGCGTCTCTGAGCAAGTGCGCCCGCGCCACCTCCCTCGCCGCCCGGGCTGCTAGCGCCTGCGCCGACACGGCCGGGGAGAGGGGCGAAGGCGCTCCGCTAGCACACGCAGCCATACAAGCGGTGTGGCTGGCGGCCTTCGATAAGATATACCAGCAGGAAGAGCAGGGCGTTGTCAAAACTGTGAAGAACGCGCTGTCACAGACGGCCAAGGACGTAGACGCGCTGGCGACCTTCGTGAGGGACCGCGAGTACGACCTGATGTCCAGCACCAACGGAGCTGATGATACGCCGACTCCGCCTATAGTGCTCCGAGCGCAGCTGGTGAAGAAACAGCTGGAAGAGACGAAGACCTTGACAATAAGGCTTGAGAATAAGGAGGCTGATATTAAAGAACTGAAGAAGGCGCTGAAGGCCAAGCAAGAAGAATTGTCTGAGATGCAAATAAGGCGGGAGCTGGGCGAGAGGAAACTGGTCGCTGCGGCGAGGGACGCCGAGCTGAAGTCGCAGCAGCTGCAGCGGCGGCTGGACGACGCGCAGAACCAGTTCAAGAGGAAAGAGAAGGAGTTCGAGGAGACTATGGACCACCTGCAGCAGGACATAGACCTGCTGGCCAGCGAACGAGGAGCTCTCAGGGACAAGCTGAAGCTATACGCTAAGAGGTCACATCACGGTCAGTAA

Protein sequence:

>DPOGS211500-PA
MSDKNLTLGQRVMVIGKEVKGSIAYVGNPTFASGKWIGIILDEPKGKNNGTLRGHAYFSCEEKYGVFVRQTQIQLLDSEDNPMDTSMTASTEETKPTRRLSRVRRKASPAQTKTGSLSMSSSRTSLASSRQSLTSFVSPTTERGTSPDLTKRASFVETGFVETLTPQYTPGQSLTSPSTASEDKLANIQAQQEIVNLKAEVEDLKEKLETLKVRRAEDREKLRELERMRLQLDQANEFKAKIMESQAQLQRDLQRAKQELREAQEALDQHNDETADLQEAAEMAALDKEMAEERAEALQLELEQAREKLEEATLDLQLMRAEMEAGGNIQHPYAAGDSGATGYEVRQLQQQNVRLRDTLVRLRDLSAHDKHAMQKMMKDLEQYKSEIAELSRTKEKLSARVEELEAQVADLREQVDAALGAEEMVEQLAEKKMALEDQVEQLKQDVSELEALQEVHEQLVESNRELEMDLREELEMAHAATREAAREREAALETIMDRDATIIKFRELVQKMTEQQNELKSQVENKQGDHEPSPEGEAPEAAPRELGALVLQSRAATRSVDLQLRALELEQARARADRLAACLPDHFMAPNGDHDAIMFILLLQRLDTKSEIILGQIREKFPPVNVWDKESVMRTHTAVQYSFRCQLEYQLQMIQCMTSMWSGALERCSPELLLRAASALPDAAAQERALDAATSLLKNNELDENSSLDGMERCWSYLSAMWSALNMSSVEGASCTRDVLLHSCFALDALARALAADGAALQHVMLPSDHQQELGQLHEAIQSSCSSLQQQLKSVRRRLQPGVKPSTLPIDAQLVDRLRGSTAASLSKCARATSLAARAASACADTAGERGEGAPLAHAAIQAVWLAAFDKIYQQEEQGVVKTVKNALSQTAKDVDALATFVRDREYDLMSSTNGADDTPTPPIVLRAQLVKKQLEETKTLTIRLENKEADIKELKKALKAKQEELSEMQIRRELGERKLVAAARDAELKSQQLQRRLDDAQNQFKRKEKEFEETMDHLQQDIDLLASERGALRDKLKLYAKRSHHGQ-
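
The transcript and protein lengths in this record are consistent with each other: 3147 bp is 1049 codons, that is 1048 amino acids plus one stop codon, which the protein sequence shows as a trailing "-". A minimal Python sketch of how such a check could be done is given below. It assumes the standard genetic code and that the transcript is a plain coding sequence starting at the first base; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1 using the standard code.

    Whitespace and line breaks are removed first, so a sequence wrapped
    over several lines can be passed in directly. Codons that are not in
    the table (for example, containing N) are translated as "X".
    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the protein sequence in this record).
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 12 bases of DPOGS211500-TA, copied from the record.
print(translate("ATGTCGGATAAGAATCTGACGCTGGGTCAG"))
print(translate("CACGGTCAGTAA"))
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS211500-PA sequence would extend the same check to the whole record.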