Monarch geneset OGS2.0

DPOGS200821
TranscriptDPOGS200821-TA1110 bp
ProteinDPOGS200821-PA369 aa
Genomic positionDPSCF300071 - 635043-639004
RNAseq coverage462x (Rank: top 27%)
Annotation
HeliconiusHMEL0114770.088.65% 
BombyxBGIBMGA009875-TA6e-15277.57% 
DrosophilaDmn-PA1e-6238.48% 
EBI UniRef50UniRef50_E2AM041e-6240.20%Probable dynactin subunit 2 n=9 Tax=Endopterygota RepID=E2AM04_CAMFO
NCBI RefSeqXP_001847700.14e-7541.49%dynactin subunit 2 [Culex quinquefasciatus]
NCBI nr blastpgi|944693524e-7444.30%dynactin [Aedes aegypti]
NCBI nr blastxgi|944693527e-7544.30%dynactin [Aedes aegypti]
Group
Gene OntologyGO:00058691.8e-72dynactin complex
GO:00070171.8e-72microtubule-based process
KEGG pathwaycqu:CpipJ_CPIJ0058751e-74 
 K10424 (DCTN2)maps-> Huntington's disease
    Vasopressin-regulated water reabsorption
InterPro domain[12-366] IPR0069961.8e-72Dynamitin subunit 2
Orthology groupMCL12112 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200821-TA
ATGGCTAATCCTAAGTACGAAAATCTTCCCGGCATAGCCTATGACCAACCCGATGTATATGAAACCGATGATTTGCCGGAGGCTGACCAGCCGGATCCGTATGAAGAGGAAGAAAATAGCTGTATTGAACAGTTACATTTATCAGTGAAAGATTCTTTCAATACATTCAAAGGAAAATTTCTAACTGGGAATGTTGATTTTTCTGATAGACTCAGCAGGAAAAATCGTATCGGATACAGAGCTGGTGAGTGGGAATTGGCAGCGGAGGGTGACACAGAGACAACATTGGAGCGCTACAATAGACTTCGCTGTGAGTTCTCAGAGCTGTTGGAAGAAGTCACACAGAAAGAAAATAAAGCCATAGAGATCGAAAAGGAGGAATACAGCAAACTTGCAGCACAGATCAATTCAACAAAGAAACTTCTAGAAGAATTAAAACTAGAAGAGGGGGAACAAATTGATCCCAAGGCGGAGAAATTGAAGGAATACTTAAGTGTCAGTGGAAAGAAACAAGGTGATAAGGTCACAGCACATTTGAAACTCAAGCCTGAAGTCAATTTAGCGCAGACGGCCAGAATAGCACAGTTGGAGCACAGATTACATAAACTGGAGCAGGCGGCTGGTGTGAGAGATGAAGAAGCGTTCCGGAGGCTACAGACTGTTACTGGAGAGGCTACTCTATGTGGTGCTGCGGCCTCACTAGCCAGTCAGGCAGCGTTACTACGCCCCGCAGACCTGGCGGCCGCGGAGGCCAGAGTGACGTCATTACTTAGTAACATTGATGCTCTGAAGCAAGCAGTCAAACCAGCCCAGCCTGAAGTCGAGGAGAAGGTTAAAGAATTGTACAAACTATTAAAGCAAATAGATGGTGTGTCACATTCTGAAATCTTAGAGAGAATGGAGGCTCTGGAAGCTTTACATAATCAAGCAAGCAACTTTGGTAAATCTCTGACCGAATTGGAGACGTTACAAAGTACAATCGCTAGCGGCGTTCAAAACAACAAGGCGCTCCTTGAAGGAGTGCAAGAAGTCTTCGCGCACAATATTGATAGCTTACAAAAAGAAATCAAGAAATTAGACGAAAGAATCGGCAAACTGACACCTGCGTGA

Protein sequence:

>DPOGS200821-PA
MANPKYENLPGIAYDQPDVYETDDLPEADQPDPYEEEENSCIEQLHLSVKDSFNTFKGKFLTGNVDFSDRLSRKNRIGYRAGEWELAAEGDTETTLERYNRLRCEFSELLEEVTQKENKAIEIEKEEYSKLAAQINSTKKLLEELKLEEGEQIDPKAEKLKEYLSVSGKKQGDKVTAHLKLKPEVNLAQTARIAQLEHRLHKLEQAAGVRDEEAFRRLQTVTGEATLCGAAASLASQAALLRPADLAAAEARVTSLLSNIDALKQAVKPAQPEVEEKVKELYKLLKQIDGVSHSEILERMEALEALHNQASNFGKSLTELETLQSTIASGVQNNKALLEGVQEVFAHNIDSLQKEIKKLDERIGKLTPA-