Monarch geneset OGS2.0

DPOGS210246
TranscriptDPOGS210246-TA1428 bp
ProteinDPOGS210246-PA475 aa
Genomic positionDPSCF300196 + 650054-653359
RNAseq coverage463x (Rank: top 27%)
Annotation
HeliconiusHMEL0146620.077.73% 
BombyxBGIBMGA002379-TA2e-14663.81% 
DrosophilaCG12042-PA2e-10841.88% 
EBI UniRef50UniRef50_UPI00015B5FB72e-10745.17%UPI00015B5FB7 related cluster n=1 Tax=unknown RepID=UPI00015B5FB7
NCBI RefSeqXP_002050344.13e-11342.41%GJ20263 [Drosophila virilis]
NCBI nr blastpgi|1953832606e-11242.41%GJ20263 [Drosophila virilis]
NCBI nr blastxgi|910917961e-11243.40%PREDICTED: similar to dynactin P62 subunit [Tribolium castaneum]
Group
KEGG pathwaydvi:Dvir_GJ202631e-112 
 K10426 (DCTN4)maps-> Huntington's disease
    Vasopressin-regulated water reabsorption
InterPro domain[2-427] IPR0086037.9e-131Dynactin p62
Orthology groupMCL12425 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210246-TA
ATGGCTTATTTAACTCAGCCAGATTATGTTAAGTATATATGTTCCTGTGGACAACTTAAACCTATCACTAACTTATATTTCTGTCGGCATTGCTTAAAAATACGTTGCGGTTTTTGCATCTGCCATGAAGTGGACTCCCATTATTGCGCGAACTGTTTGGAGAACATGCCATCATCGGAGGCGAGGTTGAAGAAGAATCGCTGCAGTAGTTGCTTTGCCTGTCCCAGTTGTTTCCATACACTGTCGACTCGAGCGACCCTCGCCCGGGACCCCGCGGGGGGTGATGGCAAGCCTCCAAAGAAGATGTACTATTTATCGTGTTTCAACTGTCGCTGGACCTCGCGGGATGTTGGCATACCAGATCAGCCAGTGGCATCTGGAGGATGGCCGGAAAGAACGAATCCCTTCACGAGTCGGTTCAACCAACTCCTCGAATACTATAAAGCGATCGCGCAGCAAGAGAAGCAGGAGAAGTTAGACAAGGAGAGGAAGAAGTTTGTCACCAGAGGGAAATACATCAATCTCACGGACAAGACTGGTCTGACGGGAGCCATGGCTAGGAACATAGCCGGGCTACCCTCCAGCGAGAGTTCGTCGACGGTCATTAGTAATTTCGTGCCGAGCCAAGCCAGCGCGGATGTGGAGGAGCTGCCCGAGGACTACCACCTGAAGGAAATCGACTTGAAACAGATAACTCGTATGTCTCAGCGCACGTCGTGTCCTGAGCTGCAGCCGTCGCTGGCGTCTCGTCTGTCTCCGAGGCCTCGTGCGTTGTGTGTGAAGCGCAGTCAGCGGTGCCGCGCCTGTGACCATAACCTCACCAAGCCCGAGTACAACCCTGGATCCGTCAAGTTCAAGATCGAGCTGCTAGCCTTCTACCACGTGCCGGAAGTGAAGATAATATCGTACGAAACCATCAAGCCGGGCGGAACATCGGCCCTGCTGTTGAAGTTGTCCAACCCGACGGCACACGAGATGACCATCAGGATGCTGCAGCCTGAAGACGTACCGGACACTGAGGAGGAACACAAGACCATCGAGAAGACACTCGAGAAGTCACTCAGTTTGGAAAAGGACGTATCGTGGAACGCTTGTTTCCCTCGGCGACCCCCGCCGTCCCGCGCCGGGTCGGTCGGTCTGCCCGCCGTTAGCCTCACCCTCGCGCAGAGAGACGACGCCGCGGAGTACGACGACGAGCCGCCGGGAGATAATGATGATATAATTCTGTGGCGGAAGTCCAACAAGCTGGCTCTGAGGCTGAAGCTGACTACAGACCCTGAAGCCAGAGTTGGGACGGACGCATATTTCTCATTCGCTATACAATACTCGTACAGGAACACTGTGCCGCCCTCCGCCACCGTGGTGCAGCACAGCGACCACACGCTGTACACCACGGTCATACTGGATTTGGGGAAAGTCTGTGAATAA

Protein sequence:

>DPOGS210246-PA
MAYLTQPDYVKYICSCGQLKPITNLYFCRHCLKIRCGFCICHEVDSHYCANCLENMPSSEARLKKNRCSSCFACPSCFHTLSTRATLARDPAGGDGKPPKKMYYLSCFNCRWTSRDVGIPDQPVASGGWPERTNPFTSRFNQLLEYYKAIAQQEKQEKLDKERKKFVTRGKYINLTDKTGLTGAMARNIAGLPSSESSSTVISNFVPSQASADVEELPEDYHLKEIDLKQITRMSQRTSCPELQPSLASRLSPRPRALCVKRSQRCRACDHNLTKPEYNPGSVKFKIELLAFYHVPEVKIISYETIKPGGTSALLLKLSNPTAHEMTIRMLQPEDVPDTEEEHKTIEKTLEKSLSLEKDVSWNACFPRRPPPSRAGSVGLPAVSLTLAQRDDAAEYDDEPPGDNDDIILWRKSNKLALRLKLTTDPEARVGTDAYFSFAIQYSYRNTVPPSATVVQHSDHTLYTTVILDLGKVCE-