Monarch geneset OGS2.0

DPOGS201574
Transcript: DPOGS201574-TA, 1032 bp
Protein: DPOGS201574-PA, 343 aa
Genomic position: DPSCF300201 + 304692-310397
RNAseq coverage: 301x (Rank: top 37%)
Annotation
Heliconius: HMEL006321  0.0  94.46%
Bombyx: BGIBMGA006121-TA  1e-84  93.63%
Drosophila: Dlic-PC  2e-135  67.82%
EBI UniRef50: UniRef50_Q9VZ20  3e-133  67.82%  Dynein light intermediate chain, isoform A n=34 Tax=Coelomata RepID=Q9VZ20_DROME
NCBI RefSeq: XP_395283.3  1e-151  75.67%  PREDICTED: similar to Dlic2 CG1938-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|307193665  8e-154  75.95%  Cytoplasmic dynein 1 light intermediate chain 2 [Harpegnathos saltator]
NCBI nr blastx: gi|332025774  9e-151  76.49%  Cytoplasmic dynein 1 light intermediate chain 2 [Acromyrmex echinatior]
Group
Gene Ontology: GO:0003774  1.4e-186  motor activity
KEGG pathway: ame:411816  4e-151
  K10416 (DYNC1LI, DNCLI) maps-> Phagosome
    Vasopressin-regulated water reabsorption
InterPro domain: [7-342]  IPR008467  1.4e-186  Dynein 1 light intermediate chain
InterPro domain: [3-342]  IPR022780  1.1e-168  Dynein family light intermediate chain
Orthology group: MCL11052  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201574-TA
ATGGAGACAAACGGCCAAGCTAATGGGCTTAAGTCTAAAAAGAAAGATGACGCAGATTCGAAGGATAATTTGTGGTCTGCTATACTAGAGGAGGTTCAAAATCAGGGGAATACTAAATTACCTTCCAATAAGAACGTGCTAGTTCTGGGCGACAACGAAACTGGGAAGACAACTCTCATAGCTAAGTTACAGGGGGTTGAAGACCCTAAAAAAGGATCAGCTCTTGAATATGCATTCATAGATGTTAGAGATGAGTACCGTGATGACCACACTAGGCTCAGTGTATGGGTACTCGACGGAGATCCAGGTCACACAAATCTGCTCAAGTTTGCCCTTAACGAAGAAACATTCCCCCACACATTAGTGATGCTCACAGTGGCAATGACAACGCCATGGGGTATACTAGATCAGTTACAAAGCTGGGCGTCTGTGCTCGGTGATCACATAGACAAATTGGATCTCACCCCCGAACAAAGGTTACAAAGTAAGAAGCAGCAGGTACAGAAGTGGCAGAGGTATACGGAGCCGGGCGACGAGCTCGAGGCCAACGCCTCGTCCCCGATGAAGCGTTCCTCCCGCAACCTGTCCGACGACCTGGACAGCGACGATGAAGACAACCAGCTGCCCGAGGCTGTGCTCACAACTAACCTTGGCCTGGACATCGTGGTTGTTGCCACTAAGACTGACTACATGAGTACCCTGGAGAAGGAGCACGACTATCGCGACGAGCATTTCGACTTCATGCAGCAGTGGATCCGTCGGTTCTGTCTCCAATACGGAGCGGCGTTATTCTACACCAGCTCCAAAGAGGACAAGAACTGCGACCTGCTCTACAAATATCTCACACACCGGATATACGGTCTGCCATTCAGGACGCCGGCGCTCATAGTGGAGAAAGATGCTGTGCTCATACCGGCGGGTTGGGACAGCATGAAGAAGATCAGTATCCTGTACGAGAACATGCAGACGTGCCAGCCCGACGACTACTACAGAGACGCGATCGTGCAACCCGCTACCAGGAAGGTTGGTTGA

Protein sequence:

>DPOGS201574-PA
METNGQANGLKSKKKDDADSKDNLWSAILEEVQNQGNTKLPSNKNVLVLGDNETGKTTLIAKLQGVEDPKKGSALEYAFIDVRDEYRDDHTRLSVWVLDGDPGHTNLLKFALNEETFPHTLVMLTVAMTTPWGILDQLQSWASVLGDHIDKLDLTPEQRLQSKKQQVQKWQRYTEPGDELEANASSPMKRSSRNLSDDLDSDDEDNQLPEAVLTTNLGLDIVVVATKTDYMSTLEKEHDYRDEHFDFMQQWIRRFCLQYGAALFYTSSKEDKNCDLLYKYLTHRIYGLPFRTPALIVEKDAVLIPAGWDSMKKISILYENMQTCQPDDYYRDAIVQPATRKVG-
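
The record's lengths and sequences can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database entry. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the variable names are hypothetical. It checks that 1032 bp corresponds to 343 residues plus one stop codon, and translates the first 12 and last 5 codons copied from the nucleotide sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in the order T, C, A, G (third position varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Length check using the figures from the record:
# 1032 bp = 344 codons = 343 residues + 1 stop codon.
transcript_bp, protein_aa = 1032, 343
length_consistent = transcript_bp == 3 * (protein_aa + 1)

# First 12 codons and last 5 codons of DPOGS201574-TA, copied from the record.
head = translate("ATGGAGACAAACGGCCAAGCTAATGGGCTTAAGTCT")
tail = translate("AGGAAGGTTGGTTGA")

print(length_consistent, head, tail)  # True METNGQANGLKS RKVG*
```

Under these assumptions, the translated fragments agree with the start (METNGQANGLKS) and end (RKVG followed by the stop marker) of DPOGS201574-PA.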